donut_experiment_bayesian_trial_17

This model is a fine-tuned version of naver-clova-ix/donut-base on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.4635
Bleu: 0.0675
Precisions: [0.8301886792452831, 0.7738095238095238, 0.7272727272727273, 0.6895424836601307]
Brevity Penalty: 0.0895
Length Ratio: 0.2930
Translation Length: 477
Reference Length: 1628
Cer: 0.7603
Wer: 0.8297

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Bleu	Precisions	Brevity Penalty	Length Ratio	Translation Length	Reference Length	Cer	Wer
0.8044	1.0	253	0.7112	0.0610	[0.7535641547861507, 0.6497695852534562, 0.5809018567639257, 0.5125]	0.0987	0.3016	491	1628	0.7647	0.8548
0.3513	2.0	506	0.5640	0.0632	[0.7908902691511387, 0.7089201877934272, 0.6449864498644986, 0.5801282051282052]	0.0934	0.2967	483	1628	0.7549	0.8416
0.2101	3.0	759	0.4754	0.0666	[0.8198757763975155, 0.744131455399061, 0.6802168021680217, 0.6217948717948718]	0.0934	0.2967	483	1628	0.7508	0.8282
0.0756	4.0	1012	0.4635	0.0675	[0.8301886792452831, 0.7738095238095238, 0.7272727272727273, 0.6895424836601307]	0.0895	0.2930	477	1628	0.7603	0.8297