incorrect use of T5 model?

#39 opened by X-niper

We compared the fp16 T5 model in this repo against the fp32 T5 model with the same parameters, and the outputs for the same text prompt are not the same.
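A minimal sketch of the kind of comparison described above, assuming a T5 encoder loaded via `transformers` (the checkpoint name and the use of `T5EncoderModel` are placeholders, not necessarily what this repo does internally):

```python
import torch
from transformers import T5EncoderModel, T5Tokenizer

ckpt = "google/t5-v1_1-xxl"  # placeholder; substitute the text encoder actually used by this repo
tokenizer = T5Tokenizer.from_pretrained(ckpt)
model_fp16 = T5EncoderModel.from_pretrained(ckpt, torch_dtype=torch.float16).cuda().eval()
model_fp32 = T5EncoderModel.from_pretrained(ckpt, torch_dtype=torch.float32).cuda().eval()

prompt = "a photo of an astronaut riding a horse"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")

with torch.no_grad():
    emb_fp16 = model_fp16(**inputs).last_hidden_state.float()
    emb_fp32 = model_fp32(**inputs).last_hidden_state

# Quantify how far the two embeddings drift apart for the same prompt.
print("max abs diff:", (emb_fp16 - emb_fp32).abs().max().item())
print("cosine sim  :", torch.nn.functional.cosine_similarity(
    emb_fp16.flatten(), emb_fp32.flatten(), dim=0).item())
```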

A possible fix: cast the model to fp32 and run inference either under autocast(bf16) or directly in fp32.
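A sketch of that workaround, under the same assumptions as above (placeholder checkpoint, `T5EncoderModel`; bf16 autocast needs a bf16-capable GPU):

```python
import torch
from transformers import T5EncoderModel, T5Tokenizer

ckpt = "google/t5-v1_1-xxl"  # placeholder checkpoint, not necessarily this repo's encoder
tokenizer = T5Tokenizer.from_pretrained(ckpt)
model = T5EncoderModel.from_pretrained(ckpt, torch_dtype=torch.float32).cuda().eval()
inputs = tokenizer("a photo of an astronaut riding a horse", return_tensors="pt").to("cuda")

# Option 1: keep fp32 weights, let autocast run the heavy ops in bf16.
with torch.no_grad(), torch.autocast(device_type="cuda", dtype=torch.bfloat16):
    emb_bf16 = model(**inputs).last_hidden_state

# Option 2: plain fp32 inference, no autocast.
with torch.no_grad():
    emb_fp32 = model(**inputs).last_hidden_state
```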
