Is this instruction following model?

by rjmehta - opened Dec 3, 2023

Discussion

rjmehta

Dec 3, 2023

The Avg looks pretty impressive. Is this pretrained, finetuned, or instruction tuned model?

bjoernp

Disco Research org Dec 3, 2023

•

edited Dec 3, 2023

Yeah it's finetuned on a large set of diverse tasks (~400 million tokens x 3 epochs). Thanks for your interest in our model! Also check out DiscoResearch/DiscoLM-120b for even stronger benchmark performance :)

bjoernp changed discussion status to closed Dec 3, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment