70B fine-tuned coding model?
Is it possible to get a 70B model fine-tuned on a coding dataset? I think the community would really benefit from it. I actually have a dataset like this if you need one. Link below:
https://huggingface.co/datasets/rombodawg/MegaCodeTraining112k
This is a great idea. Thanks!
hello
@rombodawg We will be working on it and let you know when it's available.
@hunkim
Just so you are aware, this is the newest dataset I've released. It's an updated version of the one I linked before:
https://huggingface.co/datasets/rombodawg/LosslessMegaCodeTrainingV2_1m_Evol_Uncensored
@hunkim
I've recently made an updated, even better coding dataset. I hate to make you restart if you've already started training the model, but this dataset should produce a much better model. Just wanted to make you aware.
https://huggingface.co/datasets/rombodawg/LosslessMegaCodeTrainingV3_2.2m_Evol