This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Unable to build the model tree, the base model loops to the model itself. Learn more.