This is a test model because the previous attempt failed.

Prompt format is: ChatML

Trained with regular LoRA (not quantized/QLoRA) and LoRA rank was 128 and Alpha set to 32. Trained for 5000 steps (0.11 epoch).

Uploaded model

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Model tree for mpasila/Viking-SlimSonnet-v0.2-LoRA-7B

Base model

Adapter

this model