**Testing** ===================== **Hyperparameters** --------------- * **Temperature**: 0.9 * **Penalize repeat sequence**: 1.05 * **Consider N tokens for penalize**: 256 * **Penalize repetition of newlines** * **Top-K sampling**: 40 * **Top-P sampling**: 0.95 * **Min-P sampling**: 0.05 **LLaMAcpp Version** ------------------ * b3527-2-g2d5dd7bb **File** ------------------ * Cathallama-70B.Q4_0.gguf **Test Cases** -------------- | Test Case | Result | | --- | --- | | Ball on cup | OK | | Door window combination | OK | | Big duck small horse | OK | | JSON | OK | | Killers | OK | | Dragon | OK | | Poem | OK | | Jane faster | OK | | Shirts | OK | | Sisters | OK | | Python snake game | OK* | | Story | OK | *best I ever saw on local LLMs including Qwen2 72b at 8bpw, Llama 3 70b 8bpw Note: See sample generations on the main folder of the repo.