This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

I evaluated the model using my Freedom of Speech benchmark. It is indeed better than the original model in terms of censorshit, only lacking 3.5 points behind the abliterated Llama3 which so far holds the crown. Good job, I really liked Hermes2Solar overall performance, the only thing it was lacking in was freedom and now that's pretty much fixed by you.

image.png

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment