bn22
/

Nous-Hermes-2-SOLAR-10.7B-MISALIGNED

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Adding Evaluation Results

#2

by leaderboard-pr-bot - opened Mar 4

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

leaderboard-pr-bot

Mar 4

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Adding Evaluation Results3b31e4a9

Jun 12

•

I evaluated the model using my Freedom of Speech benchmark. It is indeed better than the original model in terms of censorshit, only lacking 3.5 points behind the abliterated Llama3 which so far holds the crown. Good job, I really liked Hermes2Solar overall performance, the only thing it was lacking in was freedom and now that's pretty much fixed by you.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment