Alina Lozovskaya

alozowski

AI & ML interests

NLP in all aspects

Organizations

alozowski's activity

New activity in open-llm-leaderboard/results 1 day ago

Model fail, re-eval request 😊

8
#885 opened about 1 month ago by dnhkng

How to calculate GPQA score?

4
#928 opened 6 days ago by JJaeuk

🚩 Report: Not working

1
#939 opened 2 days ago by Lyte

Why failed

1
#936 opened 3 days ago by DZgas
New activity in open-llm-leaderboard/requests 2 days ago

failed

6
#59 opened 6 days ago by legolasyiu
New activity in open-llm-leaderboard/open_llm_leaderboard 10 days ago

check-submit

5
#920 opened 10 days ago by alozowski
New activity in open-llm-leaderboard/results 10 days ago

Missing Llama 3.1 405B

1
#15 opened 13 days ago by lukestanley
New activity in open-llm-leaderboard/open_llm_leaderboard 13 days ago

Model evaluation failed

1
#916 opened 15 days ago by CoolSpring

bump-up-gradio

5
#918 opened 13 days ago by alozowski
New activity in open-llm-leaderboard/open_llm_leaderboard 16 days ago

Can't login error

2
#914 opened 16 days ago by legolasyiu

Upload HOUSHANG.pth

5
#912 opened 16 days ago by Huschang

IFEval reproduction problem

8
#911 opened 16 days ago by LamTungTran
New activity in open-llm-leaderboard/open_llm_leaderboard 17 days ago

Still pending

6
#900 opened 24 days ago by legolasyiu

Incomplete model

1
#909 opened 19 days ago by MaziyarPanahi

bump-up-transformers

5
#910 opened 17 days ago by alozowski
New activity in open-llm-leaderboard/open_llm_leaderboard 22 days ago

Model evaluations failed

4
#884 opened about 1 month ago by DavidGF

Incorrect ifeval benchmark

5
#879 opened about 1 month ago by DavidGF
New activity in open-llm-leaderboard/requests 22 days ago

all failed tests

1
#57 opened 24 days ago by legolasyiu
New activity in open-llm-leaderboard/open_llm_leaderboard 22 days ago

Model Failed: StableProse

3
#894 opened 27 days ago by nlpguy