Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ContextualAI
/
archangel_sft-dpo_llama30b
like
0
Text Generation
Transformers
Safetensors
stanfordnlp/SHP
Anthropic/hh-rlhf
OpenAssistant/oasst1
English
llama
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
20437aa
archangel_sft-dpo_llama30b
Commit History
Upload tokenizer
20437aa
verified
xwinxu
commited on
Jan 11
Upload tokenizer
9a3a075
verified
xwinxu
commited on
Jan 11
Upload LlamaForCausalLM
1a77e16
verified
stas
commited on
Jan 11
Upload tokenizer
09bf240
verified
stas
commited on
Jan 11
Upload README.md with huggingface_hub
ef572e5
xwinxu
commited on
Jan 9
Upload tokenizer
25023ab
xwinxu
commited on
Jan 9
Upload README.md with huggingface_hub
29348b1
xwinxu
commited on
Jan 9
Upload README.md with huggingface_hub
7cd98da
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
284dcc0
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
f23369e
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
99d4ad3
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
47acf14
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
6671aeb
xwinxu
commited on
Dec 6, 2023
Upload LlamaForCausalLM
b19b8f4
stas
commited on
Dec 3, 2023
Upload tokenizer
2476754
stas
commited on
Dec 3, 2023
initial commit
601ec62
stas
commited on
Dec 3, 2023