Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ContextualAI
/
archangel_sft-dpo_llama13b
like
0
Text Generation
Transformers
Safetensors
stanfordnlp/SHP
Anthropic/hh-rlhf
OpenAssistant/oasst1
English
llama
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
3fcda91
archangel_sft-dpo_llama13b
Commit History
Upload LlamaForCausalLM
3fcda91
xwinxu
commited on
Jan 8
Upload tokenizer
bfb4ade
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
0167681
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
05af523
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
805ab86
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
dd912b4
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
77128aa
xwinxu
commited on
Dec 6, 2023
Upload README.md with huggingface_hub
eb09ec1
xwinxu
commited on
Dec 6, 2023
Upload LlamaForCausalLM
8e8020c
stas
commited on
Dec 3, 2023
Upload tokenizer
c3f8fd1
stas
commited on
Dec 2, 2023
initial commit
082aaba
stas
commited on
Dec 2, 2023