NeMo
English
nvidia
rlhf
llama3
zhilinw committed on
Commit 2f90845 (1 parent: 9add37e)

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -34,8 +34,8 @@ You can train the model using [NeMo Aligner](https://github.com/NVIDIA/NeMo-Alig
 
  ## References
 
+ * [HelpSteer2](https://arxiv.org/abs/2406.08673)
  * [PPO method](https://arxiv.org/abs/2203.02155)
- * [HelpSteer](https://arxiv.org/abs/2311.09528)
  * [Llama 3: Open Foundation and Instruct Models](https://ai.meta.com/blog/meta-llama-3/) <br>
  * [Meta's Llama 3 Webpage](https://llama.meta.com/llama3/) <br>
  * [Meta's Llama 3 Model Card](https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md) <br>