
llm.c checkpoint: GPT-2 774M

This is a Hugging Face/safetensors conversion of an llm.c checkpoint: a 774M-parameter GPT-2 model trained on 150B tokens from the FineWeb dataset.

Training was conducted on a single 8xA100 80GB SXM node for ~6 days.

See the accompanying discussion in the llm.c GitHub repository for more information.
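
Since the conversion targets standard GPT-2 weights in safetensors format, the checkpoint should load with the stock transformers GPT-2 classes. A minimal usage sketch, assuming the repo id from this card works with `AutoModelForCausalLM` (the prompt and sampling settings are illustrative, not from this card):

```python
# Minimal generation sketch for the converted checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "mdouglas/llmc-gpt2-774M-150B"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # tensors are stored in BF16 (see below)
)

prompt = "The history of computing began"  # illustrative prompt
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_k=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```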

Format: Safetensors
Model size: 774M params
Tensor type: BF16
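
The parameter count and dtype can be checked from the safetensors header without loading the weights into memory. A small sketch, assuming the weights file uses the default name `model.safetensors` (an assumption, not confirmed by this card):

```python
# Count parameters from the safetensors header alone.
from huggingface_hub import hf_hub_download
from safetensors import safe_open

# Assumed default filename; adjust if the repo uses a different name.
path = hf_hub_download("mdouglas/llmc-gpt2-774M-150B", "model.safetensors")

total = 0
with safe_open(path, framework="pt") as f:
    for name in f.keys():
        shape = f.get_slice(name).get_shape()  # shape only, no tensor data read
        n = 1
        for dim in shape:
            n *= dim
        total += n

print(f"Total parameters: {total / 1e6:.0f}M")  # expected: ~774M
```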

Dataset used to train mdouglas/llmc-gpt2-774M-150B: FineWeb (HuggingFaceFW/fineweb)
