Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
neuralmagic
/
SmolLM-360M-Instruct-quantized.w8a8
like
0
Text Generation
Transformers
Safetensors
English
llama
int8
vllm
conversational
text-generation-inference
Inference Endpoints
8-bit precision
arxiv:
2210.17323
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
SmolLM-360M-Instruct-quantized.w8a8
1 contributor
History:
3 commits
alexmarques
Create README.md
e6cf3b6
verified
27 days ago
.gitattributes
1.52 kB
initial commit
28 days ago
README.md
6.41 kB
Create README.md
27 days ago
config.json
1.84 kB
Upload folder using huggingface_hub
28 days ago
generation_config.json
156 Bytes
Upload folder using huggingface_hub
28 days ago
merges.txt
466 kB
Upload folder using huggingface_hub
28 days ago
model.safetensors
504 MB
LFS
Upload folder using huggingface_hub
28 days ago
recipe.yaml
488 Bytes
Upload folder using huggingface_hub
28 days ago
special_tokens_map.json
655 Bytes
Upload folder using huggingface_hub
28 days ago
test_prompts.py
3.98 kB
Upload folder using huggingface_hub
28 days ago
tokenizer.json
2.1 MB
Upload folder using huggingface_hub
28 days ago
tokenizer_config.json
3.59 kB
Upload folder using huggingface_hub
28 days ago
vocab.json
801 kB
Upload folder using huggingface_hub
28 days ago