neuralmagic/SmolLM-1.7B-Instruct-quantized.w8a8
Tags: Text Generation, Transformers, Safetensors, English, llama, int8, vllm, conversational, text-generation-inference, Inference Endpoints, 8-bit precision
arxiv: 2210.17323
License: apache-2.0
Branch: main
1 contributor
History: 3 commits
Latest commit: alexmarques, "Create README.md", 4d31352 (verified), 27 days ago
File                      Size        Last commit                            Updated
.gitattributes            1.52 kB     initial commit                         27 days ago
README.md                 6.4 kB      Create README.md                       27 days ago
config.json               1.84 kB     Upload folder using huggingface_hub    27 days ago
generation_config.json    132 Bytes   Upload folder using huggingface_hub    27 days ago
merges.txt                466 kB      Upload folder using huggingface_hub    27 days ago
model.safetensors (LFS)   2.01 GB     Upload folder using huggingface_hub    27 days ago
recipe.yaml               173 Bytes   Upload folder using huggingface_hub    27 days ago
special_tokens_map.json   655 Bytes   Upload folder using huggingface_hub    27 days ago
tokenizer.json            2.1 MB      Upload folder using huggingface_hub    27 days ago
tokenizer_config.json     3.59 kB     Upload folder using huggingface_hub    27 days ago
vocab.json                801 kB      Upload folder using huggingface_hub    27 days ago