F16 model size is same as the original F32 one?

by KSimulation - opened Aug 15

Aug 15

F16 model size is same as the original F32 one?

[23.8G]

Am I missing anything...

city96

Owner Aug 15

The original is in BF16 not FP32, so this is expected.

The FP16 GGUF is meant as a base for other quants (for example, the _K ones) via CPP code, although they're all currently created from the safetensors file.
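
To make the size point concrete, here is a rough sketch of the arithmetic. BF16 and FP16 both store each weight in 2 bytes, while FP32 uses 4, so converting BF16 to FP16 should not change the file size much. The weight count below is an assumption, back-derived from the 23.8G figure at 2 bytes per weight and treating G as 10^9 bytes; it is not a number stated in this thread, and metadata overhead is ignored.

```python
# Bytes used to store one weight in each floating-point format.
BYTES_PER_WEIGHT = {"F32": 4, "BF16": 2, "F16": 2}


def estimated_size_gb(num_weights: int, dtype: str) -> float:
    """Approximate file size in GB (10^9 bytes), ignoring headers and metadata."""
    return num_weights * BYTES_PER_WEIGHT[dtype] / 1e9


# Hypothetical weight count (assumption): 23.8e9 bytes / 2 bytes per weight.
NUM_WEIGHTS = 11_900_000_000

for dtype in ("F32", "BF16", "F16"):
    print(f"{dtype}: {estimated_size_gb(NUM_WEIGHTS, dtype):.1f} GB")
```

Under these assumptions the BF16 and F16 estimates match, and only an actual FP32 original would be roughly twice as large.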

city96 changed discussion status to closed Aug 15
