YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

This model is an ExecuTorch-compatible, quantized variant of Meta’s Llama 3.2 3B Instruct, using the official SpinQuant INT4/EO8 format provided by Meta for efficient on-device inference.

Based on: Meta Llama 3.2 Quantization method: SpinQuant (INT4 weights / EO8 activations) by Meta

For more information, refer to the Llama 3.2 Model Card.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support