AnyModal/Image-Captioning-Llama-3.2-1B
Tags: Image-to-Text, Safetensors, AnyModal/flickr30k, English, AnyModal, vlm, vision, multimodal
License: mit
main: Image-Captioning-Llama-3.2-1B (62.5 MB)
1 contributor, History: 10 commits
ritabratamaiti: Update README.md (7697704, verified), about 1 year ago
Name                 Size            Last commit                           Updated
language_model       -               Upload folder using huggingface_hub   about 1 year ago
.gitattributes       1.52 kB         initial commit                        about 1 year ago
README.md            4.53 kB         Update README.md                      about 1 year ago
input_tokenizer.pt   39.9 MB (xet)   Upload folder using huggingface_hub   about 1 year ago