3.6 TFLOPS
João Palmeiro
joaompalmeiro
11 followers · 213 following
joaopalmeiro
joaopalmeiro
joaopalmeiro.bsky.social
AI & ML interests
None yet
Recent Activity
liked a model about 17 hours ago
Qwen/Qwen3-VL-Embedding-2B
liked a model about 17 hours ago
Qwen/Qwen3-VL-Embedding-8B
reacted to Akhil-Theerthala's post with 👍 2 days ago
Is it better to show a model too many images once (Diversity), or extract as much information from a small set of images? I have always wanted to do an ablation study on this and recently I got the chance to do exactly that. Why? In applied domains like robotics, manufacturing, or banking, we rarely have the luxury of internet-scale diverse image datasets. We are often "Data Poor" in terms of diversity but "Data Rich" in depth. The takeaway? Density is efficient for facts but dangerous for reasoning (logical collapse) if you don't have larger scale data. More details: https://huggingface.co/blog/Akhil-Theerthala/diversity-density-for-vision-language-models
Organizations
joaompalmeiro's activity
liked 2 models about 17 hours ago
Qwen/Qwen3-VL-Embedding-2B • Image-to-Text • 2B • Updated about 22 hours ago • 6 • 61
Qwen/Qwen3-VL-Embedding-8B • Image-to-Text • 8B • Updated about 22 hours ago • 67
liked a model 17 days ago
jinaai/jina-vlm • Image-Text-to-Text • 2B • Updated Dec 5, 2025 • 3.8k • 93
liked a dataset 24 days ago
axel-darmouni/anychart-vqa • Viewer • Updated May 9, 2025 • 300 • 12 • 8
liked a model 29 days ago
google/embeddinggemma-300m • Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 672k • 1.4k
liked a dataset about 1 month ago
eyehole/VisChainBench • Updated May 20, 2025 • 87 • 2
liked 3 models 2 months ago
nvidia/nemotron-graphic-elements-v1 • Object Detection • Updated 29 days ago • 1 • 13
utter-project/TowerVision-9B • Image-Text-to-Text • 10B • Updated Nov 5, 2025 • 27 • 4
utter-project/TowerVision-2B • Image-Text-to-Text • 3B • Updated Nov 5, 2025 • 34 • 3
liked a dataset 2 months ago
TIGER-Lab/VisPlotBench • Viewer • Updated Nov 3, 2025 • 888 • 242 • 2
liked a model 4 months ago
moondream/moondream3-preview • Image-Text-to-Text • 9B • Updated Oct 9, 2025 • 6.43k • 538
liked 2 datasets 4 months ago
jupyter-agent/jupyter-agent-dataset • Viewer • Updated Sep 10, 2025 • 95.8k • 1.47k • 154
HuggingFaceM4/FineVision • Viewer • Updated Oct 21, 2025 • 24.2M • 102k • 463
liked a Space 4 months ago
FineVision: Open Data is All You Need 📝 • Running • 215
A new open-source dataset for training VLMs
liked a model 4 months ago
OpenGVLab/InternVL3_5-1B • Image-Text-to-Text • 1B • Updated Aug 29, 2025 • 39.7k • 21
liked a model 5 months ago
YannQi/R-4B • Image-Text-to-Text • 5B • Updated Sep 4, 2025 • 40.6k • 176
liked 3 datasets 5 months ago
yubo2333/MMLongBench-Doc • Viewer • Updated Nov 6, 2025 • 1.09k • 2.14k • 22
MMMU/MMMU_Pro • Viewer • Updated Mar 8, 2025 • 5.19k • 8.28k • 41
MMMU/MMMU • Viewer • Updated Sep 19, 2024 • 11.6k • 52.5k • 308
liked a model 5 months ago
HuggingFaceTB/SmolVLM2-2.2B-Instruct • Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 108k • 296