-
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 63 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277 -
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Paper • 2503.12605 • Published • 35 -
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Paper • 2506.13585 • Published • 273
Av
Avi66
·
AI & ML interests
ML Research , LLMs , Applications
MultiModality
Recent Activity
updated
a collection
about 2 months ago
TTS
updated
a collection
5 months ago
TTS
updated
a collection
5 months ago
Papers
Organizations
Vlm
-
XiaomiMiMo/MiMo-VL-7B-RL
Image-Text-to-Text • 8B • Updated • 1.51k • 168 -
mradermacher/Janus-Pro-7B-LM-GGUF
7B • Updated • 594 • 36 -
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 189k • 241 -
unsloth/Llama-3.2-90B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • 91B • Updated • 626 • 19
Spaces
Papers
-
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 63 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277 -
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Paper • 2503.12605 • Published • 35 -
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Paper • 2506.13585 • Published • 273
Tamil llm
Vlm
-
XiaomiMiMo/MiMo-VL-7B-RL
Image-Text-to-Text • 8B • Updated • 1.51k • 168 -
mradermacher/Janus-Pro-7B-LM-GGUF
7B • Updated • 594 • 36 -
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 189k • 241 -
unsloth/Llama-3.2-90B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • 91B • Updated • 626 • 19
TTS
Spaces