Chroma's picture

Chroma

Chroma111

·

AI & ML interests

None yet

Recent Activity

liked a model 9 minutes ago

Xenova/bge-m3

liked a model 14 minutes ago

Xenova/multilingual-e5-large

liked a model 20 minutes ago

BAAI/bge-m3

View all activity

Organizations

None yet

upvoted 2 papers about 2 hours ago

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

Paper • 2510.11683 • Published Oct 13 • 14

SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Paper • 2512.05905 • Published 6 days ago • 18

upvoted a paper about 5 hours ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 2 days ago • 109

upvoted 3 collections about 7 hours ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 50 items • Updated about 7 hours ago • 135

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 2 days ago • 41

Olmo 3 Pre-training

All artifacts related to Olmo 3 pre-training • 10 items • Updated 2 days ago • 28

upvoted a collection about 8 hours ago

Qwen Family

LiteRT models in the Qwen Family • 3 items • Updated about 9 hours ago • 1

upvoted a collection 2 days ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated 2 days ago • 150

upvoted an article 2 days ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

2 days ago

•

64

upvoted a paper 2 days ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15, 2024 • 63

upvoted 4 collections 2 days ago

Qwen2-Audio

Audio-language model series based on Qwen2 • 4 items • Updated Jul 21 • 65

QwQ

Qwen with Questions • 6 items • Updated Jul 21 • 101

Qwen3-Embedding

6 items • Updated Jul 21 • 141

Qwen3-Coder

5 items • Updated Jul 31 • 137

upvoted a paper 2 days ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 265

upvoted 4 collections 2 days ago

Qwen-Image

7 items • Updated Sep 25 • 31

Apriel-1.5-15B-Thinker

3 items • Updated Oct 2 • 76

Apriel-1.6-15B-Thinker

2 items • Updated 1 day ago • 4

Devstral 2

A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 3 items • Updated 3 days ago • 29

upvoted a paper 2 days ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30 • 117