Gibran Iqbal PRO

Jibbscript

AI & ML interests

None yet

Recent Activity

upvoted a paper about 19 hours ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published 5 days ago • 25

upvoted 3 papers 1 day ago

WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing

Paper • 2603.11593 • Published 6 days ago • 24

One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers

Paper • 2603.12245 • Published 6 days ago • 17

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published 6 days ago • 60

upvoted a collection 1 day ago

Mistral Small 4

A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 1 day ago • 45

upvoted a paper 3 days ago

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 6 days ago • 28

upvoted 5 papers 4 days ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published 9 days ago • 53

CodePercept: Code-Grounded Visual STEM Perception for MLLMs

Paper • 2603.10757 • Published 7 days ago • 13

LLM2Vec-Gen: Generative Embeddings from Large Language Models

Paper • 2603.10913 • Published 7 days ago • 38

Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards

Paper • 2603.09117 • Published 8 days ago • 9

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published 8 days ago • 74

upvoted 4 papers 6 days ago

SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published 16 days ago • 17

MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants

Paper • 2603.09652 • Published 8 days ago • 14

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Paper • 2603.09906 • Published 8 days ago • 68

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published 8 days ago • 45

upvoted a paper 7 days ago

Scale Space Diffusion

Paper • 2603.08709 • Published 9 days ago • 15

upvoted an article 7 days ago

Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge

Article • 9 days ago • 11

upvoted 2 papers 7 days ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published 8 days ago • 28

Towards a Neural Debugger for Python

Paper • 2603.09951 • Published 8 days ago • 5

upvoted a paper 8 days ago

Dynamic Chunking Diffusion Transformer

Paper • 2603.06351 • Published 12 days ago • 14