Purbesh Mitra's picture

Open to Work

2 2 1

Purbesh Mitra

purbeshmitra

https://sites.google.com/view/pmitra

AI & ML interests

Emergent reasoning in AI systems

Recent Activity

updated a dataset 6 days ago

purbeshmitra/ssb_teacher_data

updated a model 6 days ago

purbeshmitra/semantic-soft-bootstrapping

updated a collection 6 days ago

Semantic Soft Bootstrapping

View all activity

Organizations

updated a dataset 6 days ago

purbeshmitra/ssb_teacher_data

Viewer • Updated 6 days ago • 256 • 31

updated a model 6 days ago

purbeshmitra/semantic-soft-bootstrapping

Text Generation • Updated 6 days ago • 5

updated a collection 6 days ago

Semantic Soft Bootstrapping

A self-distillation based training method for long context reasoning in a single LLM without reinforcement learning • 3 items • Updated 6 days ago

published a dataset 6 days ago

purbeshmitra/ssb_teacher_data

Viewer • Updated 6 days ago • 256 • 31

updated a collection 6 days ago

Semantic Soft Bootstrapping

A self-distillation based training method for long context reasoning in a single LLM without reinforcement learning • 3 items • Updated 6 days ago

updated a collection 7 days ago

Semantic Soft Bootstrapping

A self-distillation based training method for long context reasoning in a single LLM without reinforcement learning • 3 items • Updated 6 days ago

published a model 9 days ago

purbeshmitra/semantic-soft-bootstrapping

Text Generation • Updated 6 days ago • 5

upvoted a collection 4 months ago

Qwen3

84 items • Updated Aug 6 • 1.48k

updated 2 models 5 months ago

purbeshmitra/vanillaGRPO

Text Generation • Updated Jul 7 • 11

purbeshmitra/MOTIF

Text Generation • Updated Jul 7 • 23 • 1

New activity in purbeshmitra/MOTIF 5 months ago

Add pipeline tag to model card

#1 opened 5 months ago by

New activity in purbeshmitra/vanillaGRPO 5 months ago

Add pipeline tag and update library_name

#1 opened 5 months ago by

updated a collection 5 months ago

MOTIF paper

MOTIF trained model and Vanilla GRPO trained model, compared in the paper. • 3 items • Updated 7 days ago • 1

published a model 5 months ago

purbeshmitra/vanillaGRPO

Text Generation • Updated Jul 7 • 11

updated 2 models 5 months ago

purbeshmitra/MOTIF

Text Generation • Updated Jul 7 • 23 • 1

purbeshmitra/MOTIF

Text Generation • Updated Jul 7 • 23 • 1