Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alexey Gorbatovski's picture
4 10

Alexey Gorbatovski

Myashka
SmartFlow's profile picture borisshapa's profile picture 21world's profile picture
·
  • Myashka

AI & ML interests

NLP Alignment

Recent Activity

authored a paper 6 days ago
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare
upvoted a paper 6 days ago
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare
submitted a paper 6 days ago
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare
View all activity

Organizations

None yet

commented a paper 4 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 84 •
3
New activity in agentica-org/DeepScaleR-Preview-Dataset 4 months ago

There are no answers for 6 samples

#4 opened 4 months ago by
Myashka
New activity in Myashka/CryptoNews_50_50 almost 2 years ago

Librarian Bot: Add language metadata for dataset

#2 opened almost 2 years ago by
librarian-bot
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs