Lia Kyle
liakyle
ยท
AI & ML interests
None yet
Recent Activity
updated
a Space 19 days ago
liakyle/liakyle published
a Space 19 days ago
liakyle/liakyle reacted
to
sergiopaniego's
post with ๐ 19 days ago
Meet the Post-Training Toolkit (PTT), which easily integrates with TRL via a single callback, by Aditya Challapally (@microsoft):
๐ Detects training issues early
๐ Lets you intervene safely
๐ Keeps long training runs stable, auditable & efficient
Microsoft blog: https://devblogs.microsoft.com/engineering-at-microsoft/diagnosing-instability-in-production-scale-agent-rl/
Integration guide: https://huggingface.co/docs/trl/main/en/ptt_integration
Code: https://github.com/microsoft/post-training-toolkit Organizations
None yet