- TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
  Paper • 2510.14972 • Published • 35
- LightMem: Lightweight and Efficient Memory-Augmented Generation
  Paper • 2510.18866 • Published • 114
- Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning
  Paper • 2510.19338 • Published • 115
- The Smol Training Playbook
  3.03k • The secrets to building world-class LLMs
Jonatan Borkowski
j14i
AI & ML interests
None yet
Recent Activity
reacted to Ujjwal-Tyagi's post 1 day ago
Public reports allege that Anthropic gobbled up trillions of tokens of copyrighted material and public data to build their castle. Now that they're sitting on top, they're begging for special laws to protect their profits while pulling the ladder up behind them.
But the hypocrisy meter just broke! They are accusing Chinese labs like DeepSeek, Minimax, and Kimi of "huge distillation attacks." The reality is that you can't just loot the entire internet's library, lock the door, and then sue everyone else for reading through the window. Stop trying to gatekeep the tech you didn't own in the first place. Read the complete article on it: https://huggingface.co/blog/Ujjwal-Tyagi/the-dark-underbelly-of-anthropic
liked a dataset 1 day ago
peteromallet/dataclaw-peteromallet