Tarka Embed V1 Collection Efficient DFKD embeddings for language understanding • 4 items • Updated 9 days ago • 6
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR Oct 23 • 62
Toolformer: Language Models Can Teach Themselves to Use Tools Paper • 2302.04761 • Published Feb 9, 2023 • 12
PaddleOCR-VL Collection Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • 3 items • Updated Oct 17 • 21
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Paper • 2510.14880 • Published Oct 16 • 17
BERT Hash Nano Models Collection Set of BERT models with a modified embeddings layer • 3 items • Updated Oct 6 • 8
Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research Paper • 2510.06056 • Published Oct 7 • 5