view article Article Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement 7 days ago • 11
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 • 171
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 7 days ago • 261
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7 • 65
ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset Paper • 2405.10004 • Published May 16, 2024 • 1
Quantifying the Carbon Emissions of Machine Learning Paper • 1910.09700 • Published Oct 21, 2019 • 20
MMMModal -- Multi-Images Multi-Audio Multi-turn Multi-Modal Paper • 2402.11297 • Published Feb 17, 2024 • 2