unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated 6 days ago • 42.1k • 18
huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated Image-Text-to-Text • 36B • Updated 15 days ago • 39.2k • 222
view reply Amazing work you guys. I can run this local and hit your API for excess load and get consistent output in both places.
How to Alleviate Catastrophic Forgetting in LLMs Finetuning? Hierarchical Layer-Wise and Element-Wise Regularization Paper • 2501.13669 • Published Jan 23, 2025 • 1