AI & ML interests
None yet
Organizations
None yet
Mostafa8Mehrabi/llama-1b-3blocks-BI-pruned-10-epochs-KD-bookcorpus-activeLearning-OnPolicy-SFT-CoT-merged
Text Generation
• 1B • Updated
• 3
Mostafa8Mehrabi/llama-1b-sft-cot-multiturn_active_learning-merged
Text Generation
• 1B • Updated
• 4
Mostafa8Mehrabi/llama-1b-multiturn-SFT-CoT-merged
Text Generation
• 1B • Updated
• 4
Mostafa8Mehrabi/llama-1b-3blocks-BI-pruned-10-epochs-KD-bookcorpus-SFT-CoT-merged
Text Generation
• 1B • Updated
• 1
Mostafa8Mehrabi/llama-1b-3blocks-BI-pruned-KD-bookcorpus-improved_epoch_10
Text Generation
• 1B • Updated
• 1
Mostafa8Mehrabi/qwen3-30m-tinystories-final
Text Generation
• 34.7M • Updated
• 3
Mostafa8Mehrabi/qwen3-30m-tinystories-checkpoints
Updated
Mostafa8Mehrabi/qwen3-30m-fp16
Text Generation
• 34.7M • Updated
• 4
• 1
Mostafa8Mehrabi/qwen3-71M-c4-final
Text Generation
• 71.6M • Updated
• 8
Mostafa8Mehrabi/qwen3-71m-c4-checkpoints
Updated
Mostafa8Mehrabi/qwen3-50m-c4-final_test_H200
Text Generation
• 71.6M • Updated
Mostafa8Mehrabi/qwen3-50m-c4-checkpoints_test_H200
Updated
Mostafa8Mehrabi/qwen3-50m-c4-final-test-version
Text Generation
• 71.6M • Updated
• 3
Mostafa8Mehrabi/qwen3-50m-c4-checkpoints-test-version
Updated
Mostafa8Mehrabi/qwen3-50m-insomnia-therapist
Text Generation
• 71.6M • Updated
• 6
Mostafa8Mehrabi/qwen3-50m-fp16
Text Generation
• 71.6M • Updated
• 1
• 1
Mostafa8Mehrabi/qwen3-50m-storyteller
Text Generation
• 71.6M • Updated
Mostafa8Mehrabi/qwen3-50m-fp32
71.6M • Updated
• 5
• 1
Mostafa8Mehrabi/custom-57m-language-model
Text Generation
• Updated
• 1
Mostafa8Mehrabi/deepseek_v3_mini_50m
Updated
Mostafa8Mehrabi/deepseek-v3-mini-wikitext103-lora-merged
0.2B • Updated
• 1
Mostafa8Mehrabi/deepseek-v3-mini
Text Generation
• Updated
• 9
• 1
Mostafa8Mehrabi/llama-1b-3blocks-PPL-pruned-10-epochs-KD-ptb-SFT-CoT-merged
Text Generation
• 1B • Updated
Mostafa8Mehrabi/llama-1b-3blocks-PPL-pruned-10-epochs-KD-ptb
1B • Updated
Mostafa8Mehrabi/llama-1b-pruned-3blocks-ppl-therapy-calibration
Text Generation
• 1B • Updated
Mostafa8Mehrabi/llama-1b-pruned-3blocks-bi-Insomnia-ChatBot-SFT-CoT-merged
Text Generation
• 1B • Updated
Mostafa8Mehrabi/llama-1b-3blocks-BI-pruned-10-epochs-KD-ptb-SFT-CoT-merged
Text Generation
• 1B • Updated
• 13
Mostafa8Mehrabi/llama-1b-3blocks-BI-pruned-10-epochs-KD-ptb
Mostafa8Mehrabi/llama-1b-pruned-3blocks-bi-therapy-calibration
Text Generation
• 1B • Updated
• 2
Mostafa8Mehrabi/llama-1b-3blocks-taylor-plus-pruned-10-epochs-KD-ptb-SFT-CoT-merged
Text Generation
• 1B • Updated
• 1