Uploaded finetuned model

  • Developed by: Veiterr
  • License: apache-2.0
  • Finetuned from model: Veiterr/MNLP_M3_dpo_model_unsloth

This Qwen3 model was trained 2x faster with Unsloth and Hugging Face's TRL library.
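A minimal sketch of how the checkpoint might be loaded and queried with the `transformers` library. It assumes the repository id `Veiterr/MNLP_M3_dpo_model`, that the checkpoint is a standard Qwen3 causal language model, and that a recent `transformers` release with Qwen3 support and `torch` are installed. The helper names `load_model` and `generate` are hypothetical and are not part of this repository.

```python
# Hypothetical helpers for illustration; not shipped with the model.
MODEL_ID = "Veiterr/MNLP_M3_dpo_model"  # assumed repository id


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and the model weights in bfloat16.

    Imports are kept local so that defining the helpers does not
    require `torch` or `transformers` to be installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Assumption: the checkpoint is a causal LM stored in BF16.
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16
    )
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation for `prompt` and return the decoded text."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("What is the capital of France?")` would download the weights on first use and return the prompt followed by the model's continuation. For chat-style prompts, the tokenizer's chat template may be the better entry point.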

  • Weights format: Safetensors
  • Model size: 0.6B params
  • Tensor type: BF16