·
AI & ML interests
DL/RL
Organizations
rs545837/PIPer-Stage1-SFT-ShareGPT
rs545837/PIPer-Stage2-RL-Final
rs545837/PIPer-Qwen3-8B-RL-envsetup
Reinforcement Learning
• 8B • Updated • 2
rs545837/cartoonization-scratch
Updated
rs545837/act_so100_test_kl8
rs545837/act_so100_test2_ckpts
Updated
0.9B • Updated 0.9B • Updated rs545837/TrelisLM-100M-Instruct-layer-hidden-pruned
Text Generation
• 0.1B • Updated rs545837/80M-0.050-cosmopedia
0.1B • Updated • 1
rs545837/80M-0.0040-cosmopedia
0.1B • Updated rs545837/80M-0.0060-cosmopedia
0.1B • Updated rs545837/TrelisLM-smollm-distil-2000
Text Generation
• 0.1B • Updated rs545837/TrelisLM-smollm-distil-750
Text Generation
• 0.1B • Updated rs545837/TrelisLM-smollm-distil-1000
Text Generation
• 0.1B • Updated • 1
rs545837/TrelisLM-smollm-distil-1500
Text Generation
• 0.1B • Updated • 12
rs545837/TrelisLM-smollm-distil-2
Text Generation
• 0.1B • Updated rs545837/TrelisLM-smollm-distil-1
Text Generation
• 0.1B • Updated • 3
rs545837/finetuned-messages-distil-smollm
Text Generation
• 0.1B • Updated rs545837/TrelisLM-smollm-distil
Text Generation
• 0.1B • Updated • 6
rs545837/TrelisLM-100M-Instruct
Text Generation
• 0.4B • Updated • 1
Text Generation
• 0.4B • Updated rs545837/speecht5_jenny_2000sample
Text-to-Audio
• 0.1B • Updated rs545837/speecht5_jenny_500sample
Text-to-Audio
• 0.1B • Updated rs545837/speecht5_jenny_500
Text-to-Audio
• 0.1B • Updated