Self-Hinting Language Models Enhance Reinforcement Learning
Baohao Liao
baohao
AI & ML interests
NLP
Recent Activity
updated
a model about 7 hours ago
baohao/byt5-base-optim_clean-final_fold5-0_ep10bs1x8lr1e-4_ep7 published
a model about 7 hours ago
baohao/byt5-base-optim_clean-final_fold5-0_ep10bs1x8lr1e-4_ep7 updated
a model 1 day ago
baohao/byt5-xl_clean_fold0_ep10bs4x4lr2e-4_best