Jingxuan Fan

fjxdaisy
AI & ML interests

None yet

Recent Activity

updated a dataset about 7 hours ago
fjxdaisy/finemath_part5_llama8b_actor_218step_rm
updated a dataset about 7 hours ago
fjxdaisy/finemath_part6_llama8b_actor_218step_rm
updated a dataset about 18 hours ago
fjxdaisy/finemath_part8_llama8b_actor_218step_rm_vllm

Organizations

reward-scaling, UserAssist

Papers 5

arxiv:2603.02225
arxiv:2510.09885
arxiv:2508.15815
arxiv:2501.14249

models 0

None public yet

datasets 8

fjxdaisy/finemath_part5_llama8b_actor_218step_rm

Viewer • Updated about 7 hours ago • 2.84k • 11

fjxdaisy/finemath_part6_llama8b_actor_218step_rm

Viewer • Updated about 7 hours ago • 2.83k • 8

fjxdaisy/finemath_part8_llama8b_actor_218step_rm_vllm

Viewer • Updated about 7 hours ago • 3.32k

fjxdaisy/finemath_part7_llama8b_actor_218step_rm_vllm

Viewer • Updated about 7 hours ago • 3.34k • 1

fjxdaisy/rlhfpipeline_mix1_llamafactory

Viewer • Updated 8 days ago • 244k • 26

fjxdaisy/summarize_from_feedback_comparisons_pref

Viewer • Updated 8 days ago • 179k • 7

fjxdaisy/shp-preferences

Viewer • Updated 8 days ago • 386k • 7

fjxdaisy/hh-rlhf

Viewer • Updated 8 days ago • 168k • 7