Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiarui Yao's picture
2 20 1

Jiarui Yao

FlippyDora
research4pan's profile picture manh-linh's profile picture
·

AI & ML interests

None yet

Recent Activity

published a model about 16 hours ago
rb-dev/rubrics_train_data
upvoted a paper about 21 hours ago
Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models
submitted a paper about 21 hours ago
Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models
View all activity

Organizations

University of Illinois at Urbana-Champaign's profile picture RandomSampling's profile picture Embodied Reasoning Agent's profile picture EM-RAFT's profile picture Micro-RM's profile picture era-temporary's profile picture FANS - Formal Answer Selection Using Lean4's profile picture DPO-RM's profile picture CoE - Chain of Experts's profile picture tmp's profile picture PRM-CoT's profile picture UIUC ScaleML Lab's profile picture rb_dev's profile picture

FlippyDora 's models 64

FlippyDora/gemma-2b-it_lora_r128_lr5e-4_dpo

Updated Oct 23, 2024 • 1

FlippyDora/gemma-2b-it_lora_r32_lr5e-4_dpo

Updated Oct 22, 2024

FlippyDora/gemma-2b-it_lora_r16_lr5e-4_dpo

Updated Oct 22, 2024 • 1

FlippyDora/gemma-2b-it_lr1e-5_ultrafeedback

3B • Updated Oct 16, 2024
  • Previous
  • 1
  • 2
  • 3
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs