Jiarui Yao
FlippyDora
2 followers · 22 following
AI & ML interests: None yet
Recent Activity
published a model about 16 hours ago: rb-dev/rubrics_train_data
upvoted a paper about 21 hours ago: Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models
submitted a paper about 21 hours ago: Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models
Organizations
FlippyDora's models (64)
FlippyDora/gemma-2b-it_lora_r128_lr5e-4_dpo • Updated Oct 23, 2024 • 1
FlippyDora/gemma-2b-it_lora_r32_lr5e-4_dpo • Updated Oct 22, 2024
FlippyDora/gemma-2b-it_lora_r16_lr5e-4_dpo • Updated Oct 22, 2024 • 1
FlippyDora/gemma-2b-it_lr1e-5_ultrafeedback • 3B • Updated Oct 16, 2024