Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
RLHF4MATH
Activity Feed
Follow
2
AI & ML interests
None defined yet.
Team members
1
RLHF4MATH
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Articles
1231czx
updated
a dataset
almost 2 years ago
RLHF4MATH/Gemma-7B-1.1-it-iter1-random-pairs
Viewer
•
Updated
Jul 27, 2024
•
19k
•
6
•
1
1231czx
updated
6 models
almost 2 years ago
RLHF4MATH/CodeGemma-7B-it-M-DPO
Text Generation
•
9B
•
Updated
Jul 26, 2024
•
4
RLHF4MATH/Gemma-7B-it-M-DPO
Text Generation
•
9B
•
Updated
Jul 26, 2024
•
36
RLHF4MATH/Gemma-9B-it-SFT3epoch
Text Generation
•
9B
•
Updated
Jul 26, 2024
•
4
RLHF4MATH/Mistral-7B-pt-SFT2epoch
Text Generation
•
7B
•
Updated
Jul 26, 2024
•
1
RLHF4MATH/Code-Gemma-7B-it-SFT3epoch
Text Generation
•
9B
•
Updated
Jul 26, 2024
•
18
•
1
RLHF4MATH/Gemma-7B-it-SFT3epoch
Text Generation
•
9B
•
Updated
Jul 26, 2024
•
4
1231czx
updated
5 datasets
almost 2 years ago
RLHF4MATH/SFT_510K
Viewer
•
Updated
Jul 25, 2024
•
512k
•
8
•
1
RLHF4MATH/prompt_iter4
Viewer
•
Updated
Jul 25, 2024
•
20.8k
•
12
RLHF4MATH/prompt_iter3
Viewer
•
Updated
Jul 25, 2024
•
20.8k
•
5
RLHF4MATH/prompt_iter2
Viewer
•
Updated
Jul 25, 2024
•
20.8k
•
5
RLHF4MATH/prompt_iter1
Viewer
•
Updated
Jul 25, 2024
•
20.8k
•
6
1231czx
updated
2 models
almost 2 years ago
RLHF4MATH/Gemma-2-9B-it-M-DPO
Text Generation
•
9B
•
Updated
Jul 15, 2024
•
5
RLHF4MATH/Mistral-7B-pt-M-DPO
Text Generation
•
7B
•
Updated
Jul 13, 2024
•
3