PTPReasoning's Collections
PTP Models
SFT Data
RL Data
Evaluation
PTP Models (updated Jul 29, 2025)
PTPReasoning/Qwen2.5-7B-Base-SFT-Clean-V2 • Text Generation • 8B • Updated Apr 23, 2025 • 3
PTPReasoning/Qwen2.5-7B-Base-SFT-Baseline-V2 • Text Generation • 8B • Updated Apr 23, 2025 • 4
PTPReasoning/Qwen2.5-7B-Base-RL-Clean-V2 • Text Generation • 8B • Updated May 3, 2025 • 5
PTPReasoning/Qwen2.5-7B-Base-RL-Baseline • Text Generation • 8B • Updated Apr 30, 2025 • 4
PTPReasoning/Llama-3.1-8B-SFT-Clean-V2 • Text Generation • 8B • Updated Jul 25, 2025
PTPReasoning/Llama-3.1-8B-SFT-Baseline • Text Generation • 8B • Updated Jul 25, 2025
PTPReasoning/Llama-3.1-8B-RL-Baseline-V2 • 8B • Updated Jul 26, 2025
PTPReasoning/Llama-3.1-8B-RL-Clean-V2 • 8B • Updated Jul 29, 2025