Yifan Wang
AmberYifan
AI & ML interests
None yet
Recent Activity
published a model about 11 hours ago
AmberYifan/Qwen2.5-3B-Instruct-MATH-MARL-structure-gating
published a model 2 days ago
AmberYifan/Qwen2.5-3B-Instruct-GRPO
published a model 3 days ago
AmberYifan/Qwen2.5-3B-Instruct-MATH-MARL-structure
Organizations
LLMs Can Get "Brain Rot"!
- AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-sft
  Text Generation • 8B • Updated • 5
- AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en-sft
  Text Generation • 8B • Updated • 9 • 1
- AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-high-tweet-1m-en-sft
  Text Generation • 8B • Updated • 7
- AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft
  Text Generation • 8B • Updated • 6
DRIFT
Learning from Abundant User Dissatisfaction in Real-World Preference Learning
models
245
AmberYifan/Qwen2.5-3B-Instruct-MATH-MARL-structure-gating
Updated
AmberYifan/Qwen2.5-3B-Instruct-GRPO
Updated
AmberYifan/Qwen2.5-3B-Instruct-MATH-MARL-structure
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-test
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-mysw-diameter
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-mysw-diameter-relative
Updated
AmberYifan/qwen3-0.6b-p36-sft
Updated
AmberYifan/qwen3-0.6b-mmlu-sft
Updated • 19
AmberYifan/Llama-3.1-8B-Instruct-tulu-sft-30k
Updated
AmberYifan/Llama-3.1-8B-Instruct-tulu-sft-12k
Updated
datasets
28
AmberYifan/seed-data
Viewer • Updated • 491 • 31
AmberYifan/dsat-data
Viewer • Updated • 10.6k • 22
AmberYifan/sat-data
Viewer • Updated • 4.43k • 34
AmberYifan/mistral-v0.1-spin-hhrlhf
Viewer • Updated • 5.5k • 21
AmberYifan/sft-spin-filter
Updated • 2
AmberYifan/sft-spin-kcenter-5k
Viewer • Updated • 5.5k • 20
AmberYifan/gsm8k-sft
Viewer • Updated • 8.79k • 14
AmberYifan/sft-spin-v
Viewer • Updated • 50.5k • 16
AmberYifan/safeRLHF-SFT
Viewer • Updated • 83.4k • 13
AmberYifan/SPIN-trans-DPOformat
Viewer • Updated • 55k • 12