Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Dan Zhang's picture
2 4

Dan Zhang

zd21
yangwang92's profile picture Nitral-AI's profile picture 21world's profile picture
ยท
https://zhangdan0602.github.io/
  • ZhangDa57152861
  • zhangdan0602

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago
zd21/qwen2.5-7b-td2
published a model about 1 month ago
zd21/qwen2.5-7b-td2
updated a model about 1 month ago
zd21/qwen2.5-7b-baseline-prm
View all activity

Organizations

None yet

zd21 's collections 1

TDRM
Learning Smooth Reward Models with Temporal Difference for LLM RL and Inference
  • zd21/DeepSeek-TD0-PRM

    Updated Jul 12
  • zd21/DeepSeek-TD2-PRM

    Updated Jul 12
  • zd21/DeepSeek-ScalarPRM

    Updated Jul 12
  • zd21/DeepSeek-ScalarORM

    Updated Jul 12
TDRM
Learning Smooth Reward Models with Temporal Difference for LLM RL and Inference
  • zd21/DeepSeek-TD0-PRM

    Updated Jul 12
  • zd21/DeepSeek-TD2-PRM

    Updated Jul 12
  • zd21/DeepSeek-ScalarPRM

    Updated Jul 12
  • zd21/DeepSeek-ScalarORM

    Updated Jul 12
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs