Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
CodeGoat24 's Collections
UnifiedReward Flex
Pref-GRPO & UniGenBench
UnifiedReward Edit Models
UnifiedReward 2.0 Qwen3VL Models
UnifiedReward 2.0 Qwen2.5VL Models
UnifiedReward 1.0 Qwen2.5VL Models
UnifiedReward 1.0 Qwen2.5 Models GGUF
UnifiedReward 1.0 LLaVA Model
UnifiedReward Training Data

UnifiedReward Training Data

updated 1 day ago
Upvote
6

  • Unified Reward Model for Multimodal Understanding and Generation

    Paper • 2503.05236 • Published Mar 7, 2025 • 123

  • Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

    Paper • 2505.03318 • Published May 6, 2025 • 92

  • CodeGoat24/UnifiedReward-2.0-T2X-score-data

    Viewer • Updated Sep 26, 2025 • 337k • 353

  • CodeGoat24/ImageGen-CoT-Reward-5K

    Viewer • Updated Aug 29, 2025 • 5.54k • 92 • 1

  • CodeGoat24/LLaVA-Critic-113k

    Preview • Updated Sep 3, 2025 • 168

  • CodeGoat24/OIP

    Viewer • Updated Sep 3, 2025 • 21.4k • 61

  • CodeGoat24/ShareGPTVideo-DPO

    Viewer • Updated Sep 17, 2025 • 101k • 47

  • CodeGoat24/VideoDPO

    Viewer • Updated Sep 3, 2025 • 29k • 206

  • CodeGoat24/EvalMuse

    Preview • Updated Sep 3, 2025 • 128

  • CodeGoat24/VideoFeedback

    Viewer • Updated Sep 3, 2025 • 73.2k • 54

  • CodeGoat24/HPD

    Viewer • Updated Sep 3, 2025 • 72.7k • 65

  • CodeGoat24/LiFT-HRA

    Viewer • Updated Aug 29, 2025 • 19k • 73
Upvote
6
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs