Yiren Song

yiren98

AI & ML interests

CV, NLP

Recent Activity

upvoted 2 papers 9 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 17 days ago • 58

Region-Constraint In-Context Generation for Instructional Video Editing

Paper • 2512.17650 • Published 16 days ago • 49

upvoted a paper 17 days ago

IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning

Paper • 2512.15635 • Published 17 days ago • 19

upvoted 2 papers 22 days ago

H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos

Paper • 2512.09406 • Published 25 days ago • 3

X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale

Paper • 2512.04537 • Published about 1 month ago • 6

upvoted 2 papers 23 days ago

Glance: Accelerating Diffusion Models with 1 Sample

Paper • 2512.02899 • Published Dec 2, 2025 • 29

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published 25 days ago • 46

upvoted 2 papers 30 days ago

PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design

Paper • 2512.04082 • Published Dec 3, 2025 • 13

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published about 1 month ago • 167

upvoted 3 papers about 1 month ago

The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment

Paper • 2511.20614 • Published Nov 25, 2025 • 37

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published Nov 25, 2025 • 27

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 52

upvoted a paper about 2 months ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published Nov 14, 2025 • 44

upvoted a paper 2 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 101

upvoted 2 papers 3 months ago

Code2Video: A Code-centric Paradigm for Educational Video Generation

Paper • 2510.01174 • Published Oct 1, 2025 • 33

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24, 2025 • 82

upvoted a paper 4 months ago

MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement

Paper • 2509.01977 • Published Sep 2, 2025 • 12

upvoted a paper 5 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 267

upvoted 2 papers 6 months ago

AnyI2V: Animating Any Conditional Image with Motion Control

Paper • 2507.02857 • Published Jul 3, 2025 • 12

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8, 2025 • 58
