Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
On Vacation ποΈ
23
16
27
Ji Xie
PRO
sanaka87
Follow
Verah's profile picture
Stars321123's profile picture
PsychAnshul's profile picture
68 followers
Β·
23 following
https://horizonwind2004.github.io/
HorizonWind2004
HorizonWind2004
AI & ML interests
Generative Model
Recent Activity
liked
a dataset
9 days ago
Marlo-Z/SegLLM_dataset
reacted
to
their
post
with π₯
13 days ago
π Introducing VideoCoF: Unified Video Editing with a Temporal Reasoner (Chain-of-Frames)! Weβre excited to introduce VideoCoF, a unified framework for instruction-based video editing that enables temporal reasoning and ~4Γ video length extrapolation, trained with only 50k video pairs. π₯ π What makes VideoCoF different? π§ Chain-of-Frames reasoning , mimic human thinking process like Seeing β Reasoning β Editing to apply edits accurately over time without external masks, ensuring physically plausible results. π Strong length generalization β trained on 33-frame clips, yet supports multi-shot editing and long-video extrapolation (~4Γ). π― Unified fine-grained editing β Object Removal, Addition, Swap, and Local Style Transfer, with instance-level & part-level, spatial-aware control. β‘ Fast inference update π H100: ~20s / video with 4-step inference, making high-quality video editing far more practical for real-world use. π Links π Paper: https://arxiv.org/abs/2512.07469 π» Code: https://github.com/knightyxp/VideoCoF π€ Demo: https://huggingface.co/spaces/XiangpengYang/VideoCoF π§© Models: https://huggingface.co/XiangpengYang/VideoCoF π Project Page: https://videocof.github.io/ #VideoEditing #DiffusionModels #GenerativeAI #ComputerVision #AI
posted
an
update
14 days ago
π Introducing VideoCoF: Unified Video Editing with a Temporal Reasoner (Chain-of-Frames)! Weβre excited to introduce VideoCoF, a unified framework for instruction-based video editing that enables temporal reasoning and ~4Γ video length extrapolation, trained with only 50k video pairs. π₯ π What makes VideoCoF different? π§ Chain-of-Frames reasoning , mimic human thinking process like Seeing β Reasoning β Editing to apply edits accurately over time without external masks, ensuring physically plausible results. π Strong length generalization β trained on 33-frame clips, yet supports multi-shot editing and long-video extrapolation (~4Γ). π― Unified fine-grained editing β Object Removal, Addition, Swap, and Local Style Transfer, with instance-level & part-level, spatial-aware control. β‘ Fast inference update π H100: ~20s / video with 4-step inference, making high-quality video editing far more practical for real-world use. π Links π Paper: https://arxiv.org/abs/2512.07469 π» Code: https://github.com/knightyxp/VideoCoF π€ Demo: https://huggingface.co/spaces/XiangpengYang/VideoCoF π§© Models: https://huggingface.co/XiangpengYang/VideoCoF π Project Page: https://videocof.github.io/ #VideoEditing #DiffusionModels #GenerativeAI #ComputerVision #AI
View all activity
Organizations
None yet
sanaka87
's models
9
Sort:Β Recently updated
sanaka87/3DIS
Text-to-Image
β’
Updated
24 days ago
β’
65
β’
7
sanaka87/Show-o-RecA
Text-to-Image
β’
Updated
Nov 13
β’
15
β’
3
sanaka87/Show-o-512x512-RecA
Any-to-Any
β’
Updated
Nov 13
β’
13
β’
2
sanaka87/BAGEL-RecA
Any-to-Any
β’
Updated
Nov 13
β’
70
β’
26
sanaka87/Harmon-0.5B-RecA
Text-to-Image
β’
Updated
Nov 13
β’
15
β’
4
sanaka87/Harmon-1.5B-RecA
Any-to-Any
β’
Updated
Nov 13
β’
13
β’
2
sanaka87/Harmon-1.5B-RecA-plus
Text-to-Image
β’
Updated
Nov 13
β’
18
β’
3
sanaka87/OpenUni-RecA
Any-to-Any
β’
Updated
Sep 11
β’
22
β’
1
sanaka87/ICEdit-MoE-LoRA
Image-to-Image
β’
Updated
May 2
β’
328
β’
118