fulong ye
Alon77777
AI & ML interests
vision and language, diffusion model, text-to-image generation, image-to-text generation, referring expression generation and comprehension
Recent Activity
upvoted
a
paper
about 19 hours ago
DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
upvoted
a
paper
4 months ago
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion
Transformer Models
upvoted
a
paper
4 months ago
UMO: Scaling Multi-Identity Consistency for Image Customization via
Matching Reward