ajinkyakale 's Collections papers
updated
De-Diffusion Makes Text a Strong Cross-Modal Interface
Paper
• 2311.00618
• Published
• 23
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper
• 2311.10093
• Published
• 58
Using Human Feedback to Fine-tune Diffusion Models without Any Reward
Model
Paper
• 2311.13231
• Published
• 28
Diffusion Model Alignment Using Direct Preference Optimization
Paper
• 2311.12908
• Published
• 49
Visual In-Context Prompting
Paper
• 2311.13601
• Published
• 18
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion
Models
Paper
• 2312.00079
• Published
• 17
Scaling Laws of Synthetic Images for Model Training ... for Now
Paper
• 2312.04567
• Published
• 9
VILA: On Pre-training for Visual Language Models
Paper
• 2312.07533
• Published
• 21
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip
Connection Editing
Paper
• 2312.11392
• Published
• 20
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
Paper
• 2312.13913
• Published
• 24
Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model
Paper
• 2312.13252
• Published
• 27
DreamDistribution: Prompt Distribution Learning for Text-to-Image
Diffusion Models
Paper
• 2312.14216
• Published
• 12
One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and
Erasing Applications
Paper
• 2312.16145
• Published
• 10
Unsupervised Universal Image Segmentation
Paper
• 2312.17243
• Published
• 20
Prompt Expansion for Adaptive Text-to-Image Generation
Paper
• 2312.16720
• Published
• 6
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Paper
• 2312.16862
• Published
• 31
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision,
Language, Audio, and Action
Paper
• 2312.17172
• Published
• 30
Improving fine-grained understanding in image-text pre-training
Paper
• 2401.09865
• Published
• 18