xziayro

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

MultiverseComputingCAI/HyperNova-60B

liked a dataset 2 days ago

yeates/omnipaint-bench

liked a model 2 days ago

huaichang/PersonaLive

upvoted a paper 2 days ago

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

Paper • 2602.21196 • Published 3 days ago • 3

upvoted a paper 4 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published 16 days ago • 185

upvoted a paper 5 days ago

PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation

Paper • 2507.16116 • Published Jul 22, 2025 • 13

upvoted a paper 6 days ago

DuoGen: Towards General Purpose Interleaved Multimodal Generation

Paper • 2602.00508 • Published 27 days ago • 4

upvoted 3 papers 7 days ago

LayerSync: Self-aligning Intermediate Layers

Paper • 2510.12581 • Published Oct 14, 2025 • 9

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers

Paper • 2602.16968 • Published 9 days ago • 11

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published 14 days ago • 43

upvoted 2 papers 8 days ago

SAM 3D Body: Robust Full-Body Human Mesh Recovery

Paper • 2602.15989 • Published 10 days ago • 11

Optimizing Few-Step Generation with Adaptive Matching Distillation

Paper • 2602.07345 • Published 20 days ago • 9

upvoted a paper 9 days ago

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Paper • 2602.07854 • Published 19 days ago • 10

upvoted 2 papers 10 days ago

FireRed-Image-Edit-1.0 Techinical Report

Paper • 2602.13344 • Published 15 days ago • 4

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published 12 days ago • 50

upvoted 2 articles 11 days ago

Mastering Tensor Dimensions in Transformers

Article • Jan 12, 2025 • 142

KV Cache from scratch in nanoVLM

Article • Jun 4, 2025 • 112

upvoted a paper 11 days ago

DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels

Paper • 2602.11715 • Published 15 days ago • 5

upvoted an article 11 days ago

Custom Kernels for All from Codex and Claude

Article • 15 days ago

upvoted a paper 11 days ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Paper • 2602.11858 • Published 15 days ago • 58

upvoted 3 papers 13 days ago

Voxtral Realtime

Paper • 2602.11298 • Published 16 days ago • 16

T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization

Paper • 2602.12262 • Published 15 days ago • 8

PISCO: Precise Video Instance Insertion with Sparse Control

Paper • 2602.08277 • Published 18 days ago • 11
