wang

wangxbx

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Qwen3-VL Technical Report

upvoted a paper 3 days ago

Kling-Omni Technical Report

upvoted a paper 3 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Organizations

None yet

upvoted 3 papers 3 days ago

liked a model 17 days ago

QuantTrio/DeepSeek-V3.2-AWQ

Text Generation • 685B • Updated 27 days ago • 3.77k • 8

upvoted 16 papers 6 months ago

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4 • 157

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 246

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19 • 60

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4 • 80

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 93

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 273

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3 • 39

SageAttention2++: A More Efficient Implementation of SageAttention2

Paper • 2505.21136 • Published May 27 • 45

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 53

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 73

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20 • 78

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20 • 76

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Paper • 2505.11594 • Published May 16 • 75

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29 • 93

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12 • 134

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 154
