Kai Ruan's picture

Kai Ruan

6cf

·

x66ccff

AI & ML interests

AI for Science

Recent Activity

liked a model 16 days ago

cerebras/GLM-4.5-Air-REAP-82B-A12B

liked a model 16 days ago

zai-org/GLM-4.5-Air

liked a model 16 days ago

unsloth/GLM-4.7-GGUF

View all activity

Organizations

upvoted an article 2 months ago

Article

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Jun 11, 2025

•

126

upvoted a paper 2 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 147

upvoted a collection 5 months ago

DeepSeek-V3.1

DeepSeek's new 3.1 update to their V3 models! • 6 items • Updated 14 days ago • 8

upvoted 2 papers 7 months ago

Scaling Diffusion Transformers Efficiently via μP

Paper • 2505.15270 • Published May 21, 2025 • 35

Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Paper • 2506.09250 • Published Jun 10, 2025 • 27

upvoted a collection 7 months ago

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 67 items • Updated 8 days ago • 295

upvoted 2 papers 8 months ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5, 2025 • 80

Benchmarking LLMs' Swarm intelligence

Paper • 2505.04364 • Published May 7, 2025 • 20

upvoted a paper 10 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

upvoted a paper 11 months ago

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47

upvoted a paper 12 months ago

Discovering symbolic expressions with parallelized tree search

Paper • 2407.04405 • Published Jul 5, 2024 • 1

upvoted a collection 12 months ago

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 826

upvoted 4 collections about 1 year ago

IdeaWhiz

3 items • Updated Jan 9, 2025 • 3

DeepSeek-V3

4 items • Updated Nov 27, 2025 • 278

Models for Open Hands (Open Devin)

Models for Open Hands(Open Devin) trained on the Devinator Dataset. • 3 items • Updated Oct 25, 2024 • 3

LiveIdeaBench

4 items • Updated May 8, 2025 • 5

upvoted a paper about 1 year ago

LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context

Paper • 2412.17596 • Published Dec 23, 2024 • 6

upvoted 2 collections about 1 year ago

YuLan-Mini

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 6 items • Updated Apr 14, 2025 • 16

DeepSeek-VL2

5 items • Updated Nov 27, 2025 • 79

upvoted an article about 1 year ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Dec 4, 2024

•

80