Stoney Kang
sikang99
AI & ML interests
Remote Control based on Vision
Recent Activity
upvoted a paper about 2 hours ago
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
upvoted a paper about 2 hours ago
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
upvoted a paper about 2 hours ago
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head