Александр Петров

tmp-123

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a dataset about 3 hours ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Apr 22, 2025 • 90.1k • 7.2k • 1.08k

upvoted a paper about 15 hours ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 10 days ago • 381

liked a dataset 2 days ago

chaitanya-yadav/vehicle-predictive-maintenance

Preview • Updated 1 day ago • 58

upvoted a paper 4 days ago

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Paper • 2604.00830 • Published 10 days ago • 14

liked a dataset 4 days ago

MHuangX/LAION-Beyond

Preview • Updated 2 days ago • 10.9k

upvoted a paper 5 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 9 days ago • 348

upvoted a paper 8 days ago

On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models

Paper • 2603.27481 • Published 14 days ago • 35

liked a dataset 9 days ago

legacy-datasets/wikipedia

Updated Mar 11, 2024 • 91.2k • 618

upvoted a paper 11 days ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 13 days ago • 339

liked a model 11 days ago

mulemp/kcworld

Updated about 17 hours ago

upvoted 2 papers 12 days ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 19 days ago • 61

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 23 days ago • 330

upvoted a paper 16 days ago

PixelSmile: Toward Fine-Grained Facial Expression Editing

Paper • 2603.25728 • Published 17 days ago • 117

upvoted a paper 25 days ago

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published 27 days ago • 152

upvoted a paper 28 days ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

liked 2 models about 1 month ago

zai-org/GLM-5

Text Generation • 754B • Updated 7 days ago • 491k • 1.98k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 3.48M • 13.2k

upvoted 3 papers about 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 216
