147 3

Noah

noahml

https://researchpod.app

researchpodapp

AI & ML interests

None yet

Recent Activity

commentedon a paper about 3 hours ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

commentedon a paper about 3 hours ago

SciOrch: Learning to Orchestrate Expert LLMs for Solving Frontier Multimodal Scientific Reasoning Tasks

commentedon a paper about 3 hours ago

FlowBender: Feedback-Aware Training for Self-Correcting Conditional Flows

View all activity

Organizations

None yet

commented 5 papers about 3 hours ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

Paper • 2606.18531 • Published 4 days ago • 3 •

SciOrch: Learning to Orchestrate Expert LLMs for Solving Frontier Multimodal Scientific Reasoning Tasks

Paper • 2606.15872 • Published 5 days ago • 4 •

FlowBender: Feedback-Aware Training for Self-Correcting Conditional Flows

Paper • 2606.20404 • Published 1 day ago • 12 •

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

Paper • 2606.20517 • Published 1 day ago • 5 •

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Paper • 2606.15133 • Published 7 days ago • 45 •

commented 5 papers about 21 hours ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Paper • 2606.19236 • Published 3 days ago • 8 •

A Benchmark and Framework for Evaluating Next Action Predictions in Spreadsheets

Paper • 2606.13802 • Published 9 days ago •

LLM-Enabled NWDAF: A Step Toward AI-Native 6G Network Intelligence

Paper • 2606.11877 • Published 10 days ago •

Morpheus: A Morphology-Aware Neural Tokenizer and Word Embedder for Turkish

Paper • 2606.18717 • Published 3 days ago • 1 •

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

Paper • 2606.17682 • Published 4 days ago • 16 •

commented 5 papers 1 day ago

Variable-Width Transformers

Paper • 2606.18246 • Published 4 days ago • 6 •

Visual-Seeker: Towards Visual-Native Multimodal Agentic Search via Active Visual Reasoning

Paper • 2606.15231 • Published 7 days ago • 3 •

Guava: An Effective and Universal Harness for Embodied Manipulation

Paper • 2606.18363 • Published 4 days ago • 24 •

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Paper • 2606.18967 • Published 3 days ago • 20 •

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Paper • 2606.18322 • Published 4 days ago • 16 •

commented 5 papers 2 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Paper • 2606.17861 • Published 4 days ago • 44 •

Noah

AI & ML interests

Recent Activity

Organizations

noahml's activity