Drift No More? Context Equilibria in Multi-Turn LLM Interactions Paper • 2510.07777 • Published Oct 9, 2025
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations Paper • 2506.20100 • Published Jun 25, 2025 • 1
Plan Verification for LLM-Based Embodied Task Completion Agents Paper • 2509.02761 • Published Sep 2, 2025
TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons Paper • 2504.19982 • Published Apr 28, 2025
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents Paper • 2505.01592 • Published May 2, 2025
TICL: Text-Embedding KNN For Speech In-Context Learning Unlocks Speech Recognition Abilities of Large Multimodal Models Paper • 2509.13395 • Published Sep 16, 2025 • 1
GenoMAS: A Multi-Agent Framework for Scientific Discovery via Code-Driven Gene Expression Analysis Paper • 2507.21035 • Published Jul 28, 2025 • 3
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models Paper • 2311.07022 • Published Nov 13, 2023 • 1
Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare Paper • 2404.16621 • Published Apr 25, 2024
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents Paper • 2411.00927 • Published Nov 1, 2024 • 2
Simulating User Agents for Embodied Conversational-AI Paper • 2410.23535 • Published Oct 31, 2024 • 1
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents Paper • 2410.23555 • Published Oct 31, 2024
Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems Paper • 2501.17348 • Published Jan 28, 2025 • 1
Beyond Pixels: Exploring Human-Readable SVG Generation for Simple Images with Vision Language Models Paper • 2311.15543 • Published Nov 27, 2023
Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models Paper • 2308.10632 • Published Aug 21, 2023