Measuring what Matters: Construct Validity in Large Language Model Benchmarks Paper • 2511.04703 • Published Nov 3, 2025 • 8
Training language models to be warm and empathetic makes them less reliable and more sycophantic Paper • 2507.21919 • Published Jul 29, 2025 • 2
Clinical knowledge in LLMs does not translate to human interactions Paper • 2504.18919 • Published Apr 26, 2025 • 26
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Paper • 2504.07961 • Published Apr 10, 2025 • 5
MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild Paper • 2406.01595 • Published Jun 3, 2024
A Linear Reconstruction Approach for Attribute Inference Attacks against Synthetic Data Paper • 2301.10053 • Published Jan 24, 2023
Into the crossfire: evaluating the use of a language model to crowdsource gun violence reports Paper • 2401.12989 • Published Jan 16, 2024
When the signal is in the noise: Exploiting Diffix's Sticky Noise Paper • 1804.06752 • Published Apr 18, 2018
Characterizing and modeling harms from interactions with design patterns in AI interfaces Paper • 2404.11370 • Published Apr 17, 2024
ACES: Automatic Cohort Extraction System for Event-Stream Datasets Paper • 2406.19653 • Published Jun 28, 2024
GREEN: Generative Radiology Report Evaluation and Error Notation Paper • 2405.03595 • Published May 6, 2024
Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis Paper • 2407.07295 • Published Jul 10, 2024
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning Paper • 2406.00392 • Published Jun 1, 2024 • 14
Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks? Paper • 2404.03411 • Published Apr 4, 2024 • 10
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning Paper • 2312.14878 • Published Dec 22, 2023 • 15
Frontier AI Regulation: Managing Emerging Risks to Public Safety Paper • 2307.03718 • Published Jul 6, 2023 • 5