1 13

Yize Cheng

yizecheng

https://chengez.github.io

chengez

AI & ML interests

None yet

Recent Activity

authored a paper 7 days ago

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

upvoted a paper 11 days ago

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

upvoted a paper 14 days ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

View all activity

Organizations

None yet

authored a paper 7 days ago

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

Paper • 2512.19995 • Published 14 days ago • 14

upvoted a paper 11 days ago

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

Paper • 2512.19995 • Published 14 days ago • 14

upvoted a paper 14 days ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Paper • 2512.18880 • Published 15 days ago • 24

upvoted a paper 21 days ago

V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions

Paper • 2512.11995 • Published 24 days ago • 9

upvoted a paper about 2 months ago

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

Paper • 2511.02734 • Published Nov 4, 2025 • 20

updated a dataset 3 months ago

yizecheng/TicToc

Updated Oct 22, 2025 • 6

published a dataset 3 months ago

yizecheng/TicToc

Updated Oct 22, 2025 • 6

upvoted a paper 5 months ago

Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision

Paper • 2507.20976 • Published Jul 28, 2025 • 10

authored a paper 5 months ago

Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text

Paper • 2506.07001 • Published Jun 8, 2025 • 4

upvoted 3 papers 5 months ago

Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text

Paper • 2506.07001 • Published Jun 8, 2025 • 4

MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

Paper • 2503.03205 • Published Mar 5, 2025 • 4

Diversity-Enhanced Reasoning for Subjective Questions

Paper • 2507.20187 • Published Jul 27, 2025 • 25

upvoted a paper 6 months ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published Jun 26, 2025 • 28

commented a paper 7 months ago

DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

Paper • 2505.23001 • Published May 29, 2025 • 8 •

authored a paper 7 months ago

Attacking by Aligning: Clean-Label Backdoor Attacks on Object Detection

Paper • 2307.10487 • Published Jul 19, 2023

upvoted a paper 7 months ago

DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

Paper • 2505.23001 • Published May 29, 2025 • 8

authored a paper 7 months ago

Gaming Tool Preferences in Agentic LLMs

Paper • 2505.18135 • Published May 23, 2025 • 8

upvoted a paper 7 months ago

Gaming Tool Preferences in Agentic LLMs

Paper • 2505.18135 • Published May 23, 2025 • 8

upvoted 2 papers 9 months ago

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published Apr 10, 2025 • 48

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Paper • 2504.06514 • Published Apr 9, 2025 • 39

Yize Cheng

AI & ML interests

Recent Activity

Organizations

yizecheng's activity