Shumin Deng's picture

3 3

Shumin Deng

231sm

·

231sm

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

upvoted a paper 2 months ago

LightMem: Lightweight and Efficient Memory-Augmented Generation

upvoted a paper 5 months ago

Automating Steering for Safe Multimodal Large Language Models

View all activity

Organizations

authored a paper 7 months ago

Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms

Paper • 2505.20322 • Published May 23 • 14

authored a paper 9 months ago

CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners

Paper • 2503.16356 • Published Mar 20 • 15

authored 2 papers 10 months ago

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published Feb 16 • 23

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16 • 30

authored a paper over 1 year ago

Knowledge Mechanisms in Large Language Models: A Survey and Perspective

Paper • 2407.15017 • Published Jul 22, 2024 • 34