4 18 8

Roxanna

borntobeignored

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

20x Faster TRL Fine-tuning with RapidFire AI

upvoted an article about 1 month ago

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

upvoted an article about 1 month ago

Open-R1: a fully open reproduction of DeepSeek-R1

View all activity

Organizations

upvoted 4 articles about 1 month ago

Article

20x Faster TRL Fine-tuning with RapidFire AI

Nov 21, 2025

•

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Apr 29, 2025

•

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28, 2025

•

887

Article

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

Jan 29, 2025

•

upvoted a paper about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 108

upvoted an article 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11, 2025

•

176

upvoted 2 articles 3 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18, 2025

•

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8, 2025

•

upvoted a paper 4 months ago

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Paper • 2508.19827 • Published Aug 27, 2025 • 33

upvoted a paper 5 months ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 122

upvoted an article 5 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

741

upvoted 4 papers 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17, 2025 • 124

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21, 2025 • 35

Hierarchical Budget Policy Optimization for Adaptive Reasoning

Paper • 2507.15844 • Published Jul 21, 2025 • 16

upvoted a collection 6 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated 16 days ago • 116

upvoted an article 6 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

751

upvoted a paper 7 months ago

Adaptive Sparse Allocation with Mutual Choice & Feature Choice Sparse Autoencoders

Paper • 2411.02124 • Published Nov 4, 2024 • 1

Roxanna

AI & ML interests

Recent Activity

Organizations

borntobeignored's activity

20x Faster TRL Fine-tuning with RapidFire AI

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Open-R1: a fully open reproduction of DeepSeek-R1

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

SmolLM3: smol, multilingual, long-context reasoner

Uncensor any LLM with abliteration