Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets Paper • 2604.22294 • Published Apr 24 • 18
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation Paper • 2604.08455 • Published Apr 9 • 47
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory Paper • 2410.10813 • Published Oct 14, 2024 • 16
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance tngtech • Apr 16, 2025 • 79
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 341
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! +4 dvilasuero, Ameeeee, frascuchon, damianpumar, lvwerra, thomwolf • Aug 8, 2025 • 109
view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks nvidia • Aug 11, 2025 • 76
view article Article How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio fdaudens • Aug 14, 2025 • 27
view article Article Share your open ML datasets on Hugging Face Hub! +2 davanstrien, cfahlgren1, lhoestq, erinys • Nov 12, 2024 • 32
view article Article MCP for Research: How to Connect AI to Research Tools dylanebert • Aug 18, 2025 • 69
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98
view article Article Make your ZeroGPU Spaces go brrr with ahead-of-time compilation +2 cbensimon, sayakpaul, linoyts, multimodalart • Sep 2, 2025 • 77
view article Article Vision Language Model Alignment in TRL ⚡️ +3 sergiopaniego, merve, qgallouedec, kashif, ariG23498 • Aug 7, 2025 • 111
view article Article Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio freddyaboulton • Jul 31, 2025 • 60
view article Article Fast LoRA inference for Flux with Diffusers and PEFT sayakpaul, BenjaminB • Jul 23, 2025 • 54
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community +5 davidberenstein1957, burtenshaw, dvilasuero, davanstrien, sayakpaul, Ameeeee, linoyts • Dec 9, 2024 • 71