Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation Paper • 2604.27263 • Published 11 days ago • 8
Targeted Neuron Modulation via Contrastive Pair Search Paper • 2605.12290 • Published 13 days ago • 14
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1, 2025 • 81
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777
🧠 SmolLM3 Collection Smol, multilingual, long-context reasoner • 14 items • Updated Oct 9, 2025 • 104
view article Article Efficient MultiModal Data Pipeline +3 ariG23498, lusxvr, andito, sergiopaniego, pcuenq • Jul 8, 2025 • 71
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb • May 21, 2025 • 258
view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 613
Perception Encoder: The best visual embeddings are not at the output of the network Paper • 2504.13181 • Published Apr 17, 2025 • 36
Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure Paper • 2504.10049 • Published Apr 14, 2025 • 2
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 ariG23498, merve, pcuenq, reach-vb • Mar 12, 2025 • 497
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7, 2025 • 81
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model EuroBERT • Mar 10, 2025 • 147
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality +2 saurabhdash, olivernan, ArashAhmadian, johndang-cohere • Mar 4, 2025 • 78