M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Paper
• 2504.10449 • Published
• 15
None defined yet.
mamba is now available in transformers. Thanks to @tridao and @albertgu for this brilliant model! 🚀 and the amazing mamba-ssm kernels powering this!