Hackernews posts about Mamba/Transformer
- Jamba: A Hybrid Transformer-Mamba Language Model (arxiv.org)
- Nemotron-H: A Family of Accurate, Efficient Hybrid Mamba-Transformer Models (research.nvidia.com)
- Mamba (Transformer Alternative): The Future of LLMs? (lazyprogrammer.me)
- Nvidia debuts Nemotron 3 with hybrid MoE and Mamba-Transformer (venturebeat.com)
- Transformers are SSMs (Mamba-2) (arxiv.org)
- The Mamba in the Llama (distilling from transformers) (twitter.com)
- Mamba Explained: The State Space Model Taking On Transformers (www.kolaayonrinde.com)
- Show HN: New Cartesia Text-to-Speech Model (www.cartesia.ai)
- Bamba: An open-source LLM that crosses a transformer with an SSM (research.ibm.com)
- Jamba: AI21's Groundbreaking SSM-Transformer Model (www.ai21.com)
- Jamba: A groundbreaking hybrid SSM-Transformer model (www.ai21.com)
- Jamba: AI21's Groundbreaking SSM-Transformer Model (www.ai21.com)