Links
Clean code implementation of Mamba (a minimal recurrence sketch follows this list)
Gated linear attention (transformer)
New Mamba model (12th December): 3B parameters, trained on 600B tokens
Mamba, Memory, and the SSM Moment (Cog Rev Podcast)
Sparse Notes Mamba walkthrough
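
For readers following the implementation and walkthrough links above, here is a minimal sketch of the selective state-space recurrence at the heart of Mamba, reduced to a single input channel in plain NumPy. The parameter names A, B, C, and Δ follow the Mamba paper; the projection weights (W_B, W_C, W_delta) and the Euler step for B̄ are illustrative simplifications, not the reference implementation.

```python
import numpy as np

def selective_ssm(x, A, W_B, W_C, W_delta):
    """Minimal single-channel selective SSM scan (illustrative only).

    x: (T,) input sequence; A: (N,) diagonal state matrix (negative entries);
    W_B, W_C, W_delta: projections that make B, C, and the step size delta
    functions of the input -- the "selective" part of Mamba.
    """
    T, N = x.shape[0], A.shape[0]
    h = np.zeros(N)            # hidden state
    y = np.empty(T)
    for t in range(T):
        delta = np.log1p(np.exp(W_delta * x[t]))  # softplus keeps delta > 0
        B = W_B * x[t]                            # input-dependent B_t
        C = W_C * x[t]                            # input-dependent C_t
        A_bar = np.exp(delta * A)                 # ZOH discretization of A
        B_bar = delta * B                         # Euler step for B (common simplification)
        h = A_bar * h + B_bar * x[t]              # h_t = A_bar h_{t-1} + B_bar x_t
        y[t] = C @ h                              # y_t = C_t h_t
    return y

# Toy usage: random projections, 4-dim state, 10-step sequence
rng = np.random.default_rng(0)
N = 4
out = selective_ssm(rng.standard_normal(10),
                    -np.abs(rng.standard_normal(N)),
                    rng.standard_normal(N), rng.standard_normal(N),
                    rng.standard_normal())
```

Because A, B, and C are diagonal or vector-valued here, each step is O(N); the real model vectorizes this scan across channels and uses a hardware-aware parallel scan rather than a Python loop.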
Papers Inspired by Mamba
Is Mamba Capable of In-Context Learning?
The Hidden Attention of Mamba Models
Theoretical Foundations of Deep Selective State-Space Models
Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy
A multi-cohort study on prediction of acute brain dysfunction states
Universality of Linear Recurrences Followed by Non-linear Projections
Large Window-based Mamba UNet for Medical Image Segmentation
Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Activating Wider Areas in Image Super-Resolution
On the low-shot transferability of [V]-Mamba
Is Mamba Effective for Time Series Forecasting?
Music to Dance as Language Translation using Sequence Models
Uncovering Selective State Space Model’s Capabilities in Lifelong Sequential Recommendation
State Space Models as Foundation Models
Proprioception Is All You Need
Locating and Editing Factual Associations in Mamba
Does Transformer Interpretability Transfer to RNNs?
A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion
State Space Model for New-Generation Network Alternative to Transformers
Integrating Mamba and Transformer for Long-Short Range Time Series Forecasting