GitHunt
AR

Arya-Honarpisheh/gen_err_sel_ssm

Official Implementation of "Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention"

gen_err_sel_ssm

This repository contains the code for the experiments in the paper "Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention", Accepted as a poster to NeurIPS 2025.

๐Ÿ”ง Setup (Conda)

To create the environment:

conda env create -f environment.yml -n gen_err_sel_ssm
conda activate gen_err_sel_ssm

๐Ÿ“ฆ Code Overview

  • The Selective State-Space Model is implemented in PyTorch in models/selective_ssm.py.
  • The main script is main.py, which loads data, trains the model, and saves the results for different experiments.
    You can run it directly from the command line, or modify and execute the helper script:
bash run_main.sh

๐Ÿ“Š Experiments

  • Length-independence experiment: test_models.ipynb
  • Stability margin experiment: plot_sA_T.ipynb
  • Significant analysis: significance_test.ipynb

๐Ÿ—‚๏ธ Datasets

  • IMDB (sentiment, 25k train / 25k test, binary labels): Stanford IMDB (Maas et al., 2011) โ€ข HF dataset card
  • ListOps (hierarchical parsing / long-context reasoning): part of Long Range Arena (LRA) โ€” arXiv โ€ข LRA GitHub
  • Majority (synthetic): binary sequence where the label equals the majority bit; we use a simple generator released in this repo.

Dataset loading functions are in training/load_datasets.py.

๐Ÿ“š Citation

If you find this repository useful, please cite:

@inproceedings{
honarpisheh2025generalization,
title={Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention},
author={Arya Honarpisheh and Mustafa Bozdag and Octavia Camps and Mario Sznaier},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://openreview.net/forum?id=YVZbaVikBp}
}


Contributors

GNU General Public License v3.0
Created May 14, 2025
Updated November 4, 2025