Unified Python framework for extracellular electrophysiology. Load recordings from 20+ formats (SpikeGLX, OpenEphys, NWB, Intan, Maxwell, Blackrock), preprocess signals, run 10+ spike sorters (Kilosort4, SpykingCircus2, Tridesclous, MountainSort5) with a single API, compute quality metrics (SNR, ISI violations, firing rate, amplitude cutoff), compare sorter outputs, and export to NWB or Phy. Use for format-agnostic and multi-sorter workflows. For a Neuropixels-specific Kilosort4 pipeline with PSTH and population decoding, use neuropixels-analysis instead.
SpikeInterface provides a common Python API to read extracellular recordings from 20+ file formats, preprocess raw voltage traces, run 10+ spike sorters, postprocess and quality-control sorted units, and export results — all without format-specific code. Its modular design lets users swap sorters, formats, and preprocessing steps without rewriting pipelines. SpikeInterface is built around lazy, chainable objects: a Recording holds raw data, a Sorting holds spike times, and a SortingAnalyzer ties them together for waveform and metric computation.
Use neuropixels-analysis instead for a complete Neuropixels-specific Kilosort4 workflow including PSTH computation, tuning curves, and population decoding; for general physiological signal processing, use neurokit2 instead.
Dependencies: spikeinterface[full]>=0.101, probeinterface, numpy, matplotlib; kilosort (pip), or Docker/Singularity for containerized sorters.
Probe geometry: .prb, .json, or auto-detected from format.
pip install "spikeinterface[full]>=0.101" probeinterface
# Optional: Kilosort4 Python package
pip install kilosort
# Optional: Phy for manual curation
pip install phy
import spikeinterface.full as si
import spikeinterface.preprocessing as spre
import spikeinterface.sorters as ss
import spikeinterface.qualitymetrics as sqm
# Load, preprocess, sort, and inspect quality metrics in 10 lines
recording = si.read_openephys("/data/session_001", stream_name="Signals CH")
recording_pp = spre.bandpass_filter(
spre.common_reference(recording, reference="global", operator="median"),
freq_min=300, freq_max=6000,
)
sorting = ss.run_sorter("spykingcircus2", recording_pp, output_folder="./sc2_out")
analyzer = si.create_sorting_analyzer(sorting, recording_pp, folder="./analyzer")
analyzer.compute(["random_spikes", "waveforms", "templates", "noise_levels"])
metrics = sqm.compute_quality_metrics(analyzer, metric_names=["snr", "firing_rate", "isi_violation"])
print(metrics.describe())
SpikeInterface wraps every acquisition format behind a common BaseRecording interface. Once loaded, all objects expose the same methods regardless of origin format.
import spikeinterface.full as si
# SpikeGLX (.bin + .meta)
recording_sglx = si.read_spikeglx("/data/session_001", stream_name="imec0.ap")
# OpenEphys (binary or classic)
recording_oe = si.read_openephys("/data/oe_session", stream_name="Signals CH")
# NWB file
recording_nwb = si.read_nwb_recording("/data/recording.nwb",
electrical_series_name="ElectricalSeries")
# Intan RHD/RHS — check stream names first with si.get_neo_streams("intan", path)
recording_intan = si.read_intan("/data/session.rhd", stream_id="0")
# Inspect any recording with the same API
print(f"Format: {type(recording_sglx).__name__}")
print(f"Channels: {recording_sglx.get_num_channels()}")
print(f"Sampling rate: {recording_sglx.get_sampling_frequency()} Hz")
print(f"Duration: {recording_sglx.get_total_duration():.1f} s")
print(f"Probe: {recording_sglx.get_probe().name}")
# List available streams before loading (useful when a file has multiple streams)
stream_names, stream_ids = si.get_neo_streams("spikeglx", "/data/session_001")
print("Available streams:", stream_names)
# e.g. ['imec0.ap', 'imec0.lf', 'nidq']
# Select a time slice (lazy, no data loaded until get_traces() is called)
recording_slice = recording_sglx.frame_slice(
start_frame=0,
end_frame=int(60 * recording_sglx.get_sampling_frequency()), # first 60 s
)
print(f"Sliced duration: {recording_slice.get_total_duration():.1f} s")
Preprocessing functions return new Recording objects wrapping the original; the chain is applied lazily when data is read. This keeps memory usage low even for multi-hour recordings.
import spikeinterface.preprocessing as spre
# 1. Common median reference — removes shared noise across all channels
recording_cmr = spre.common_reference(recording_sglx,
reference="global",
operator="median")
# 2. Bandpass filter for action potentials (300–6000 Hz typical)
recording_filt = spre.bandpass_filter(recording_cmr,
freq_min=300,
freq_max=6000)
# 3. Detect and remove bad channels (coherence-based detection)
bad_ids, channel_labels = spre.detect_bad_channels(recording_filt,
                                                   method="coherence+psd")
recording_clean = recording_filt.remove_channels(bad_ids)
print(f"Removed {len(bad_ids)} bad channels: {bad_ids}")
print(f"Clean channels: {recording_clean.get_num_channels()}")
# Whitening — decorrelates channels; recommended before template-matching sorters
recording_white = spre.whiten(recording_clean, mode="local")
# Phase shift correction for Neuropixels (samples acquired with small time offsets)
recording_shifted = spre.phase_shift(recording_clean)
# Inspect a short snippet of preprocessed data
traces = recording_white.get_traces(start_frame=0, end_frame=3000, segment_index=0)
print(f"Trace snippet shape: {traces.shape}") # (3000, n_channels)
print(f"Trace range: [{traces.min():.2f}, {traces.max():.2f}] (arbitrary units after whitening)")
ss.run_sorter() wraps every supported sorter behind a uniform call signature. Sorter-specific parameters are passed as keyword arguments; all other pipeline steps are identical.
import spikeinterface.sorters as ss
from pathlib import Path
# List every sorter SpikeInterface can wrap
available = ss.available_sorters()
print("Available sorters:", available)
# List sorters actually installed in the current environment
installed = ss.installed_sorters()
print("Installed locally:", installed)
# Run SpykingCircus2 (CPU, no external deps)
sorting_sc2 = ss.run_sorter(
"spykingcircus2",
recording_clean,
output_folder=Path("./sorter_output/sc2"),
remove_existing_folder=True,
verbose=True,
)
print(f"SpykingCircus2 units: {len(sorting_sc2.get_unit_ids())}")
# Run Kilosort4 via Docker container (no local GPU/MATLAB required)
sorting_ks4 = ss.run_sorter(
"kilosort4",
recording_clean,
output_folder=Path("./sorter_output/ks4"),
    docker_image=True,  # pull and run the official container; use singularity_image=True instead for Singularity
remove_existing_folder=True,
# Kilosort4-specific parameters
nblocks=5,
Th_learned=8,
do_correction=True,
)
print(f"Kilosort4 units: {len(sorting_ks4.get_unit_ids())}")
# Run MountainSort5 (CPU, fast, good for tetrode/low-channel-count probes)
sorting_ms5 = ss.run_sorter(
"mountainsort5",
recording_clean,
output_folder=Path("./sorter_output/ms5"),
scheme="2", # scheme 2 is recommended for high-density probes
detect_threshold=5.5,
)
print(f"MountainSort5 units: {len(sorting_ms5.get_unit_ids())}")
SortingAnalyzer is the central postprocessing object in SpikeInterface >= 0.101. It replaces the older WaveformExtractor and provides a unified interface for waveforms, templates, PCAs, and downstream metrics.
import spikeinterface.full as si
import spikeinterface.postprocessing as spost
# Create analyzer (saves to disk; use format="memory" for in-RAM only)
analyzer = si.create_sorting_analyzer(
sorting_sc2,
recording_clean,
folder="./analyzer_sc2",
format="binary_folder",
overwrite=True,
sparse=True, # sparse=True: only nearby channels per unit
ms_before=1.0,
ms_after=2.0,
)
# Compute extensions in dependency order
analyzer.compute([
"random_spikes", # subsample spike indices for waveform extraction
"waveforms", # raw waveform snippets per unit
"templates", # mean/std template per unit
"noise_levels", # per-channel noise estimate
])
# Retrieve templates
templates = analyzer.get_extension("templates").get_data(outputs="Templates")
print(f"Templates object: {templates}")
print(f"Unit 0 template shape: {templates.get_one_template_dense(0).shape}")
# (n_samples, n_channels)
# Compute amplitude and PCA extensions (needed for quality metrics)
analyzer.compute([
"spike_amplitudes", # amplitude at peak channel per spike
"principal_components", # PCA scores (n_components x n_spikes)
"template_similarity", # pairwise template correlation matrix
"correlograms", # auto- and cross-correlograms
"unit_locations", # estimated unit position on probe (center of mass)
])
# Access spike amplitudes for the first unit (amplitudes align with the spike vector)
import numpy as np
ext_amp = analyzer.get_extension("spike_amplitudes")
all_amps = ext_amp.get_data()                    # one value per spike, spike-vector order
spikes = analyzer.sorting.to_spike_vector()
mask = spikes["unit_index"] == 0                 # spikes belonging to the first unit
print(f"Unit {analyzer.unit_ids[0]} — median |amplitude|: "
      f"{np.median(np.abs(all_amps[mask])):.1f} µV")
Quality metrics summarize unit isolation quality. Metrics requiring only spike times (ISI violations, firing rate) are fast; metrics requiring waveforms (SNR, amplitude cutoff) need the SortingAnalyzer to be populated first.
import spikeinterface.qualitymetrics as sqm
# Compute a standard panel of quality metrics
metrics = sqm.compute_quality_metrics(
analyzer,
metric_names=[
"snr", # signal-to-noise ratio of template peak
"isi_violation", # fraction of ISIs < refractory period
"firing_rate", # mean firing rate (Hz) over recording
"presence_ratio", # fraction of time windows with ≥1 spike
"amplitude_cutoff", # estimated fraction of spikes below threshold
        "nearest_neighbor",  # nearest-neighbor hit/miss rate in PCA space
"silhouette_score", # cluster separation in PCA space
],
)
print(metrics.head())
print(f"\nShape: {metrics.shape}") # (n_units, n_metrics)
import pandas as pd
# Apply threshold-based curation (Allen Institute-style defaults)
thresholds = {
"snr": (">=", 5.0),
"isi_violations_ratio": ("<=", 0.1),
"firing_rate": (">=", 0.1),
"presence_ratio": (">=", 0.9),
"amplitude_cutoff": ("<=", 0.1),
}
keep = pd.Series(True, index=metrics.index)
for col, (op, val) in thresholds.items():
if col not in metrics.columns:
continue
if op == ">=":
keep &= metrics[col] >= val
else:
keep &= metrics[col] <= val
good_unit_ids = metrics[keep].index.tolist()
print(f"Total units: {len(metrics)}")
print(f"Curated units: {len(good_unit_ids)} ({100*len(good_unit_ids)/len(metrics):.0f}%)")
# Filter analyzer to good units
sorting_curated = sorting_sc2.select_units(good_unit_ids)
Compare sorters against each other or against ground truth, then export results in shareable formats.
import spikeinterface.comparison as sc
# Compare two sorters — matches units by spike train overlap
comparison = sc.compare_two_sorters(
sorting_sc2,
sorting_ks4,
sorting1_name="SpykingCircus2",
sorting2_name="Kilosort4",
match_score=0.5, # minimum overlap to count as a match
delta_time=0.4, # coincidence window (ms)
)
# Hungarian-matched unit pairs: unit in sorting1 → best match in sorting2 (-1 = unmatched)
match_12 = comparison.hungarian_match_12
print(match_12.head(10))
# Agreement score matrix (fraction of spike-train overlap between all unit pairs)
agreement_matrix = comparison.agreement_scores
print(f"Agreement matrix shape: {agreement_matrix.shape}")
import spikeinterface.exporters as sexp
# Export curated sorting to NWB (Neurodata Without Borders) via neuroconv
# (pip install neuroconv — SpikeInterface does not write NWB directly)
from neuroconv.tools.spikeinterface import write_sorting
write_sorting(sorting_curated,
              nwbfile_path="./session_sorted.nwb",
              overwrite=True)
print("Exported to NWB: session_sorted.nwb")
# Export to Phy for manual curation
sexp.export_to_phy(
analyzer,
output_folder="./phy_export",
compute_pc_features=True,
copy_binary=True,
remove_if_exists=True,
)
print("Phy export ready at: ./phy_export")
print("Launch Phy with: phy template-gui phy_export/params.py")
Goal: Load an OpenEphys recording, preprocess, run two sorters, compare their agreement, curate the higher-yield output, and export to NWB.
import spikeinterface.full as si
import spikeinterface.preprocessing as spre
import spikeinterface.sorters as ss
import spikeinterface.comparison as sc
import spikeinterface.qualitymetrics as sqm
import spikeinterface.exporters as sexp
from pathlib import Path
# --- Step 1: Load ---
data_dir = Path("/data/oe_recording")
stream_names, stream_ids = si.get_neo_streams("openephys", data_dir)
print("Streams:", stream_names)
recording = si.read_openephys(data_dir, stream_name="Signals CH")
print(f"Loaded: {recording.get_num_channels()} ch, "
f"{recording.get_sampling_frequency()} Hz, "
f"{recording.get_total_duration():.1f} s")
# --- Step 2: Preprocess ---
rec = spre.bandpass_filter(recording, freq_min=300, freq_max=6000)
rec = spre.common_reference(rec, reference="global", operator="median")
bad_ids, _ = spre.detect_bad_channels(rec, method="coherence+psd")
rec = rec.remove_channels(bad_ids)
print(f"Preprocessing complete. Removed channels: {bad_ids}")
# --- Step 3: Run two sorters ---
out = Path("./sorting_outputs")
sorting_sc2 = ss.run_sorter("spykingcircus2", rec,
output_folder=out / "sc2",
remove_existing_folder=True)
sorting_tdc = ss.run_sorter("tridesclous2", rec,
output_folder=out / "tdc",
remove_existing_folder=True)
print(f"SC2 units: {len(sorting_sc2.unit_ids)}, "
f"TDC units: {len(sorting_tdc.unit_ids)}")
# --- Step 4: Compare ---
cmp = sc.compare_two_sorters(sorting_sc2, sorting_tdc,
sorting1_name="SC2",
sorting2_name="Tridesclous2",
match_score=0.5)
match_12 = cmp.hungarian_match_12
print(f"\nMatched units (agreement >= 0.5): {int((match_12 != -1).sum())}")
# --- Step 5: Quality metrics on SC2 (higher yield) ---
analyzer = si.create_sorting_analyzer(sorting_sc2, rec,
folder="./analyzer_sc2",
overwrite=True, sparse=True)
analyzer.compute(["random_spikes", "waveforms", "templates",
"noise_levels", "spike_amplitudes"])
metrics = sqm.compute_quality_metrics(
analyzer,
metric_names=["snr", "firing_rate", "isi_violation",
"presence_ratio", "amplitude_cutoff"],
)
keep = (metrics["snr"] >= 5) & (metrics["isi_violations_ratio"] <= 0.1) \
& (metrics["firing_rate"] >= 0.1) & (metrics["presence_ratio"] >= 0.9)
sorting_curated = sorting_sc2.select_units(metrics[keep].index.tolist())
print(f"\nCurated: {len(sorting_curated.unit_ids)} / {len(sorting_sc2.unit_ids)} units")
# --- Step 6: Export winner to NWB (via neuroconv) ---
from neuroconv.tools.spikeinterface import write_sorting
write_sorting(sorting_curated,
              nwbfile_path="./session_sorted.nwb",
              overwrite=True)
print("Saved: session_sorted.nwb")
Goal: Generate a synthetic recording with known spike trains, run a sorter, and measure true accuracy (recall, precision) against ground truth — for benchmarking sorters or testing preprocessing pipelines.
import spikeinterface.full as si
import spikeinterface.preprocessing as spre
import spikeinterface.sorters as ss
import spikeinterface.comparison as sc
import numpy as np
# --- Step 1: Generate ground-truth synthetic recording ---
# Known spike trains + realistic waveform templates with additive noise
recording_gt, sorting_gt = si.generate_ground_truth_recording(
durations=[120.0], # 120 s recording
sampling_frequency=30000.0,
num_channels=32,
num_units=10,
    noise_kwargs={"noise_levels": 10.0, "dtype": "float32"},
seed=42,
)
print(f"GT recording: {recording_gt.get_num_channels()} ch, "
f"{recording_gt.get_total_duration():.0f} s")
print(f"GT units: {len(sorting_gt.unit_ids)}")
print(f"GT firing rates: "
f"{[round(len(sorting_gt.get_unit_spike_train(u, 0))/120, 1) for u in sorting_gt.unit_ids]} Hz")
# --- Step 2: Preprocess ---
rec_pp = spre.bandpass_filter(recording_gt, freq_min=300, freq_max=6000)
rec_pp = spre.common_reference(rec_pp, reference="global", operator="median")
# --- Step 3: Sort with two sorters ---
sorting_sc2 = ss.run_sorter("spykingcircus2", rec_pp,
output_folder="./gt_sc2",
remove_existing_folder=True)
sorting_ms5 = ss.run_sorter("mountainsort5", rec_pp,
output_folder="./gt_ms5",
remove_existing_folder=True,
scheme="2")
# --- Step 4: Compare each sorter against ground truth ---
for name, sorting_test in [("SC2", sorting_sc2), ("MS5", sorting_ms5)]:
cmp = sc.compare_sorter_to_ground_truth(sorting_gt, sorting_test,
exhaustive_gt=True)
perf = cmp.get_performance(method="pooled_with_average")
print(f"\n{name} vs Ground Truth:")
print(f" Accuracy: {perf['accuracy']:.3f}")
print(f" Recall: {perf['recall']:.3f}")
print(f" Precision: {perf['precision']:.3f}")
print(f" Well-detected units: {cmp.get_well_detected_units(well_detected_score=0.8)}")
Goal: Apply the same preprocessing + sorting pipeline to multiple recording sessions and collect quality metrics across all sessions.
import spikeinterface.full as si
import spikeinterface.preprocessing as spre
import spikeinterface.sorters as ss
import spikeinterface.qualitymetrics as sqm
import pandas as pd
from pathlib import Path
sessions = list(Path("/data/experiment").glob("session_*/"))
all_metrics = []
for session_dir in sessions:
print(f"Processing {session_dir.name} ...")
try:
        stream_names, _ = si.get_neo_streams("spikeglx", session_dir)
        ap_stream = [s for s in stream_names if "ap" in s][0]
rec = si.read_spikeglx(session_dir, stream_name=ap_stream)
# Preprocess
rec = spre.bandpass_filter(
spre.common_reference(rec, reference="global", operator="median"),
freq_min=300, freq_max=6000,
)
rec, _ = spre.remove_bad_channels(rec)
# Sort
out_dir = session_dir / "sorting"
sorting = ss.run_sorter("spykingcircus2", rec,
output_folder=out_dir,
remove_existing_folder=True)
# Compute metrics
analyzer = si.create_sorting_analyzer(
sorting, rec, folder=session_dir / "analyzer", overwrite=True, sparse=True
)
analyzer.compute(["random_spikes", "waveforms", "templates",
"noise_levels", "spike_amplitudes"])
m = sqm.compute_quality_metrics(
analyzer, metric_names=["snr", "firing_rate", "isi_violation"]
)
m["session"] = session_dir.name
all_metrics.append(m)
except Exception as e:
print(f" FAILED: {e}")
continue
# Combine across sessions
combined = pd.concat(all_metrics)
combined.to_csv("all_sessions_metrics.csv")
print(f"\nSaved metrics: {combined.shape[0]} units across {len(all_metrics)} sessions")
print(combined.groupby("session")[["snr", "firing_rate"]].median())
| Parameter | Module / Function | Default | Range / Options | Effect |
|---|---|---|---|---|
| freq_min / freq_max | spre.bandpass_filter | 300 / 6000 Hz | 150–500 / 3000–10000 Hz | Spike band; use 300–6000 Hz for AP activity |
| reference | spre.common_reference | "global" | "global", "local", "single" | Channel subset used for median reference subtraction |
| method | spre.detect_bad_channels | "coherence+psd" | "coherence+psd", "std", "mad" | Algorithm for bad channel detection |
| scheme | ss.run_sorter("mountainsort5") | "2" | "1", "2", "3" | Sorting scheme; scheme 2 recommended for high-density probes |
| nblocks | ss.run_sorter("kilosort4") | 5 | 0–10 | Number of drift-correction blocks; 0 disables drift correction |
| Th_learned | ss.run_sorter("kilosort4") | 8 | 6–12 | Detection threshold (× noise); lower = more units, more noise |
| match_score | sc.compare_two_sorters | 0.5 | 0.1–0.9 | Minimum spike-train overlap to declare a unit match |
| sparse | si.create_sorting_analyzer | True | True, False | Limit waveform extraction to channels near each unit; reduces memory |
| ms_before / ms_after | si.create_sorting_analyzer | 1.0 / 2.0 ms | 0.5–2.0 / 1.0–3.0 ms | Waveform snippet window relative to detected spike peak |
| snr threshold | sqm.compute_quality_metrics | — | 5–10 recommended | Amplitude/noise ratio; > 5 indicates a well-isolated unit |
| isi_violations_ratio | sqm.compute_quality_metrics | — | ≤ 0.1 recommended | Fraction of ISIs < refractory period (1.5 ms); < 0.1 suggests a single unit |
| presence_ratio | sqm.compute_quality_metrics | — | ≥ 0.9 recommended | Fraction of recording epochs where unit fires; < 0.9 suggests drift or instability |
Always inspect available streams before loading: Different acquisition systems save AP data, LFP data, and auxiliary channels as separate streams. Loading the wrong stream silently yields valid-looking but incorrect data.
stream_names, stream_ids = si.get_neo_streams("spikeglx", data_dir)
print(stream_names)  # e.g. ['imec0.ap', 'imec0.lf', 'nidq']
recording = si.read_spikeglx(data_dir, stream_name="imec0.ap")
Chain preprocessing lazily; do not load to memory early: Preprocessing objects are lazy and apply transformations at read time. Calling get_traces() on the raw recording before preprocessing will load unfiltered data into RAM unnecessarily. Build the full chain before any data access.
Use sparse=True when creating a SortingAnalyzer: For high-channel-count probes (64–384 channels), dense waveform extraction is 10–50× more expensive in RAM and disk than sparse. Sparse mode extracts waveforms only on the channels nearest each unit.
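The savings are easy to estimate with back-of-envelope arithmetic; the numbers below (384 channels, 90-sample snippets, 500 sampled spikes for each of 200 units, ~16 sparse channels per unit) are illustrative assumptions, not SpikeInterface defaults:

```python
# float32 waveform storage: units x spikes x samples x channels x 4 bytes
n_units, n_spikes, n_samples = 200, 500, 90
dense_ch, sparse_ch = 384, 16

dense_gb = n_units * n_spikes * n_samples * dense_ch * 4 / 1e9
sparse_gb = n_units * n_spikes * n_samples * sparse_ch * 4 / 1e9
print(f"dense ≈ {dense_gb:.1f} GB, sparse ≈ {sparse_gb:.2f} GB "
      f"({dense_ch // sparse_ch}x smaller)")
# dense ≈ 13.8 GB, sparse ≈ 0.58 GB (24x smaller)
```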
Run containerized sorters to avoid dependency conflicts: Kilosort2/3 (MATLAB), IronClust, and other sorters have complex dependencies. Use docker_image=True in run_sorter() to pull the official container and run the sorter in isolation:
sorting = ss.run_sorter("kilosort2_5", recording_clean,
output_folder="./ks25_out",
docker_image=True)
Compute metrics extensions in dependency order: Extensions depend on each other. The canonical order is: random_spikes → waveforms → templates → noise_levels → spike_amplitudes → principal_components. Skipping an earlier step causes a MissingExtensionError when a downstream step is requested.
Save the SortingAnalyzer to disk for large recordings: In-memory analyzers (format="memory") are lost when the process exits. For recordings longer than 30 minutes or with many units, always specify a folder path so the analyzer can be reloaded:
analyzer = si.load_sorting_analyzer("./analyzer_sc2")
Do not compare sorters with mismatched preprocessing: When benchmarking sorters, run all of them on the same preprocessed recording_clean object. Running sorters on different preprocessing chains invalidates the comparison.
When to use: Quickly check what streams are available in an unfamiliar recording and confirm channel counts and duration before committing to a full sort.
import spikeinterface.full as si
data_dir = "/data/recording_session"
# Try SpikeGLX first; if it fails, try OpenEphys