Leaky Integrate-and-Fire (LIF) Neuron
The LIF model treats the neuron membrane as an RC circuit: injected current charges the capacitance; the leak resistance continuously drains it toward Vrest. When V reaches threshold Vth, a spike is emitted, V resets to Vreset, and the neuron enters a brief refractory period.
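These dynamics fit in a few lines of code. The following forward-Euler simulation is a minimal sketch — the function name and all parameter values (time constant, resistance, thresholds) are illustrative choices, not taken from any particular preparation:

```python
# Minimal LIF neuron via forward-Euler integration.
# All names and parameter values here are illustrative.

def simulate_lif(I, dt=1e-4, tau_m=0.02, R=1e7,
                 v_rest=-0.070, v_th=-0.054, v_reset=-0.075, t_ref=0.002):
    """Integrate dV/dt = (-(V - v_rest) + R*I) / tau_m; return spike times (s)."""
    v = v_rest
    refractory_until = -1.0
    spikes = []
    for step, i_in in enumerate(I):
        t = step * dt
        if t < refractory_until:
            v = v_reset                      # clamp during refractory period
            continue
        v += dt * (-(v - v_rest) + R * i_in) / tau_m
        if v >= v_th:                        # comparator: threshold crossing
            spikes.append(t)
            v = v_reset                      # instantaneous reset
            refractory_until = t + t_ref     # start refractory timer
    return spikes

# Constant suprathreshold drive yields a regular spike train;
# zero drive yields none — the RC circuit alone never "fires".
spike_times = simulate_lif([2e-9] * 5000)    # 2 nA for 0.5 s
```

Note that the threshold comparator, reset clamp, and refractory timer appear as three explicit lines of code — exactly the three mechanisms layered on top of the passive RC dynamics.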
RC circuit analogy. The LIF equation is exactly the nodal-voltage equation for a parallel RC circuit driven by a current source: the capacitance C = τm/R integrates injected current while the conductance G = 1/R leaks charge back to Vrest. The threshold comparator and instantaneous reset together form the spike-generation mechanism. For students comfortable with analog electronics, this is a precision comparator with clamped feedback and a brief hold period.
Σ-Δ modulation. The integrate-and-fire mechanism is structurally a first-order Σ-Δ encoder: the neuron continuously accumulates the error between input and an implicit reference, fires (dumping the accumulated charge) when the total reaches threshold, and repeats. At constant drive above threshold the output is a regular pulse train — the neural equivalent of fixed-duty-cycle PWM. Varying drive modulates rate, exactly as a 1-bit Σ-Δ ADC's output bit-density encodes an analog input.
Threshold, reset, and refractory period. These three mechanisms — the hard threshold Vth, the instantaneous reset to Vreset, and the silent refractory interval τref — are additions layered on top of the passive RC dynamics, not properties of the RC circuit itself. The RC circuit alone would just charge and discharge continuously; spiking behavior requires an explicit comparator (threshold), a clamp (reset), and a timer (refractory). In a biophysically realistic Hodgkin-Huxley neuron, all three behaviors emerge naturally from the dynamics of voltage-gated Na⁺ and K⁺ channels — no separate mechanisms are needed. The LIF model trades that emergent complexity for analytical tractability: the hard-coded mechanisms are crude approximations, but they preserve the input-integration and firing-rate relationships that make the neuron computationally useful.
Sliding-mode control. Near threshold with oscillating or noisy input, the neuron can enter high-frequency irregular firing that is structurally identical to chattering in sliding-mode controllers: the state trajectory repeatedly crosses the switching surface V = Vth as the system alternates between integration and hard reset — a relay nonlinearity embedded in a continuous first-order plant.
Rate Coding
In rate coding, information is carried by how often a neuron fires within a time window. The signal amplitude modulates the instantaneous firing probability; a downstream neuron estimates the original signal by counting spikes over a recent window and dividing by the window width.
In a classical ANN, each neuron's contribution is a single floating-point activation — information encoded in space (which neuron, what value, all at once). In an SNN, the same information is a binary event sequence — information encoded in time (when and how often did this neuron fire). This shift has consequences for hardware (sparse, asynchronous communication suffices), for learning (timing-sensitive rules replace weight-gradient averaging), and for what computations are natural to represent.
The windowed spike counter demonstrated here is the most transparent rate decoder: count spikes in a sliding window of width W, divide by W to get an estimate in Hz. A narrow window tracks fast signal changes but the count of 0, 1, 2, … spikes is coarse and noisy. A wide window averages out the Poisson noise but lags behind signal changes and smears together distinct features. This is the fundamental bias–variance trade-off of rate coding — and it is why rate coding requires relatively high firing rates to transmit fast-changing signals reliably.
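The trade-off is easy to see numerically. In this sketch (function names and parameters are illustrative), a Bernoulli approximation of a Poisson train is decoded with a narrow and a wide window:

```python
import random

# Sliding-window rate decoder: count spikes in the last W seconds, divide by W.
# Function names and parameter values are illustrative.

def poisson_spikes(rate_hz, duration, dt=1e-3, seed=0):
    """Bernoulli approximation of a Poisson spike train (one draw per dt bin)."""
    rng = random.Random(seed)
    return [t * dt for t in range(int(duration / dt)) if rng.random() < rate_hz * dt]

def window_rate(spikes, t, w):
    """Estimated rate (Hz) at time t: spikes in (t - w, t] divided by w."""
    return sum(1 for s in spikes if t - w < s <= t) / w

spikes = poisson_spikes(rate_hz=50.0, duration=10.0)
narrow = window_rate(spikes, t=10.0, w=0.05)   # coarse: only 0, 20, 40, ... Hz possible
wide   = window_rate(spikes, t=10.0, w=2.0)    # smooth, but averages over 2 s of history
```

With a 50 ms window the estimate is quantized in 20 Hz steps (0, 1, 2, … spikes); with a 2 s window it converges toward the true 50 Hz but cannot track anything faster than ~0.5 Hz.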
Temporal coding. An alternative hypothesis holds that information is encoded not in firing rate but in the precise timing of each spike relative to a background reference oscillation (e.g., the hippocampal theta rhythm at 4–12 Hz). A neuron fires once per theta cycle, and the phase of that spike within the cycle encodes the instantaneous signal value — earlier in the cycle means higher value. The key advantage over rate coding is sparsity: where rate coding requires many spikes to estimate a mean rate reliably, temporal coding needs only one spike per cycle, regardless of signal amplitude. Fewer spikes means less synaptic activity, less dynamic power dissipation, and — on neuromorphic hardware — fewer spike routing events traversing the on-chip network. On a platform like Loihi 2, where dynamic energy consumption scales with spike event count, temporal coding can reduce inference energy by an order of magnitude compared to rate coding at equivalent signal fidelity. Temporal codes are also more robust to certain classes of noise: a jittered spike timing shifts the decoded value by a small amount, whereas in rate coding, missed spikes or spurious spikes directly corrupt the rate estimate in proportion to how sparse the train already is. The trade-off is that temporal decoding requires a shared reference oscillation between encoder and decoder — a synchronization constraint that rate coding does not impose — and that the information-theoretic advantage is most pronounced when the reference frequency is high relative to the signal bandwidth.
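The phase-coding scheme can be sketched as an encode/decode pair. The linear phase mapping below is one illustrative choice (names and constants are assumptions, not from the text); the essential point is that one spike per cycle suffices, provided encoder and decoder share the reference oscillation:

```python
import math

# Phase-coding sketch: one spike per reference cycle; earlier phase = higher value.
# The linear mapping and all names/constants are illustrative.

THETA_HZ = 8.0                      # shared reference oscillation (theta band)
PERIOD = 1.0 / THETA_HZ

def encode(value, cycle_index):
    """Map value in [0, 1] to one spike time; higher value -> earlier phase."""
    phase = (1.0 - value) * 2 * math.pi           # 0 rad = cycle start
    return cycle_index * PERIOD + phase / (2 * math.pi) * PERIOD

def decode(spike_time):
    """Recover the value from spike phase relative to the shared reference."""
    phase = (spike_time % PERIOD) / PERIOD * 2 * math.pi
    return 1.0 - phase / (2 * math.pi)

t = encode(0.75, cycle_index=3)     # one spike carries the whole value
assert abs(decode(t) - 0.75) < 1e-9
```

A small timing jitter in `spike_time` shifts the decoded value only slightly, whereas deleting the one spike per cycle loses that cycle's value entirely — the robustness profile described above.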
Spike-Timing-Dependent Plasticity (STDP)
STDP is a temporally asymmetric Hebbian rule observed in cortical synapses. Synaptic weight changes depend on the relative timing Δt = tpost − tpre: pre before post (Δt > 0) potentiates the synapse (LTP); post before pre (Δt < 0) depresses it (LTD).
Causal structure. STDP makes Hebb's rule temporally explicit: "neurons that fire together wire together," but with the critical refinement that causal order matters. Pre → Post (Δt > 0) means the pre-synaptic neuron plausibly contributed to driving the post-synaptic spike — strengthen the synapse. Post → Pre (Δt < 0) is anticausal — the pre-synaptic neuron fired after the event it supposedly caused — weaken it.
Asymmetry and stability. The typical biological default A− > A+ (as set here) provides a net depressive bias that counteracts runaway potentiation — analogous to weight decay or L2 regularization in supervised learning. This asymmetry, combined with homeostatic intrinsic plasticity (not shown here), is thought to drive the self-organized formation of stable receptive fields and temporal sequence representations in cortical circuits.
Analogy to pheromone evaporation in ACO. The evaporation rate ρ in Ant Colony Optimization plays a structurally identical stabilizing role. Without evaporation, pheromone accumulates on every traversed path until all routes are indistinguishably saturated and the colony loses selectivity. With evaporation, only paths that receive consistent, repeated reinforcement maintain high pheromone levels — sporadic or poor-quality paths decay away. The A− > A+ bias in STDP works the same way: synapses that fire together only occasionally get net-depressed out, while synapses with strong, reliable causal relationships accumulate enough LTP to overcome the depressive tide. In both cases the "forgetting" mechanism is not a loss — it is precisely what allows the system to extract and hold onto consistent structure in the input.
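The pair-based exponential form of the rule is compact enough to state directly. In this sketch the amplitudes and time constants are illustrative, with A− > A+ supplying the net depressive bias discussed above:

```python
import math

# Pair-based exponential STDP window. A_MINUS > A_PLUS gives the net
# depressive bias discussed above; all names and constants are illustrative.

A_PLUS, A_MINUS = 0.010, 0.012
TAU_PLUS, TAU_MINUS = 0.020, 0.020     # seconds

def stdp_dw(t_pre, t_post):
    """Weight change for one pre/post spike pair, dt = t_post - t_pre."""
    dt = t_post - t_pre
    if dt > 0:                                   # pre before post: causal, LTP
        return A_PLUS * math.exp(-dt / TAU_PLUS)
    elif dt < 0:                                 # post before pre: anticausal, LTD
        return -A_MINUS * math.exp(dt / TAU_MINUS)
    return 0.0

assert stdp_dw(0.000, 0.010) > 0     # LTP for causal pairing
assert stdp_dw(0.010, 0.000) < 0     # LTD for anticausal pairing
# Symmetric timing jitter nets out depressive: |LTD| > LTP at equal |dt|.
assert stdp_dw(0.010, 0.000) + stdp_dw(0.000, 0.010) < 0
```

The final assertion is the stabilizing asymmetry in miniature: a synapse whose pre/post timing is random drifts downward, while one with consistently causal timing accumulates net potentiation.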
Neuromorphic Hardware & Software
Neuromorphic chips implement spiking neural networks directly in silicon, exploiting the sparsity and asynchronous event-driven character of spike trains to achieve orders-of-magnitude better energy efficiency than clocked GPU inference on dense activations.
| Property | GPU / CPU (classical ANN) | Neuromorphic (e.g., Loihi 2) |
|---|---|---|
| Computation trigger | Clocked, synchronous | Asynchronous, event-driven spikes |
| Activity pattern | Dense — every neuron computed each step | Sparse — only spiking neurons active |
| Inference power | Tens to hundreds of watts | Microwatts to milliwatts |
| Latency model | Batch-optimized; high throughput | Ultra-low latency per spike event |
| On-device learning | Typically offline backpropagation | Local STDP; genuinely online |
| Temporal state | Absent (feedforward) or explicit (RNNs) | Intrinsic — neurons carry membrane state |
| Programmability | Fully general GPU / CPU | Constrained to neuromorphic core model |
| Scale-out | Data-parallelism on large homogeneous arrays | Many-core NoC; Pohoiki Springs = 768 chips |
Loihi 2, Intel's second-generation research neuromorphic processor (2021), integrates 1 million programmable neurons across 128 neuromorphic cores, with up to 120 million synapses and six embedded Lakemont x86 management cores. Each neuromorphic core simulates up to roughly 8000 neurons (1 million across 128 cores) running a configurable generalized-LIF model with multiple programmable state variables.
Communication is entirely event-driven and asynchronous: a neuron emits a spike packet only when it fires, routed over an on-chip mesh network-on-chip (NoC). Silent neurons consume almost no dynamic power — in sharp contrast to the dense synchronous multiply-accumulate operations executed for every neuron on every clock cycle of a GPU, regardless of whether the activations are zero.
Loihi 2 also supports on-chip learning: STDP-like plasticity rules are encoded in programmable learning microcode executing locally at each synapse after each spike event, without any off-chip gradient computation. This enables genuinely online, real-time synaptic adaptation at milliwatt-scale power budgets. Intel's Pohoiki Springs system scales to 768 Loihi chips, approaching 100 million neurons on a single board.
Neuromorphic hardware is not required to study, simulate, or train SNNs. Frameworks such as snnTorch, Norse, and SpyTorch run on conventional CPUs and GPUs.
What Can You Do with a Neuromorphic System?
One of the less-appreciated features of neuromorphic systems is that the hardest problem in modern deep learning — how to train a network — is not always the right question to ask. The spike-based dynamics of SNNs and the local synaptic plasticity rules that emerge naturally from their hardware make them excellent candidates for tasks that require no global supervision at all. Only after covering those strengths does it make sense to ask what it costs to impose traditional supervised learning on a platform that wasn't designed for it.
I. Unsupervised Learning — The Natural Home of Neuromorphics
STDP (Tab ③) is a purely local, unsupervised synaptic update rule. No labels are required, no loss function is computed, and no gradient is propagated backward through the network. A synapse updates itself based solely on the relative timing of the two neurons it connects. The rule is implementable in silicon at the individual synapse — Loihi 2 encodes STDP variants directly in per-synapse learning microcode — so the full learning loop runs on-chip, asynchronously, in real time, at milliwatt power.
This is qualitatively different from unsupervised learning in conventional ANNs (autoencoders, contrastive self-supervised learning), which still require a global loss, a backward pass, and typically a GPU. Neuromorphic STDP-based learning is physically local in a way that conventional unsupervised learning is not. Exposing the network to structured inputs — visual patterns, acoustic features, spike sequences — gradually specializes neurons into feature detectors without any labeling burden. The selectivity emerges from the statistics of the input itself.
A compelling realization of STDP-based unsupervised learning is the memristor crossbar array. In this architecture, synaptic weights are stored as analog conductance values in non-volatile memristive devices arranged at crossbar intersections. Because a memristor's conductance changes in response to the voltage pulses it experiences — and those pulses are determined by the pre- and post-synaptic spike times — the device itself is the STDP learning rule. No explicit weight-update computation is needed; the physics of the device does the work.
In practice, such systems can be trained for classification tasks simply by presenting labeled-class inputs during an unsupervised exposure phase. Neurons that fire together in response to a class prototype wire together; neurons driven by distinct prototypes self-organize into separate, discriminative representations. After exposure, the network classifies novel inputs by the pattern of elicited spike activity — without ever having been told which inputs belong together. The energy cost of both training and inference remains orders of magnitude below a GPU running equivalent workloads.
II. No Training Required — Exploiting Intrinsic Dynamics
Some of the most powerful applications of neuromorphic hardware require no learning phase whatsoever. Instead of training a network to approximate a function, these approaches directly encode a problem into the network's weight structure and let the physical dynamics of the chip find the answer.
Many NP-hard combinatorial optimization problems can be cast as Quadratic Unconstrained Binary Optimization (QUBO): find x ∈ {0,1}ⁿ that minimizes the energy E(x) = xᵀQx. QUBO maps directly to an Ising model — a spin system whose Hamiltonian is exactly E(x) — and the Ising ground state is the solution. The connection is exact, not approximate.
A recurrent SNN with weights W = −Q is dynamically equivalent to a stochastic Ising machine. Membrane noise plays the role of temperature: at high noise the network explores freely; as effective temperature decreases, it settles toward lower-energy configurations. This is physically implemented simulated annealing — the chip's own spike noise drives the annealing schedule. No gradient, no training loop, no labeled data. The Q matrix is programmed directly into the synaptic weight memory, and the chip runs.
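A conventional software analogue of these dynamics is a Metropolis annealer on the QUBO energy, with temperature standing in for membrane noise. The sketch below (all names, the cooling schedule, and the toy max-cut instance are illustrative) finds the ground state of a 3-node triangle graph, where any 2-vs-1 partition cuts 2 edges:

```python
import math, random

# Software analogue of stochastic annealing on a QUBO energy landscape.
# Names, cooling schedule, and the toy instance are illustrative.

def qubo_energy(Q, x):
    n = len(x)
    return sum(Q[i][j] * x[i] * x[j] for i in range(n) for j in range(n))

def anneal(Q, steps=2000, t_start=2.0, t_end=0.01, seed=1):
    rng = random.Random(seed)
    n = len(Q)
    x = [rng.randint(0, 1) for _ in range(n)]
    for step in range(steps):
        temp = t_start * (t_end / t_start) ** (step / steps)   # geometric cooling
        i = rng.randrange(n)
        flipped = x[:]
        flipped[i] ^= 1
        d_e = qubo_energy(Q, flipped) - qubo_energy(Q, x)
        if d_e <= 0 or rng.random() < math.exp(-d_e / temp):   # Metropolis rule
            x = flipped                    # high temp: explore; low temp: settle
    return x

# Max-cut on a triangle encoded as QUBO: E(x) = -(cut size).
# Diagonal = -degree, off-diagonal = +1 per edge (counted twice by x^T Q x).
Q = [[-2, 1, 1], [1, -2, 1], [1, 1, -2]]
solution = anneal(Q)                       # any 2-vs-1 split, energy -2
```

On neuromorphic hardware the Metropolis loop disappears: the chip's own spike noise supplies the stochastic flips, and the cooling schedule is realized by reducing injected noise over time.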
Loihi 2 implementations have solved graph coloring, maximum cut, and traveling salesman instances at sub-milliwatt power. Critically, recent benchmarks place neuromorphic annealers competitively with quantum annealers (D-Wave) on certain problem classes — at a fraction of the cost, at room temperature, and without the coherence maintenance overhead that constrains quantum hardware. This reframes neuromorphic hardware not as a neural network accelerator but as a physics-based combinatorial optimizer.
Dynamic Vision Sensors (DVS) output a stream of asynchronous spike events — one per pixel per polarity change in brightness — rather than frame-synchronized images. This is retinal computation: only pixels that change fire, yielding microsecond temporal resolution with near-zero data rate for static scenes. No frame buffer, no synchronization overhead, no wasted computation on unchanged regions.
DVS output is natively spike-coded, making these sensors the natural front-end for neuromorphic processing pipelines. Downstream SNN layers can respond to the event stream with matched event-driven compute — the entire signal chain from photon to motor command can remain asynchronous and sparse. The N-MNIST and DVS-CIFAR10 benchmarks used to evaluate SNN classifiers were recorded with DVS cameras for exactly this reason: the benchmark is already in the native format of the hardware.
Robotic sensorimotor loops demand low latency and low power simultaneously — requirements that favor neuromorphic computation. Event-driven processing of DVS input, proprioceptive spike streams, and spiking motor commands can close a full sensorimotor loop in microseconds at microwatt steady-state power, with no clock cycle wasted on quiescent sensors. Research platforms including Intel's Kapoho Bay have demonstrated neuromorphic balance control and obstacle avoidance at power budgets orders of magnitude below conventional embedded processors.
The always-on, event-driven model is particularly compelling for keyword spotting and anomaly detection: the neuromorphic processor remains essentially dormant at near-zero power until an input spike pattern matches a stored template, then triggers a wakeup event. A conventional processor must run its detection algorithm on every clock cycle regardless of whether anything is happening — a fundamental architectural mismatch with the statistics of most real-world sensory streams.
III. Supervised Learning — Possible, But Harder
SNNs can also tackle the same classification, regression, and sequence-modeling tasks that conventional ANNs dominate — but this requires working against the grain of the spike mechanism. The extra engineering effort can be justified by inference-time energy savings, but the baseline accuracy gap with conventional deep networks is real and should not be understated.
Backpropagation requires differentiating a loss through every nonlinearity in the network. In a conventional ANN, each neuron applies a smooth nonlinearity (sigmoid, ReLU) — differentiable everywhere. In an SNN, the activation is spike emission: a Heaviside step function that is zero almost everywhere and has an undefined (distributional) derivative at Vth. Backpropagating through time across spike events yields gradients that are either identically zero or infinite. The standard deep-learning toolkit does not apply.
This is the fundamental barrier. It is not an implementation detail — it follows directly from what a spike is. The field's response has been a collection of workarounds, each sacrificing something different.
Surrogate gradients. Used in: snnTorch, Norse, SpyTorch
Keep the true Heaviside in the forward pass. In the backward pass only, replace its derivative with a smooth approximation — a piecewise-linear bump or sigmoid centered at Vth. The network trains as if the threshold were smooth while still producing genuine spikes at inference. Standard BPTT then applies across all time steps. This works, scales to deep architectures, and achieves competitive accuracy on DVS-based benchmarks. The cost is that BPTT must store the full spike history across time — memory-intensive and non-local, precluding on-chip learning on neuromorphic hardware in this form.
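The trick is easiest to see for a single step. In this sketch (the slope constant, the sigmoid-derivative surrogate, and all names are illustrative — frameworks offer several surrogate shapes), the forward pass emits a true 0/1 spike while the manual backward pass substitutes a smooth derivative:

```python
import math

# One-step illustration of the surrogate-gradient trick: the forward pass
# uses the true Heaviside spike; the backward pass swaps in a smooth
# sigmoid-derivative in place of the undefined Heaviside derivative.
# The slope constant K and all values are illustrative.

V_TH, K = 1.0, 10.0

def spike_forward(v):
    return 1.0 if v >= V_TH else 0.0        # true Heaviside: 0 or 1, nothing between

def spike_backward(v):
    s = 1.0 / (1.0 + math.exp(-K * (v - V_TH)))
    return K * s * (1.0 - s)                # smooth surrogate for dS/dv

# Toy chain: v = w * x, loss = (spike - target)^2, manual backward pass.
w, x, target = 0.9, 1.0, 1.0
v = w * x
s = spike_forward(v)                         # 0.0: just below threshold, no spike
dloss_ds = 2 * (s - target)
dloss_dw = dloss_ds * spike_backward(v) * x  # nonzero thanks to the surrogate
# The true Heaviside derivative at v = 0.9 is 0, so learning would stall without it.
```

Because `v` sits just below threshold, the surrogate gradient is negative, so a gradient-descent step pushes `w` upward toward firing — exactly the learning signal the true derivative (zero almost everywhere) cannot provide.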
ANN-to-SNN conversion. Train a conventional ANN normally, then replace each ReLU with an integrate-and-fire neuron whose mean firing rate approximates the original activation magnitude. Gradient computation is entirely avoided — the SNN is constructed from a pre-trained ANN, not trained as one. The weakness is latency: rate coding requires many timesteps to represent a value accurately, so converted SNNs typically need 100–1000 inference timesteps where an ANN needs one forward pass. This erodes the latency advantage of neuromorphic hardware and discards the temporal dynamics that make SNNs distinctive. Best suited to deploying an existing vision classifier onto low-power neuromorphic inference hardware without retraining from scratch.
E-prop (Bellec et al., 2020) derives an online learning rule by constraining the BPTT gradient to only use locally available information. Each synapse maintains an eligibility trace — a real-time integral of correlated pre/post activity — and a global learning signal broadcast from a supervisor. Weight updates are their product. No backward pass, no stored history, compatible with on-chip STDP microcode on Loihi 2. Performance currently lags BPTT but the locality makes it deployable in hardware learning loops.
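The shape of the rule — local trace times broadcast signal — fits in a few lines. This is a heavily simplified single-synapse sketch (the decay constant, learning rate, and toy signals are illustrative; the actual rule derives these terms from the BPTT gradient):

```python
# Heavily simplified e-prop-style update for one synapse: a locally
# maintained eligibility trace of pre/post coincidence, multiplied by a
# globally broadcast learning signal. Constants and signals are illustrative.

DECAY, LR = 0.9, 0.1

def eprop_step(trace, pre, post_pseudo, learn_signal, w):
    """One online timestep: update the trace locally, then the weight."""
    trace = DECAY * trace + pre * post_pseudo     # eligibility: purely local
    w += LR * learn_signal * trace                # global scalar x local trace
    return trace, w

trace, w = 0.0, 0.5
# Two steps of correlated activity, then a delayed learning signal.
for pre, post, L in [(1, 0.8, 0.0), (1, 0.9, 0.0), (0, 0.2, 1.0)]:
    trace, w = eprop_step(trace, pre, post, L, w)
# The weight moves only when the learning signal arrives, credited via the trace.
```

Nothing here requires stored history or a backward pass: the trace is a running scalar, which is what makes the rule compatible with per-synapse on-chip learning microcode.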
Contrastive Hebbian / Equilibrium Propagation sidesteps differentiation entirely by running two phases: a free phase where the network settles to its own dynamics, and a nudged phase where outputs are weakly clamped toward a target. Synaptic updates are proportional to the difference in local Hebbian correlations between phases. Scellier & Bengio (2017) showed this converges to the true gradient in the limit of infinitesimal nudging — so backpropagation is being computed implicitly by the network's own physics, without an explicit backward pass. Particularly natural for recurrent SNNs whose dynamics already settle into attractor-like states.
IV. Open Questions
The narrative above — neuromorphics are best for unsupervised and training-free tasks, and merely competitive (with extra effort) for supervised tasks — reflects the current state of the field, not a settled conclusion. Several important questions remain open:
When do SNNs outperform ANNs? At equal parameter counts and on tasks with dense, static inputs, conventional ANNs currently win on accuracy. The energy advantage of neuromorphic inference is real but only decisive for sparse, event-driven, or always-on workloads. The regime where the SNN's temporal dynamics genuinely outperform an ANN of equal capacity — not just match it more efficiently — has not been clearly established.
Is the spike the right primitive? The LIF neuron is a simplification of biological neurons that discards dendritic computation, neuromodulation, astrocytic coupling, and many other mechanisms. It is not clear whether the spike itself, rather than the broader computational principles of biological circuits, is what confers the functional advantages neuromorphics seek to exploit.
Can neuromorphic annealers close the gap with quantum hardware at scale? On small-to-medium QUBO instances, neuromorphic annealers are already competitive. Whether the scaling law holds as problem size grows — and whether the effective annealing schedule can be controlled precisely enough — is an active research question.
What is the right theory? A unified theoretical framework explaining what SNNs can represent that ANNs cannot, and vice versa, does not yet exist. The field is engineering-led rather than theory-led, which produces empirical results but makes principled architectural design difficult.