ICML 2026

Equivariant Neural Networks for
General Linear Symmetries on Lie Algebras

Reductive Lie Neurons (ReLNs)

Chankyo Kim1* Sicheng Zhao1* Minghan Zhu1,2 Tzu-Yuan Lin3 Maani Ghaffari1
1University of Michigan  ·  2University of Pennsylvania  ·  3Massachusetts Institute of Technology
*Equal contribution
Lie groups in scientific applications: SL(3), Lorentz SO+(1,3), Sp(n), SPD(3)xR3, GL(3)

One backbone, many symmetries. From image homographies ($\mathrm{SL}(3)$) and spacetime physics (the Lorentz group $\mathrm{SO}^+(1,3)$) to Hamiltonian mechanics ($\mathrm{Sp}(n)$), probabilistic state estimation ($\mathrm{SPD}(3)\times\mathbb{R}^3$), and continuum mechanics ($\mathrm{GL}(3)$) — these symmetries span the spectrum that a single adjoint-equivariant ReLN backbone is designed to address, with no per-subgroup redesign.

TL;DR

Abstract

Many scientific and geometric problems exhibit general linear symmetries, yet most equivariant neural networks are built for compact groups or simple vector features, limiting their reuse on matrix-valued data such as covariances, inertias, or shape tensors. We introduce Reductive Lie Neurons (ReLNs), an exactly $\mathrm{GL}(n)$-equivariant architecture that natively supports matrix-valued and Lie-algebraic features.

ReLNs resolve a central stability issue for reductive Lie algebras by introducing a non-degenerate adjoint (conjugation)-invariant bilinear form, enabling principled nonlinear interactions and invariant feature construction in a single architecture that transfers across subgroups without redesign. We demonstrate ReLNs on native $\mathrm{GL}(n)$ system identification, algebraic tasks with $\mathfrak{sl}(3)$ and $\mathfrak{sp}(4)$ symmetries, uncertainty-aware drone state estimation via joint velocity–covariance processing, learning from 3D Gaussian-splat representations, and an EMLP double-pendulum benchmark spanning multiple symmetry groups. ReLNs consistently match or outperform strong equivariant and self-supervised baselines while using substantially fewer parameters and compute, providing a practical, reusable backbone for learning with broad linear symmetries.

Positioning

A Unified Framework for General Symmetries

Prior equivariant networks are typically specialized for subgroups such as $\mathrm{SO}(n)$, $\mathrm{E}(n)$, or $\mathrm{SL}(n)$. ReLNs target the full general linear group $\mathrm{GL}(n)$ — subsuming these families under one adjoint-equivariant construction.

Taxonomy of equivariant architectures, with ReLNs covering GL(n)

Method

Equivariance by Design

Heterogeneous geometric inputs obey different transformation laws — vectors transform by a left action ($v \mapsto Rv$), while covariances transform by congruence ($\Sigma \mapsto R\Sigma R^\top$). ReLNs embed both into a shared Lie algebra $\mathfrak{gl}(n)$, where they share a single adjoint action $\mathrm{Ad}_g$ that the network $f$ commutes with — equivariance by construction.

Adjoint-equivariance: a unified representation for vectors and covariances

Velocity and covariance are lifted into a common $\mathfrak{gl}(n)$ feature space and transform under the same $\mathrm{Ad}_g$. The ReLN map $f$ commutes with this action: $f(\mathrm{Ad}_g \cdot x) = \mathrm{Ad}_g \cdot f(x)$.

The key ingredient is a non-degenerate, $\mathrm{Ad}$-invariant bilinear form for any reductive matrix algebra (e.g. $\mathfrak{gl}(n)$), which resolves the degeneracy of the classical Killing form and underlies every ReLN gating, normalization, and invariant layer:

$$ B(X, Y) \;=\; 2n\,\operatorname{tr}(XY) \;-\; \operatorname{tr}(X)\operatorname{tr}(Y) $$

Invariant under conjugation by any $g \in \mathrm{GL}(n)$, this form generalizes the trace/Killing form (degenerate on $\mathfrak{gl}(n)$) and yields stable invariant scalars for nonlinearities.

Results

One Backbone Across Physics, Robotics & 3D Vision

A single ReLN backbone transfers across tasks with no per-symmetry redesign. Its advantage is largest where features carry geometric structure that changes with the frame — covariance, uncertainty, and scale — information that semisimple equivariant models discard.

127×
lower trace-recovery error than Lie Neurons on native $\mathrm{GL}(n)$ system ID
Stable
accuracy on 3D Gaussian splats under arbitrary rotations
11.2×
fewer FLOPs/step than EMLP at matched accuracy
27.8×
faster inference than EMLP on Hamiltonian dynamics
Dynamical Systems

Native $\mathrm{GL}(n)$ System Identification

The one task whose symmetry is genuinely the full general linear group. We recover an unknown linear dynamics matrix $A\in\mathfrak{gl}(3)$ of a system $z_{t+1}=Az_t$ from noisy windowed least-squares estimates. Under a latent basis change $z\mapsto Sz$ with $S\in\mathrm{GL}(3)$, both inputs and target transform by similarity $A\mapsto SAS^{-1}$ — the adjoint action on $\mathfrak{gl}(3)$. Crucially, $\mathrm{Tr}(A)$ governs global expansion/contraction and lives in the center, so recovering $A$ needs both the semisimple ideal and the center. Lie Neurons rely on Killing-form-only invariants that are blind to $\mathrm{Tr}(A)$ by construction; ReLN's reductive completion recovers it while staying similarity-equivariant.

Method Test split Trace MSE ↓ Canonical MSE ↓
Baselines
Avg-LSall0.02110.0044
MLPID0.01450.0041
MLPGL0.04890.1480
MLPGL-Hard1.028557.41
Lie Neuronsall1.68500.0658
Reductive Lie Neurons (Ours)
ReLNall 0.01330.0039

Mean over 3 seeds. Trace MSE measures center recovery; Canonical MSE measures intrinsic, similarity-invariant error. Equivariant methods (Avg-LS, Lie Neurons, ReLN) report a single value because they are exactly similarity-equivariant; the non-equivariant MLP is shown across ID (training basis), GL (random $S$), and GL-Hard (ill-conditioned $S$), and collapses under unseen basis changes. Lie Neurons is stable in Canonical MSE but its Trace MSE is two orders of magnitude worse than ReLN ($1.685$ vs. $0.013$). ReLN attains the lowest error on both metrics simultaneously.

Robotics

Uncertainty-Aware Drone State Estimation

Reconstructing 3D trajectories from noisy velocities $\mathbf{v}\in\mathbb{R}^3$ and time-varying uncertainty covariances $C\in\mathrm{SPD}(3)$ requires jointly processing vector and matrix data. ReLNs lift both into a common $\mathfrak{gl}(3)$ representation; the variant using log-covariance is best, and ReLNs are exactly invariant to test-time $\mathrm{SO}(3)$ rotations (identical ID and SO(3) metrics).

Drone trajectory reconstruction: ReLN vs VN and ResNet baselines

On challenging aggressive trajectories, ReLN variants track the ground truth (black), while the ResNet and Vector-Neuron baselines shown here drift.

Model Input In-Distribution SO(3) perturbed
ATE ↓RTE ↓ATE ↓RTE ↓
Non-Equivariant
ResNet$(v, C)$205.11106.07213.26109.37
Equivariant Baselines
VN$v$17.3613.5117.3613.51
VN$(v, C)$191.7898.39190.2298.26
TFN$(v, C)$17.5614.4017.5614.40
TFN$(v, \log C)$16.8313.3416.8313.34
SE(3)-Transformer$(v, C)$21.6716.7721.6716.77
SE(3)-Transformer$(v, \log C)$20.1215.3620.1215.36
Lie Neurons$(v, C)$16.8613.6516.8613.65
Lie Neurons$(v, \log C)$15.6512.0415.6512.04
Reductive Lie Neurons (Ours)
ReLN$v$16.8512.7016.8512.70
ReLN$(v, C)$16.4913.0216.4913.02
ReLN$(v, \log C)$ 13.9211.04 13.9211.04

Absolute / relative trajectory error in meters (lower is better). For a fair comparison, the steerable baselines (TFN, SE(3)-Transformer) and Lie Neurons are given the same log-covariance interface; ReLN still attains the lowest error. ResNet and VN are reported with their native inputs (non-equivariant; eigendecomposition-based covariance interface).

3D Vision

Equivariance for 3D Gaussian Splatting

3D Gaussian splats couple a mean $\mu\in\mathbb{R}^3$ with an anisotropic covariance $\Sigma\in\mathrm{SPD}(3)$ that transform differently under rotation ($R\mu$ vs. $R\Sigma R^\top$). We rebuild the encoder/decoder of a Gaussian masked-autoencoder with ReLN blocks to enforce $\mathrm{GL}(3)$-equivariance. The result: classification accuracy on ModelNet10 stays stable under arbitrary rotations where the baseline collapses, with faster convergence and lower reconstruction error across every attribute.

ReLN-integrated Gaussian-MAE framework

The ReLN-integrated Gaussian-MAE lifts raw splats into $\mathfrak{gl}(3)$, processes active geometry ($\mu,\Sigma$) through a $\mathrm{GL}(3)$-equivariant encoder/decoder, and reconstructs via the Vee map and bilinear form.

MethodStandard ↑Rotated ↑
Gaussian-MAE93.3918.28
ReLN (Ours)94.8295.15

ModelNet10 accuracy (%). The baseline drops to 18% under random rotations; ReLN stays at 95%.

Pre-training convergence: ReLN vs baseline across metrics

Pre-training on ShapeNet: ReLN (red) reaches lower reconstruction error than the baseline across rotation, scale, density, SH, and Chamfer distance.

Physics & Efficiency

EMLP Double-Pendulum: Same Accuracy, Far Less Compute

On the EMLP double-pendulum benchmark (Hamiltonian dynamics under $O(2)$, $SO(2)$, $D_6$), ReLN matches or beats EMLP without any group-specific architecture changes, while running an order of magnitude cheaper. ReLN uses closed-form matrix operations from exact $\mathrm{Ad}$-equivariant primitives, whereas EMLP relies on symmetry-dependent basis construction.

Rollout Error ↓EMLPReLN
$O(2)$0.0120.011
$SO(2)$0.0150.010
$D_6$0.0130.011
Cost / stepEMLPReLN
FLOPs1,589,909142,190
Inference (ms)61.642.22

ReLN: 11.2× fewer FLOPs and 27.8× faster inference than EMLP at matched accuracy.

Foundations

Algebraic Benchmarks — $\mathfrak{sl}(3)$ & $\mathfrak{sp}(4)$

As a sanity check that the general $\mathfrak{gl}(n)$ construction specializes correctly to semisimple subalgebras, ReLN matches the specialized Lie Neurons on Platonic-solid classification ($\mathfrak{sl}(3)$, near-perfect rotated-camera accuracy) and on $\mathfrak{sp}(4)$ invariant-function regression (low MSE and near-zero invariance error) — while keeping a single, unified architecture across all algebras.

BibTeX

@inproceedings{kim2026equivariant,
  title     = {Equivariant Neural Networks for General Linear Symmetries on Lie Algebras},
  author    = {Kim, Chankyo and Zhao, Sicheng and Zhu, Minghan and Lin, Tzu-Yuan and Ghaffari, Maani},
  booktitle = {Forty-third International Conference on Machine Learning},
  year      = {2026}
}