category_theory_machine_learning
List of papers studying machine learning through the lens of category theory
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.9%) to scientific vocabulary
Keywords
Repository
List of papers studying machine learning through the lens of category theory
Basic Info
Statistics
- Stars: 1,409
- Watchers: 75
- Forks: 84
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Category Theory ∩ Machine Learning
Category theory has been finding increasing applications in machine learning. This repository aims to list all of the relevant papers, grouped by fields.
For an introduction to the ideas behind category theory, check out this link.

There might be papers missing, and some papers are in multiple fields. Feel free to contribute to this list - preferably by creating a pull request.
Theses
- Fundamental Components of Deep Learning: A category-theoretic approach
- Robust Diagrams for Deep Learning Architectures: Applications and Theory
- Category-Theoretic Datastructures and Algorithms for Learning Polynomial Circuits
- Category Theory for Quantum Natural Language Processing
- Towards a Categorical Foundation of Deep Learning: A Survey
General Deep Learning
- Categorical Foundations of Gradient-Based Learning
- Position: Categorical Deep Learning is an Algebraic Theory of All Architectures
- Position: Topological Deep Learning is the New Frontier for Relational Learning
- Backprop as Functor
- Lenses and Learners
- Reverse Derivative Ascent
- Dioptics
- Learning Functors using Gradient Descent (longer version here)
- Compositionality for Recursive Neural Networks
- Deep neural networks as nested dynamical systems
- Category Theory in Machine Learning
- Neural network layers as parametric spans
- Categories of Differentiable Polynomial Circuits for Machine Learning
- Learners are almost Free Compact Closed
- Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory
- Attending to Topological Spaces: The Cellular Transformer
- On the Anatomy of Attention
- Algebraic Dynamical Systems in Machine Learning
- Can neural operators always be continuously discretized?
- Order Theory in the Context of Machine Learning: an application
- An Invitation to Neuroalgebraic Geometry
- Algebraic Positional Encodings
- Learning Structure-Aware Representations of Dependent Types
- Accelerating Machine Learning Systems via Category Theory: Applications to Spherical Attention for Gene Regulatory Networks
- Copresheaf Topological Neural Networks: A Generalized Deep Learning Framework
Equivariance
- Equivariant neural networks and piecewise linear representation theory
- Local Permutation Equivariance For Graph Neural Networks
- Stochastic Neural Network Symmetrisation in Markov Categories
- Metric Learning for Clifford Group Equivariant Neural Networks
- Categorification of Group Equivariant Neural Networks
- Equivariant Single View Pose Prediction Via Induced and Restricted Representations
- Characterizing the invariances of learning algorithms using category theory
- Mathematical Foundation of Interpretable Equivariant Surrogate Models
- Filter Equivariant Functions: A symmetric account of length-general extrapolation on lists
- Relational inductive biases on attention mechanisms
Graph Neural Networks
- Graph Neural Networks are Dynamic Programmers
- Natural Graph Networks
- Sheaf Representation Learning
- Sheaf Neural Networks
- Sheaf Neural Networks with Connection Laplacians
- Neural Sheaf Diffusion: A Topological Perspective on Heterophily and Oversmoothing in GNNs
- Nonlinear Sheaf Diffusion in Graph Neural Networks
- Graph Convolutional Neural Networks as Parametric CoKleisli morphisms
- Learnable Commutative Monoids for Graph Neural Networks
- Sheaf Neural Networks for Graph-based Recommender Systems
- Sheaf theory: from deep geometry to deep learning
- Asynchronous Algorithmic Alignment with Cocycles
- Topologically Attributed Graphs for Shape Discrimination
- Grothendieck Graph Neural Networks Framework: An Algebraic Platform for Crafting Topology-Aware GNNs
- Don't be Afraid of Cell Complexes! An Introduction from an Applied Perspective
- Graph Lineages and Skeletal Graph Products
Differentiable programming / automatic differentiation
- Functorial String Diagrams for Reverse-Mode Automatic Differentiation
- Differentiable Causal Computations via Delayed Trace
- Simple Essence of Automatic Differentiation
- Reverse Derivative Categories
- Towards formalizing and extending differential programming using tangent categories
- Correctness of Automatic Differentiation via Diffeologies and Categorical Gluing
- Denotationally Correct, Purely Functional, Efficient Reverse-mode Automatic Differentiation
- Higher Order Automatic Differentiation of Higher Order Functions
- Space-time tradeoffs of lenses and optics via higher category theory
- Using Rewrite Strategies for Efficient Functional Automatic Differentiation
Probability theory
- Markov categories
- Markov Categories and Entropy
- Infinite products and zero-one laws in categorical probability
- A Convenient Category for Higher-Order Probability Theory
- Bimonoidal Structure of Probability Monads
- Representable Markov Categories and Comparison of Statistical Experiments in Categorical Probability
- De Finneti's construction as a categorical limit
- A Probability Monad as the Colimit of Spaces of Finite Samples
- A Probabilistic Dependent Type System based on Non-Deterministic Beta Reduction
- Probability, valuations, hyperspace: Three monads on Top and the support as a morphism
- Categorical Probability Theory
- Information structures and their cohomology
- Computable Stochastic Processes
- Compositional Semantics for Probabilistic Programs with Exact Conditioning
- String Diagrams with Factorized Densities
- Partial Markov Categories
- Random Variables, Conditional Independence and Categories of Abstract Sample Spaces
Bayesian/Causal inference
- The Compositional Structure of Bayesian Inference
- Dependent Bayesian Lenses: Categories of Bidirectional Markov Kernels with Canonical Bayesian Inversion
- A category theory framework for Bayesian Learning
- Causal Theories: A Categorical Perspective on Bayesian Networks
- Bayesian machine learning via category theory
- A Categorical Foundation for Bayesian probability
- Bayesian Open Games
- Causal Inference by String Diagram Surgery
- Disintegration and Bayesian Inversion via String Diagrams
- Categorical Stochastic Processes and Likelihood
- Bayesian Updates Compose Optically
- Automatic Backward Filtering Forward Guiding for Markov processes and graphical models
- Compositionality in algorithms for smoothing
- A Channel-Based Perspective on Conjugate Priors
- A Type Theory for Probabilistic and Bayesian Reasoning
- Denotational validation of higher-order Bayesian inference
- The Geometry of Bayesian Programming
- Relational Reasoning for Markov Chains in a Probabilistic Guarded Lambda Calculus
- A Bayesian Interpretation of the Internal Model Principle
- On the Functoriality of Belief Propagation Algorithms on Finite Partially Ordered Sets
Topological Data Analysis
- On Characterizing the Capacity of Neural Networks using Algebraic Topology
- Persistent-Homology-based Machine Learning and its Applications - A Survey
- Topological Expressiveness of Neural Networks
Metric space magnitude
- Approximating the convex hull via metric space magnitude
- Practical applications of metric space magnitude and weighting vectors
- Weighting vectors for machine learning: numerical harmonic analysis applied to boundary detection
- The magnitude vector of images
- Magnitude of arithmetic scalar and matrix categories
- Metric Space Magnitude for Evaluating Unsupervised Representation Learning
- The Magnitude of Categories of Texts Enriched by Language Models
Blog posts
- Neural Networks, Types, and Functional Programming
- Towards Categorical Foundations of Learning
- Graph Convolutional Neural Networks as Parametric CoKleisli morphisms
- Optics vs Lenses, Operationally
- Meta-learning and Monads
Automata Learning
- Automata Learning: A Categorical Perspective
- A Categorical Framework for Learning Generalised Tree Automata
- CALF: Categorical Automata Learning Framework (thesis)
Misc
- Generalized Convolution and Efficient Language Recognition
- General supervised learning as change propagation with delta lenses
- From Open Learners to Open Games
- Learners Languages
- A Constructive, Type-Theoretic Approach to Regression via Global Optimisation
- Functorial Manifold Learning
- Diegetic representation of feedback in open games
- Assessing the Unitary RNN as an End-to-End Compositional Model of Syntax
- Classifying Clustering Schemes
- Categorical Hopfield Networks
- A Category-theoretical Meta-analysis of Definitions of Disentanglement
- Isomorphism, Normalizing Flows, and Density Estimation: Preserving Relationships Between Data
- Transport of Algebraic Structure to Latent Embeddings
- Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning
- Categorical Diffusion of Weighted Lattices
- Aggregating time-series and image data: functors and double functors
- Logic Explanation of AI Classifiers by Categorical Explaining Functors
- Typing tensor calculus in 2-categories
- The Joys of Categorical Conformal Prediction
- The Gauss-Markov Adjunction: Categorical Semantics of Residuals in Supervised Learning
Owner
- Name: Bruno Gavranović
- Login: bgavran
- Kind: user
- Location: London, United Kingdom
- Company: Symbolica
- Website: www.brunogavranovic.com
- Repositories: 26
- Profile: https://github.com/bgavran
Principal Scientist - Categorical Deep Learning @symbolica-ai
Citation (CITATION.cff)
cff-version: 1.2.0
message: If you use this repository in your research, please cite it
title: Category Theory ∩ Machine Learning
abstract: This repository provides a comprehensive list of papers at the intersection of category theory and machine learning, grouped by fields.
authors:
- family-names: Gavranović
given-names: Bruno
orcid: https://orcid.org/0000-0002-6069-5727
version: 1.0.0
date-released: 2020-07-09
repository-code: https://github.com/bgavran/Category_Theory_Machine_Learning
keywords:
- Category Theory
- Machine Learning
- Deep Learning
- Graph Neural Networks
- Differentiable Programming
- Probability Theory
- Bayesian/Causal Inference
- Topological Data Analysis
- Metric Space Magnitude
- Automata Learning
GitHub Events
Total
- Watch event: 144
- Push event: 14
- Pull request event: 1
- Fork event: 12
Last Year
- Watch event: 144
- Push event: 14
- Pull request event: 1
- Fork event: 12
Committers
Last synced: 10 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Bruno Gavranović | b****o@b****m | 64 |
| Bruno Gavranović | b****c@f****r | 27 |
| edwin1729 | 1****9 | 1 |
| Riccardo Ali | 4****i | 1 |
| Darren Wilkinson | d****n@b****m | 1 |
| Cristian Bodnar | c****6@g****m | 1 |
| Bryan Bischof | b****f@g****m | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 7 months ago
All Time
- Total issues: 0
- Total pull requests: 6
- Average time to close issues: N/A
- Average time to close pull requests: about 1 month
- Total issue authors: 0
- Total pull request authors: 6
- Average comments per issue: 0
- Average comments per pull request: 0.83
- Merged pull requests: 4
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- hans-riess (2)
- rick-ali (2)
- ferzcam (1)
- BBischof (1)
- crisbodnar (1)
- edwin1729 (1)
- darrenjw (1)