TrackerControl
TrackerControl: Transparency and Choice around App Tracking - Published in JOSS (2022)
Metasyn
Metasyn: Transparent Generation of Synthetic Tabular Data with Privacy Guarantees - Published in JOSS (2025)
arx
ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.
opendp
The core library of differential privacy algorithms powering the OpenDP Project.
privacy-meter
Privacy Meter: An open-source library to audit data privacy in statistical and machine learning algorithms.
acro
Tools for the Semi-Automatic Checking of Research Outputs. These are tools for researchers to use as drop-in replacements for common analysis commands.
pfl
Simulation framework for accelerating research in Private Federated Learning
qrlew
dione
Dione is an anonymize and encrypted messaging system build on top on a peer to peer layer.
privacy-pioneer
Privacy browser extension for analyzing web traffic of visited websites
https://github.com/cdcgov/covid_case_privacy_review
Privacy review and statistical disclosure control methods for covid public case data.
black-mirror
Blacklists and whitelists built by open code, so you know what goes into them.
https://github.com/zama-ai/concrete-ml
Concrete ML: Privacy Preserving ML framework using Fully Homomorphic Encryption (FHE), built on top of Concrete, with bindings to traditional ML frameworks.
volkszaehler.org
Open Source Smart Meter with focus on privacy - you remain the master of your data.
https://github.com/hashbite/consent-manager
Examples: https://github.com/hashbite/consent-manager-examples
https://github.com/claromes/socialswitch
Browser extension to redirect Instagram and TikTok URLs to anonymous viewers
https://github.com/caltechlibrary/trinomial
A very simple name anonymization library
private-pgd
Implementation for the paper "Privacy-preserving data release leveraging optimal transport and particle gradient descent"
https://github.com/ai-sdc/sumit
SUMiT: Statistical Uncertainty Management Toolkit
https://github.com/copyleftdev/mailsentinel
AI-powered Gmail classification system using local Ollama LLM inference. Privacy-first email triage with modular YAML profiles, cryptographic audit trails, and enterprise-grade security.
dp-opt
[ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
marich
Marich is a model-agnostic extraction algorithm. It uses a public data to query a private model, aggregates the predicted labels, and construct a distributionall equivalent/max-information leaking extracted model.
https://github.com/better-wealth/better-anonymity
Better anonymity configurations.
aivpn
The AI VPN provides an security assessment of VPN clients' network traffic to identify cyber security threats.
sacro-ml
Collection of tools and resources for managing the statistical disclosure control of trained machine learning models
https://github.com/holistic-ai/holisticai
This is an open-source tool to assess and improve the trustworthiness of AI systems.
raspberry-pi-network-setup
🍓️🖥️📔️ My personal and professional Raspberry Pi setups, along with my Raspberry Pi daily blog.
https://github.com/ai-sdc/acro-r
ACRO R Package: Tools for the Semi-Automatic Checking of Research Outputs.
https://github.com/scimorph/secureml
Easy-to-use utilities to build privacy-preserving AI.
reg_breach
Have I Been Pwned? Yes. Evidence from HIBP and Emails From Voter Registration Files.
treating-locomotor
Pilot e-learning tool for Treating Locomotor disease: developed for blended learning.
ganonymization
A GAN-based Face Anonymization Framework for Preserving Emotional Expressions
dataprivacyhandbook
Repository for the Data Privacy Handbook @UtrechtUniversity, a practical guide on handling personal data in scientific research
https://github.com/prismelabs/analytics
High-perfomance, self-hosted and privacy-focused web analytics service.
https://github.com/cdcgov/template
Template repository with rules, practices, and privacy, license, records notices to help people use the CDCgov GitHub organization.
https://github.com/awslabs/amazon-s3-find-and-forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)