GitHub
nkpc_estimation
Final Project for 'Effective programming practices for economists' (University of Bonn, winter term 22/23)
rcc.billing
Automated, data-driven service billing implemented on REDCap Custodian
floss-toolbox
A toolbox to help developers and open source referents to not waste their time with manual and boring tasks. Provides simple and light tools to make investigations in source code to look for hot data. Provides also primitives to manage GitHub and GitLab organizations.
multisyn
Drug Combination Prediction, Anti-cancer joint prediction algorithm based on multi-source data
privacy-policies-as-data
This repository explores text mining analysis of privacy policies.
machine-translation-nlp-african-languages-v2
Explore machine translation and NLP tools designed for African languages in this research. Join us in advancing language technology for underrepresented languages! 🛠️🌍
gnn_tracking
Reconstruct billions of particle trajectories with graph neural networks
propulate
Propulate is an asynchronous population-based optimization algorithm and software package for global optimization and hyperparameter search on high-performance computers.
pricepoints
Price Points research, data analyses, and packages. Informational only, not official Turquoise Health products
training-load-review-data
Systematic review of the training load and injury risk field [dataset]
11521-adv-geo-comp
Syllabus and materials for redesigned 11.521, Advanced Geographic Computation for Urban Planning
2025-06-16-strathclyde
Carpentries website for presentation in SIPBS, June 2025
https://github.com/jageo/tutorialatomate2forcefields
Tutorial to learn basic features of atomate2
theodolite
Theodolite is a framework for benchmarking the horizontal and vertical scalability of cloud-native applications.
exam_forecast
Companion analysis to a paper published in the Journal of Digital Imaging.
openmind
I use a simple model to investigate the epistemic value of open-mindedness. This repository contains the code for a model of the epistemic value of open-mindedness and for producing some figures.
https://github.com/jageo/pymatgen
Python Materials Genomics (pymatgen) is a robust materials analysis code that defines core object representations for structures and molecules with support for many electronic structure codes. It is currently the core analysis code powering the Materials Project.
axtreme
Development repo for the RaPiD project with extensions for Ax and BoTorch.
architxt
ArchiTXT is an open source Python library that transforms unstructured text into structured, searchable, and AI-ready data. It enables automated database generation and seamless data integration.
gemelos-digitales-proyecto-final-escalante-21212151-
Proyecto Final: Proceso de inflamación (3 poblaciones)
abc
Code accompanying the publication "Enhancing Bacterial Adhesion with Hydro-Softened Chitosan Films"
fastqrepair
A pipeline that can be used to recover corrupted FASTQ.gz files, drop or fix uncompliant reads, remove unpaired reads, and settles reads that became disordered
detaxizer
A pipeline to identify (and remove) certain sequences from raw genomic data. Default taxon to identify (and remove) is Homo sapiens. Removal is optional.
bayes-sampler-comparison
A comparison investigation of water sampling methods using Bayesian inference modeling as part of a 2023 study by the CSU Agricultural Water Quality Program
enolex
A repository of R codes and curated dataset for a Shiny web database titled "EnoLEX, a diachronic lexical database for the Enggano language". Access the database via the link below 👇.
reproducible-research-with-gpu-jupyter
This repository demonstrates how to use GPU-Jupyter for reproducible deep learning research with minimal setup effort..
sammyseq
Pipeline for Sequential Analysis of MacroMolecules accessibilitY sequencing (SAMMY-seq) data, to analyze chromatin state.
replicability-presentation-2022
Presentation on Lessons Learned after 1000 articles
splineomics
Timeseries analysis of -omics data can be carried out by fitting spline curves to the data and using limma for hypothesis testing. The obtained hits can be clustered based on the spline shape and then used for an overrepresentation (ORA) analysis.The R package SplineOmics streamlines this whole process and generates reports
crowdtangle-api-schema
Simple ontological representation of the CrowdTangle API to be used in a Tropy template