GitHub
reproducible-ml-workflows
Reproducible Machine Learning Workflows for Scientists
https://github.com/alleninstitute/tasic2016data
Single cell transcriptomic data from Tasic, et al. (2016)
canada-labour-research-assistant
The Canada Labour Research Assistant (CLaRA) is a privacy-first LLM-powered RAG AI assistant proposing Easily Verifiable Direct Quotations (EVDQ) to mitigate hallucinations in answering questions about Canadian labour laws, standards, and regulations. It works entirely offline and locally, guaranteeing the confidentiality of your conversations.
organic-fertilizers-detection
Detection of organic fertilizers (manure) application on crop fields leveraging satellite data and Machine Learning.
mc-crawler
A MobileCoin network crawler. Corresponding preprint available on arXiv (https://arxiv.org/pdf/2111.12364.pdf).
aptv2
The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://github.com/pandorgan/APT-36K
pyrandtoys
A python module for generating the result of probability-based toys. Installable on any system with pip command.
huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
drone-data-in-agricultural-research
Lessons on working with multispectral drone based data in agricultural research settings
cubar
R Package for Codon Usage Bias Analysis. Comprehensive documentation and tutorials are available at:
baler-compressor
Repository of Baler, a machine learning based data compression tool
dcache
dCache - a system for storing and retrieving huge amounts of data, distributed among a large number of heterogenous server nodes, under a single virtual filesystem tree with a variety of standard access methods
pyPLNmodels
pyPLNmodels: A Python package to analyze multivariate high-dimensional count data - Published in JOSS (2024)
tf2deepfloorplan
TF2 Deep FloorPlan Recognition using a Multi-task Network with Room-boundary-Guided Attention. Enable tensorboard, quantization, flask, tflite, docker, github actions and google colab.
pysodmetrics
PySODMetrics: A Simple and Efficient Implementation of Grayscale/Binary Segmentation Metrcis
jreferral
An open-source tool that recommends the most energy efficient JVM configuration for java software
cbdc-offline-sim
A CBDC simulation framework, designed to simulate an offline implementation with a barabasi albert graph and fradulent nodes. Markov processes are used for up and down states as well as transaction frequency.
nerf-us
Official code for NeRF-US: Removing Ultrasound Imaging Artifacts from Neural Radiance Fields in the Wild
piptools-sync
A pre-commit plugin that syncs the pre-commit package versions with the versions generated by pip-tools
dynaface
Dynaface is both a computer application and Python library to measure facial symmetry utilizing advanced AI for both symmetric and asymmetric faces.
ec-finder
EcFinder: Python library and script to fuzzy search an enzyme EC number from an enzyme name
docling4j
Docling4j brings the functionalities of Docling in document understanding to Java® projects
micrometaexplorer
Micro-Meta Explorer is used to compare and explore different Micro-Meta App generated microscope hardware JSON files
multiagenttrajectoryplanning
Experiments of the "Multi-Agent Trajectory Planning with NUV Priors" paper
prediction-of-pcos-using-optimized-ml-classifiers
Official Code Files For My Research Paper Presented at 3rd International Conference on Artificial Intelligence: Advances and Applications (ICAIAA 2022). Paper Published in Springer Book Series, "Algorithms for Intelligent Systems (AIS)".
https://github.com/althonos/pubchem.rs
Rust data structures and client for the PubChem REST API
markdown-include-variants
Markdown extension to expand directives to include source example files to also include their variants. Only useful to tiangolo's projets. Don't use it. 😅
m2g
NeuroData's MRI to Graphs (m2g) - connectome estimation package and pipeline
https://github.com/compvis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
papers
Contains the code for the paper "Multi-Horizon Short-Term Load Forecasting Using Hybrid of LSTM and Modified Split Convolution"
cbgp-lite
Code Building Genetic Programming for a narrow subset of the Clojure language.
eurocropsml
EuroCropsML is a ready-to-use benchmark dataset for few-shot crop type classification using Sentinel-2 imagery.
ixpeobssim
Simulation and analysis framework for the Imaging X-ray Polarimetry Explorer
https://github.com/biojulia/xam.jl
Parse and process SAM and BAM formatted files
mifa
An R package providing multiple Imputation of covariance matrices in order to perform factor analysis.
architecture-decision-record
Architecture decision record (ADR) examples for software planning, IT leadership, and template documentation
smoppix
Single-molecule spatial omics models analyzed through the probabilistic index
optimization
Reproducible results for the various types of optimization techniques I have implemented
blas-base-sdsdot
Calculate the dot product of two single-precision floating-point vectors with extended accumulation.
ai-assisted-protocol-analysis-in-design-research
Code repository for the paper "Towards AI-Assisted Protocol Analysis in Design Research: Automating Question Labeling with GPT-4 According to Eris' (2004) Taxonomy." Presented at Design Computing and Cognition’24
mantra-postprocessing
Scripts for postprocessing data generated with the "MountAiN glacier Transient snowline Retrieval Algorithm".
stats-base-snanvariancewd
Calculate the variance of a single-precision floating-point strided array ignoring NaN values and using Welford's algorithm.
fair-workflow-platform
A workflow platform using machine-actionable RO-Crates and FAIR signposting
distmsmatch
A MPI-distributed version of MSMatch integrating the PASEOS framework for physical modelling
qspace_deep_learning
Implementation codes for "Jointly estimating parametric maps of multiple diffusion models"
smesh
A simple Fortran package for generating and handling unstructured triangular and polygonal meshes
workshop_data_reuse
Data reuse saves time and accelerates the pace of scientific discovery. But how do you actually reuse the data that you put in repositories? This is a 4 hour workshop tailored to the ocean sciences. We aim to teach the basics of data reuse using ERDDAP servers maintained by ocean science specific repo's and programs and pulling these data into the Python environment.
twitter-graph
Fetch and visualize the graph of your Twitter friends and followers.
TIGERr
Technical variation elimination for metabolomics data using ensemble learning architecture
mixmod
Supervised, unsupervised and semi-supervised classification with mixture modelling