GitHub
whisperxtranscription4researchers
This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).
duracketal25-forcingworkshopsummary
A meeting report, summarizing key takeaways from the October 2025 workshop "Pathway to regular and sustained delivery of climate forcing datasets"
notiflyer
an inspiring and refreshing new way of sending KPIs, alerts, or simple tabular data as emails
django2pydantic
Django2pydantic is the most complete library for converting Django ORM models to Pydantic models
robincar
ROBust INference for Covariate Adjustment in Randomized clinical trials
potjansdiesmann2014
Large scale model of cortical network from Potjans and Diesmann, 2014
hpc-springschool
Git training during HPC spring school using MSc code for plotting statistical data
likelihoodbasedprofilewiseanalysis.jl
Julia package which implements and explores the Likelihood-based Profile Wise Analysis workflow for uncertainty quantification
a-classical-proof-of-the-riemann-hypothesis-via-self-adjoint-spectral-theory
Formal repository for “A Classical Proof of the Riemann Hypothesis via Self-Adjoint Spectral Theory” by Lawrence Ip. Contains manuscript and philosophical context via the Spectral Unity Theorem.
sars-cov-2-mpro-drug-discovery
A complete virtual screening + QSAR modeling + ADME profiling project
security-datasets-for-testing
A set of security datasets for testing of tools and algorithms
euromix-to-sbml
Antimony/SBML re-implementation of the EuroMix Generic PBK model.
nfdi4ingscientificworkflowrequirements
This is a repository to discuss, collect and store the requirements for scientific workflow systems
https://github.com/barahona-research-group/mcf
Code for the paper "Analysing Multiscale Clusterings with Persistent Homology" by Juni Schindler and Mauricio Barahona.
awesome-story-generation
This repository collects an extensive list of awesome papers about Story Generation / Storytelling, exclusively focusing on the era of Large Language Models (LLMs).
psmi
An implementation of Pointwise Sliced Mutual Information (PSMI) for machine learning
nedextract
Extract information on persons and organisation from Dutch PDF files
hermes
HERMES is a publicly available computational framework for the line of sight integration over galactic radiative processes which creates sky maps in the HEALPix-compatibile format.
spin-dependent-5th-force-limits
This dataset compiles results from existing exotic boson searches, with a specific focus on the fifth force—particularly the spin-dependent fifth force (SDFF). It is intended as a live platform to provide researchers in the field with convenient access to relevant data.
jaipur
A short, introduction to using Python and Jupyter to interact with geo-data.
https://github.com/dfornika/amrhike
Proof-of-concept for storing and querying harmonized AMR Genomic Analysis Results in datahike
qasports-dataset-scripts
Scripts used to generate the (Question-Answering) QASports2 Dataset
bacqc
bacQC is a bioinformatics analysis pipeline for assessing the quality of bacterial isolate sequence data
open-science-checklists
Checklists produced by the Open Science Team at SciLifeLab Sweden for guidance on compliance with FAIR principles.
yolov3-tilapia-fish-freshness-evaluation-system
Updated fork of YOLOv3 and Grabcut System for Tilapia Gill Identification
makefile-support-for-help
Supporting 'help' as a target within Makefiles for self-documentation
cycling-kinematic
Python tool for the analysis of lower limb 3D kinematics during cycling across multiple power levels. Designed for motion capture recordings from Contemplas™ or similar systems.
cpg0017-rohban-pathways-data
Image based profiles regenerated for cpg0017-rohban-pathways (TA-ORF) using 2023 workflow. As found in the cellpainting-gallery.
npl
The NanoParticleLibrary (NPL) is a Python library for simulating and optimizing nanoparticle structures, built on ASE.
langchain-chatbot
AI Chatbot for analyzing/extracting information from data in conversational format.
information_disclosure_searchtools
A Python tool developed by Hafid Fajar, co-founder of Garuda Cyberthreat. It's specifically designed for full-power information disclosure vulnerability scanning, quickly and efficiently uncovering exposed sensitive data.
pace_oci_conversion
R scripts for converting PACE LandVI data from NetCDF to GeoTIF
eVAS
eVAS: A user-friendly electronic Visual Analogue Scale - Published in JOSS (2025)
skder
skDER & CiDDER: efficient & high-resolution dereplication of microbial genomes.
carbon-aware-nextflow-on-k8s
Run carbon-aware Nextflow pipelines on Kubernetes using real-time ElectricityMap data to reduce emissions from compute-heavy tasks.
@hugoalh/range-iterator
An ECMAScript (JavaScript & TypeScript) module to iterate between range.
stats-strided-wasm-dmeanors
Calculate the arithmetic mean of a double-precision floating-point strided array using ordinary recursive summation.
transform-emr
This model is a decoder transformer based model aiming to model events predictions from EMR records as a sequential text generation problem. This project is a part of my thesis research.
dynamic-consent-management
Dynamic Consent Management of Speakers in Voice Assistant Systems
ucd-agg-website
Laboratory website for the University College Dublin Animal Genomics Group (UCD-AGG)
constructive-konig
Coq artifact for "Constructive substitutes for König's lemma"