GitHub
https://github.com/doi-usgs/climate-fish-habitat
Shifts in fish habitat under climate change
https://github.com/doi-usgs/ds-pipelines-targets-3-course
Many-task pipelines using targets
https://github.com/doi-usgs/lake-temperature-lstm-static
Predict lake temperatures at depth using static lake attributes
https://github.com/doi-usgs/lake-temperature-out
outputs and summaries from lake modeling pipelines
https://github.com/ivis-at-bilkent/cytoscape.js-graphml
A Cytoscape.js extension to import from and export to GraphML format
https://github.com/doi-usgs/ds-pipelines-2-course
Big Collaborative Pipelines: using a shared cache, creating task tables, and implementing the split-apply and split-apply-combine paradigms
https://github.com/kul-optec/abstractoperators.jl
Abstract operators for large scale optimization in Julia
https://github.com/hidadeng/wordexpansion
使用SO_PMI互信息算法、词向量法快速构建不同领域(手机、汽车等)的专业情感词典
https://github.com/doi-usgs/habs-proxies-forecast-chl
Collection of code from the HABs Proxies group to forecast chl-a for Ecological Forecasting Challenge 2022
https://github.com/kundajelab/genomelake
Simple and efficient access to genomic data for deep learning models.
https://github.com/intake/intake
Intake is a lightweight package for finding, investigating, loading and disseminating data.
https://github.com/doi-usgs/lake-temperature-model-validation
Workflow to compare modeled to observed lake temperatures
https://github.com/google-research/recsim_ng
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
https://github.com/devidw/tabgod
execute any javascript on any chromium tabs - cross-tab parallel execution
https://github.com/doi-usgs/normal_mode_gsn
Codes for the GSN normal mode paper
https://github.com/doi-usgs/lake-temperature-missouri-models
GLM process model pipeline for 8 priority reservoirs in Missouri
https://github.com/google-deepmind/3d-shapes
This repository contains the 3D shapes dataset, used in Kim, Hyunjik and Mnih, Andriy. "Disentangling by Factorising." In Proceedings of the 35th International Conference on Machine Learning (ICML). 2018. to assess the disentanglement properties of unsupervised learning methods.
https://github.com/google-deepmind/kinetics-i3d
Convolutional neural network model for video classification trained on the Kinetics dataset.
https://github.com/geoscienceaustralia/dea-waterbodies
DEA Waterbodies is a product that maps and monitors open waterbodies across Australia. Once a polygon set has been generated corresponding to open waterbodies, each waterbody is tracked over time to record the change in wet surface area over time.
https://github.com/gradio-app/awesome-demos
links and status of cool gradio demos
https://github.com/feenkcom/gt4atproto
A dedicated environment for AT Protocol build in Glamorous Toolkit.
https://github.com/kundajelab/chipseq_pipeline
AQUAS TF and histone ChIP-seq pipeline
https://github.com/kundajelab/seqdataloader
Sequence data label generation and ingestion into deep learning models
https://github.com/google-deepmind/narrativeqa
This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and questions and answers.
https://github.com/distancedevelopment/mads
Multi-Analysis Distance Sampling. Deals with unidentified sightings, covariate uncertainty and model uncertainty in Distance sampling.
https://github.com/deepset-ai/haystack-search-pipeline-streamlit
🚀 Template Haystack Search Application with Streamlit
https://github.com/lromul/argus-tgs-salt
Kaggle | 14th place solution for TGS Salt Identification Challenge
https://github.com/john-guerra/d3brushandlinkingexample
A step by step tutorial on how to do brushing and linking on d3
https://github.com/dwhieb/jschemer
A Node.js library to generate documentation for JSON Schemas
https://github.com/deepset-ai/haystack-sagemaker
🚀 This repo is a showcase of how you can use models deployed on AWS SageMaker in your Haystack Retrieval Augmented Generative AI pipelines
https://github.com/havakv/safescope
Prevent access to global variables in functions
https://github.com/google-deepmind/interval-bound-propagation
This repository contains a simple implementation of Interval Bound Propagation (IBP) using TensorFlow: https://arxiv.org/abs/1810.12715
https://github.com/lromul/gramtion
Twitter bot for generating photo descriptions (alt text)
https://github.com/ezefranca/the-missing-apple-watch-loader
The missing apple watch loader ⌚️
https://github.com/imperialcollegelondon/rcds-plotting-in-python-with-matplotlib
https://github.com/ffri/orom-backdoor-research
PoC code and tools for Black Hat USA 2024
https://github.com/hungrybluedev/xlsx
V library to add support for Microsoft Excel files.
https://github.com/dcavar/elan2split
Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners
https://github.com/dcavar/fomajni
A Java JNI interface for Foma (a Finite State Transducer compiler for NLP)
https://github.com/dcavar/schemenlp
Scheme code for computational linguistics, natural language processing, corpus analysis taught at ESSLLI long time ago
https://github.com/linuxscout/mishtar
Mishtar: Named and temporal entities chunker
https://github.com/lczech/nidhoggr
Snakemake pipeline to run phylogenetic tree inferences
https://github.com/lromul/argus-alaska
Kaggle | Part of 25th place solution for ALASKA2 Image Steganalysis kaggle competition.
https://github.com/deepset-ai/rag-ui-nextjs
An example of a simple UI for a deepset Cloud RAG pipeline using Next.js
https://github.com/devidw/covid-certificate-decoder-flask
Decode your eu health certificate qr code
https://github.com/devidw/bookmarktoshortcut
Convert your browser bookmarks to web shortcut files (.url, .webloc, .desktop) to be able to use them on your desktop and in your file explorer.
https://github.com/dmetivie/binaryhierarchicalperiodichiddenmarkovmodels.jl
Super long name to do binary data HMM (possibly non homogeneous) with extra dependence on past observation
https://github.com/devidw/e11
fully typed js/ts openapi client for elevenlabs api
https://github.com/kundajelab/locusselect
extraction of data embeddings from deep learning model layers; computation of embedding distance and visualization with umap/tsne
https://github.com/devidw/adoc-styles
Monorepo for stylesheets, "skins" and tooling around Asciidoctor document styles.
https://github.com/erictleung/skating-naif
Analyze and get feedback of figure skating moves using Python and OpenCV
https://github.com/devidw/pelican-theme-darksome
Darksome theme for Pelican Static Site Generator
https://github.com/lamastex/computational-statistical-experiments
Courses in Computational Statistical Experiments through Matlab and SageMath
https://github.com/erictleung/phyloseq-cheatsheet
:notebook: Minimal cheatsheet for functions in the phyloseq R package
https://github.com/dcavar/antisemitismdatathon2020
This is project material for the Antisemitism Datathon and Hackathon 2020 at Indiana University
https://github.com/dcavar/juliafoma
Julia NLP with Foma: Finite State Transducer for Morphological Analysis
https://github.com/dcavar/treebankparser
Parser for treebanks based on Penn Treebank type of encoding that generates Probabilistic Context Free Grammars
https://github.com/google-research/unique-randomizer
UniqueRandomizer is a data structure for sampling outputs of a randomized program, such as a neural sequence model, incrementally and without replacement.
https://github.com/devidw/wp-hook
Generate WordPress add_action and add_filter hooks on the fly.