Scientific Software
Updated 6 months ago

Retriever — Peer-reviewed • Rank 18.2 • Science 100%

Retriever: Data Retrieval Tool - Published in JOSS (2017)

Scientific Software
Updated 6 months ago

open-mastr — Peer-reviewed • Rank 16.6 • Science 100%

open-mastr: A Python Package to Download and Process the German Energy Registry Marktstammdatenregister - Published in JOSS (2024)

Political Science
Scientific Software
Updated 6 months ago

Rdataretriever — Peer-reviewed • Rank 18.3 • Science 93%

Rdataretriever: R Interface to the Data Retriever - Published in JOSS (2021)

Scientific Software
Updated 6 months ago

terrainr — Peer-reviewed • Rank 11.9 • Science 98%

terrainr: An R package for creating immersive virtual environments - Published in JOSS (2022)

Scientific Software
Updated 6 months ago

Foundry-ML - Software and Services to Simplify Access to Machine Learning Datasets in Materials Science — Peer-reviewed • Rank 14.0 • Science 95%

Foundry-ML - Software and Services to Simplify Access to Machine Learning Datasets in Materials Science - Published in JOSS (2024)

Artificial Intelligence and Machine Learning (30%)
Scientific Software
Updated 6 months ago

Jury — Peer-reviewed • Rank 14.8 • Science 93%

Jury: A Comprehensive Evaluation Toolkit - Published in JOSS (2024)

Scientific Software
Updated 6 months ago

Git-RDM — Peer-reviewed • Rank 4.2 • Science 95%

Git-RDM: A research data management plugin for the Git version control system - Published in JOSS (2016)

Updated 6 months ago

datasets • Rank 34.4 • Science 64%

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Updated 6 months ago

torchsr • Rank 16.8 • Science 67%

Super Resolution datasets and models in PyTorch

Updated 6 months ago

spectrochempy • Rank 14.0 • Science 67%

SpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python

Updated 6 months ago

folio • Rank 2.6 • Science 77%

Datasets for Teaching Archaeology and Palaeontology. Note: this is a read-only mirror of https://codeberg.org/tesselle/folio

Updated 6 months ago

dataset-phenotypes • Rank 4.9 • Science 72%

Preparatory scripts for BIDS tabular phenotypic data in large neuroimaging datasets.

Updated 6 months ago

loghub • Rank 9.1 • Science 67%

A large collection of system log datasets for AI-driven log analytics [ISSRE'23]

Updated 6 months ago

minari • Rank 18.5 • Science 57%

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Updated 5 months ago

https://github.com/bioinfomachinelearning/dips-plus • Rank 6.6 • Science 67%

The Enhanced Database of Interacting Protein Structures for Interface Prediction

Updated 5 months ago

awesome-forests • Rank 8.0 • Science 59%

🌳 A curated list of ground-truth forest datasets for the machine learning and forestry community.

Updated 6 months ago

dataset • Rank 5.0 • Science 62%

dataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections

Updated 5 months ago

deeplake • Rank 25.6 • Science 36%

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Updated 6 months ago

@stdlib/datasets-harrison-boston-house-prices • Rank 3.8 • Science 57%

A dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).

Updated 6 months ago

datasets-herndon-venus-semidiameters • Rank 3.8 • Science 57%

Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.

Updated 6 months ago

@stdlib/datasets-harrison-boston-house-prices-corrected • Rank 3.0 • Science 57%

A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).

Updated 6 months ago

@stdlib/datasets-pace-boston-house-prices • Rank 2.7 • Science 57%

A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).

Updated 6 months ago

datasets-minard-napoleons-march • Rank 1.8 • Science 57%

Data for Charles Joseph Minard's cartographic depiction of Napoleon's Russian campaign of 1812.

Updated 6 months ago

open-data-on-github • Rank 4.5 • Science 54%

Dataset files for the Open Data on GitHub paper

Updated 6 months ago

grimoire • Rank 5.9 • Science 51%

Grimoire is All You Need for Enhancing Large Language Models

Updated 6 months ago

cihai • Rank 12.2 • Science 44%

Python library for CJK (Chinese, Japanese, and Korean) language dictionaries

Updated 5 months ago

https://github.com/openneuroorg/openneuro • Rank 17.9 • Science 36%

A free and open platform for analyzing and sharing neuroimaging data

Updated 5 months ago

obp • Rank 17.2 • Science 36%

Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation

Updated 5 months ago

awesome-earth-artificial-intelligence • Rank 7.0 • Science 46%

A curated list of tutorials, notebooks, software, datasets, courses, books, video lectures and papers on Artificial Intelligence (AI) for Earth science. Contributions most welcome.

Updated 6 months ago

@stdlib/datasets-us-states-abbr • Rank 6.7 • Science 44%

A list of US state two-letter abbreviations in alphabetical order according to state name.

Updated 5 months ago

csdmpy • Rank 10.6 • Science 39%

The Core Scientific Dataset Model (CSDM): A versatile and lightweight file format for scientific datasets.

Updated 6 months ago

@stdlib/datasets-cdc-nchs-us-infant-mortality-bw-1915-2013 • Rank 4.5 • Science 44%

US infant mortality data, by race, from 1915 to 2013, as provided by the Centers for Disease Control and Prevention's National Center for Health Statistics.

Updated 6 months ago

@stdlib/datasets-cdc-nchs-us-births-1994-2003 • Rank 4.4 • Science 44%

US birth data from 1994 to 2003, as provided by the Centers for Disease Control and Prevention's National Center for Health Statistics.

Updated 6 months ago

@stdlib/datasets-cdc-nchs-us-births-1969-1988 • Rank 4.3 • Science 44%

US birth data from 1969 to 1988, as provided by the Centers for Disease Control and Prevention's National Center for Health Statistics.

Updated 6 months ago

@stdlib/datasets-us-states-capitals • Rank 4.1 • Science 44%

A list of US state capitals in alphabetical order according to state name.

Updated 6 months ago

@stdlib/datasets-suthaharan-single-hop-sensor-network • Rank 4.0 • Science 44%

Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.

Updated 6 months ago

@stdlib/datasets-suthaharan-multi-hop-sensor-network • Rank 3.7 • Science 44%

Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.

Updated 6 months ago

databot • Rank 2.6 • Science 44%

This platform is used to create bots whose job is to answer questions about a specific data source. It allows the automatic generation of a chat/voice bot swarm to attend all the data sources in an Open Data Portal.

Updated 5 months ago

geobr • Rank 19.3 • Science 26%

Easy access to official spatial data sets of Brazil in R and Python

Updated 5 months ago

pharmacogx • Rank 8.2 • Science 36%

R package to analyze large-scale pharmacogenomic datasets.

Updated 6 months ago

napolab • Rank 7.0 • Science 36%

The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their performance across carefully curated Portuguese language tasks.

Updated 5 months ago

usfertilizer • Rank 8.4 • Science 33%

An R package to retrieve county-level fertilizer estimate data for the USA from 1945 to 2012.

Updated 5 months ago

Callisto-Dataset-Collection • Rank 7.1 • Science 33%

A list of datasets aiming to enable Artificial Intelligence applications that use Copernicus data.

Updated 5 months ago

wildlife-datasets • Rank 13.3 • Science 26%

WildlifeDatasets: An open-source toolkit for animal re-identification

Updated 5 months ago

medicaldata • Rank 11.3 • Science 23%

Data Package for Medical Datasets

Updated 5 months ago

metaboData • Rank 10.7 • Science 20%

An R package containing example data sets for metabolomics analyses