Scientific Software
Updated 6 months ago

Crowsetta — Peer-reviewed • Rank 13.3 • Science 93%

Crowsetta: A Python tool to work with any format for annotating animal vocalizations and bioacoustics data - Published in JOSS (2023)

Updated 3 months ago

Tabbed: A Python package for reading variably structured text files at scale • Rank 0.7 • Science 95%

Tabbed: A Python package for reading variably structured text files at scale - Published in JOSS (2025)

Updated 5 months ago

https://github.com/alan-turing-institute/clevercsv • Rank 21.7 • Science 57%

CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.

Updated 6 months ago

pandas-ai • Rank 24.5 • Science 54%

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Updated 6 months ago

xan • Rank 12.0 • Science 59%

The CSV magician

Updated 6 months ago

datatools • Rank 5.7 • Science 62%

A set of tools for working with JSON, CSV and Excel workbooks

Updated 6 months ago

tensorboard-reducer • Rank 12.2 • Science 54%

Reduce multiple PyTorch TensorBoard runs to new event (or CSV) files.

Scientific Software
Updated 6 months ago

flowTorch - a Python library for analysis and reduced-order modeling of fluid flows — Peer-reviewed • Rank 6.9 • Science 59%

flowTorch - a Python library for analysis and reduced-order modeling of fluid flows - Published in JOSS (2021)

Earth and Environmental Sciences
Scientific Software · Peer-reviewed
Updated 6 months ago

externdata • Rank 7.0 • Science 57%

Modelica library for data I/O of CSV, INI, JSON, MATLAB MAT, SSV, TIR, Excel XLS/XLSX and XML files

Updated 6 months ago

pyexcel • Rank 25.1 • Science 36%

Single API for reading, manipulating and writing data in csv, ods, xls, xlsx and xlsm files

Updated 6 months ago

fitz-collection-raw-data • Rank 2.3 • Science 54%

Raw data from the collections database in JSON and CSV formats

Updated 6 months ago

readr • Rank 29.4 • Science 23%

Read flat files (csv, tsv, fwf) into R

Updated 6 months ago

vroom • Rank 27.5 • Science 23%

Fast reading of delimited files

Updated 6 months ago

rejoinderoo • Rank 4.1 • Science 44%

Rejoinderoo creates a rejoinder (response to reviewers) LaTeX document based on a spreadsheet file.

Updated 6 months ago

csv-metadata-quality • Rank 3.4 • Science 44%

A simple but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpace ecosystem

Updated 6 months ago

@stdlib/utils-dsv-base • Rank 2.8 • Science 44%

Standard base utilities for working with data formatted as delimiter-separated values (DSV).

Updated 6 months ago

@stdlib/utils-dsv • Rank 2.5 • Science 44%

Standard utilities for working with data formatted as delimiter-separated values (DSV).

Updated 6 months ago

sixarm_ruby_spreadsheeting • Rank 0.7 • Science 44%

SixArm.com » Ruby » Spreadsheeting has import & export helpers for CSV, TSV, Excel, etc.

Updated 6 months ago

rio • Rank 21.2 • Science 23%

🐟 A Swiss-Army Knife for Data I/O

Updated 6 months ago

odin • Rank 13.6 • Science 26%

Data-structure definition/validation/traversal, mapping and serialisation toolkit for Python

Updated 5 months ago

https://github.com/cube2222/octosql • Rank 14.4 • Science 23%

OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

Updated 5 months ago

https://github.com/alpstable/gidari • Rank 9.2 • Science 13%

Transport web data to local/remote storage using Gidari

Updated 5 months ago

https://github.com/amin2312/acsv • Rank 4.4 • Science 13%

ACsv is an easy-to-use, powerful, multi-platform CSV parsing library available for JS, TS, Haxe, PHP, Java, Python, C#, and Go

Updated 6 months ago

damagedlogginganalyzer • Science 44%

A Python project analyzing statistics on damaged logging (wood) in Germany.

Updated 5 months ago

https://github.com/rumbledb/rumble • Science 36%

⛈️ RumbleDB 2.0.0 "Lemon Ironwood" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more

Updated 6 months ago

convert-csv-schwab2pp • Science 67%

Converts a Charles Schwab transaction CSV file to a ready-to-import CSV file for Portfolio Performance.

Updated 5 months ago

https://github.com/avitase/ais_seqmaker • Science 13%

Tool suite for parsing AIS data from CSV stream

Updated 6 months ago

csv2ical • Science 67%

A CLI tool that converts a CSV file with event details into an iCalendar ICS file. The ICS file can then be imported into apps like Google Calendar, Microsoft Outlook, Apple macOS Calendar, etc.

Updated 5 months ago

https://github.com/akiomik/scalatest-csv-table • Science 13%

A ScalaTest helper for table-driven testing with CSV.

Updated 6 months ago

litsift • Science 44%

LitSift: Seamlessly search, sift, and export results from Semantic Scholar to BibTeX/CSV

Updated 5 months ago

https://github.com/cronokirby/serve-csv • Science 13%

Create a web API from static csv files

Updated 5 months ago

https://github.com/danielfeitopin/cve2csv • Science 13%

Fetch CVE data and save to CSV

Updated 6 months ago

lector • Science 44%

A fast reader for messy CSV files with optional type inference.

Updated 6 months ago

wovensnips • Science 44%

WovenSnips: A Lightweight, Free, and Open-source Implementation of Retrieval-Augmented Generation (RAG) using Straico API

Updated 5 months ago

https://github.com/cumbof/opengdc • Science 39%

An open-source Java tool to automatically extract and convert all clinical and genomic data from the Genomic Data Commons to BED, GTF, CSV, and JSON formats

Updated 6 months ago

synthetic-data-generator • Science 44%

SDG generates synthetic breast cancer patient data

Updated 6 months ago

csv2cmi • Science 49%

A little program to transform a table of letters into the CMI format

Updated 5 months ago

https://github.com/anselmoo/csv_first_insight • Science 23%

A sklearn-based correlation and prediction maker for small CSV datasets

Updated 5 months ago

https://github.com/cured-plus/csvw-duckdb • Science 13%

Convert a CSVW document (CSV metadata) to a DuckDB query to load a CSV file into a database.

Updated 6 months ago

india-isin-data • Science 67%

International Securities Identification Numbers for various Indian Securities

Updated 6 months ago

cautious-robot • Science 75%

Simple downloader that fetches images listed in a CSV file and records checksums for the downloaded image folder.

Updated 6 months ago

sum-buddy • Science 75%

Generate and save checksums for all (or certain) contents of a given directory.

Updated 5 months ago

https://github.com/3mcloud/plotme • Science 26%

plot all the things in all the folders automatically but only if there have been changes

Updated 5 months ago

https://github.com/cnag-biomedical-informatics/pheno-ranker • Science 39%

Pheno-Ranker is a tool for comparing phenotypic data structured in JSON/YAML format, such as Beacon v2 Models or Phenopackets v2, as well as CSV.