Updated 6 months ago
ccn-data-library
The Coastal Carbon Network Data Library: An open-source database featuring carbon data from tidal wetlands around the world
Updated 6 months ago
data-augmentation-review
A list of useful data augmentation resources, including less common techniques, libraries, links to GitHub repos, papers, and more.
Updated 6 months ago
graphg
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
Updated 6 months ago
https://github.com/sebhaan/tabpfgen
TabPFGen: Synthetic Tabular Data Generation with TabPFN
Updated 6 months ago
https://github.com/OpenDCAI/DataFlow
Easy data preparation with the latest LLM-based operators and pipelines.