kindred-ecommerce-merchant-deals-dataset
AI-ready open dataset of e-commerce coupons, deals & redeem-links curated by Kindred
https://github.com/kindred-app/kindred-ecommerce-merchant-deals-dataset
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.4%) to scientific vocabulary
Repository
AI-ready open dataset of e-commerce coupons, deals & redeem-links curated by Kindred
Basic Info
- Host: GitHub
- Owner: kindred-app
- License: other
- Default Branch: master
- Size: 7.81 KB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Kindred E-commerce Merchant Deals Dataset
AI-ready catalogue of deals and offers for global retail brands.
Structured in CSV and JSONL, validated against JSON Schema.
Train-ready catalogue of promotions, ready for RAG, embeddings, or classic search.
Dataset Overview
| File | Rows | Description |
|---|---|---|
data/csv/brands.csv or data/jsonl/brands.jsonl |
~90K | E-Commerce Merchant metadata, Logo URL, and domains |
data/csv/offers.csv or data/jsonl/offers.jsonl |
~4M | Offers with redeem_url, detailed summaries, and sample_q for RAG training |
Kindred E-Commerce Merchant Deals Dataset
A structured, open-access dataset of global E-Commerce merchant deals and offers designed specifically for:
- LLM training and fine-tuning
- Retrieval Augmented Generation (RAG) systems
- Machine learning models for recommendation and search
- Natural language processing applications
This dataset includes curated promotional offers from a wide range of online retailers and marketplaces, with structured metadata including offer descriptions, redemption URLs, brand information, and geolocation tags.
Key Features
- RAG-optimized: Includes
sample_qfields designed for prompt engineering and RAG training - Multi-format: Available in both CSV and JSONL formats with validated JSON Schema
- Comprehensive metadata: Brand information, redemption URLs, and country codes
- Machine learning ready: Clean, normalized data across multiple retail verticals
- No PII: Contains no personally identifiable information
Data Structure
- Brands: ~90K unique brands with identifiers, names, logo URLs, and associated domains
- Offers: ~4M offers with redemption URLs, detailed descriptions, and sample query patterns
Each offer has a direct relationship with a brand via brand_id, making it easy to build relational models
or knowledge graphs for advanced LLM applications.
Keywords
machine-learning, llm-training, rag, retrieval-augmented-generation, dataset, e-commerce, deals, offers, recommendation-system, knowledge-graph, retail-analytics, promotion, redeem-link, public-dataset, kindred, discount, consumer-insights, vector-database
License
Licensed under Creative Commons Attribution 4.0 International (CC BY 4.0). Please see LICENSE.md for full details.
Contact
For questions, licensing, or partnership opportunities: help@kindredteam.com
Owner
- Name: Kindred
- Login: kindred-app
- Kind: organization
- Location: United Kingdom
- Website: https://sdk.kindred.co
- Repositories: 3
- Profile: https://github.com/kindred-app
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this dataset, please cite it as below."
title: "Kindred E-commerce Merchant Deals Dataset"
version: "1.0.0"
authors:
- family-names: Kindred
given-names: Data Team
affiliation: Kindred
date-released: 2025-04-30
license: "CC-BY-4.0"
url: "https://github.com/kindred-app/kindred-ecommerce-merchant-deals-dataset"
GitHub Events
Total
- Watch event: 16
- Push event: 3
- Fork event: 1
- Create event: 2
Last Year
- Watch event: 16
- Push event: 3
- Fork event: 1
- Create event: 2