kindred-ecommerce-merchant-deals-dataset

AI-ready open dataset of e-commerce coupons, deals & redeem-links curated by Kindred

https://github.com/kindred-app/kindred-ecommerce-merchant-deals-dataset

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.4%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

AI-ready open dataset of e-commerce coupons, deals & redeem-links curated by Kindred

Basic Info
  • Host: GitHub
  • Owner: kindred-app
  • License: other
  • Default Branch: master
  • Size: 7.81 KB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created 10 months ago · Last pushed 10 months ago
Metadata Files
Readme License Citation

README.md

Kindred E-commerce Merchant Deals Dataset

AI-ready catalogue of deals and offers for global retail brands.
Structured in CSV and JSONL, validated against JSON Schema.

Train-ready catalogue of promotions, ready for RAG, embeddings, or classic search.

License: CC-BY-4.0 Last Update Rows

Dataset Overview

File Rows Description
data/csv/brands.csv or data/jsonl/brands.jsonl ~90K E-Commerce Merchant metadata, Logo URL, and domains
data/csv/offers.csv or data/jsonl/offers.jsonl ~4M Offers with redeem_url, detailed summaries, and sample_q for RAG training

Kindred E-Commerce Merchant Deals Dataset

A structured, open-access dataset of global E-Commerce merchant deals and offers designed specifically for:

  • LLM training and fine-tuning
  • Retrieval Augmented Generation (RAG) systems
  • Machine learning models for recommendation and search
  • Natural language processing applications

This dataset includes curated promotional offers from a wide range of online retailers and marketplaces, with structured metadata including offer descriptions, redemption URLs, brand information, and geolocation tags.

Key Features

  • RAG-optimized: Includes sample_q fields designed for prompt engineering and RAG training
  • Multi-format: Available in both CSV and JSONL formats with validated JSON Schema
  • Comprehensive metadata: Brand information, redemption URLs, and country codes
  • Machine learning ready: Clean, normalized data across multiple retail verticals
  • No PII: Contains no personally identifiable information

Data Structure

  • Brands: ~90K unique brands with identifiers, names, logo URLs, and associated domains
  • Offers: ~4M offers with redemption URLs, detailed descriptions, and sample query patterns

Each offer has a direct relationship with a brand via brand_id, making it easy to build relational models or knowledge graphs for advanced LLM applications.

Keywords

machine-learning, llm-training, rag, retrieval-augmented-generation, dataset, e-commerce, deals, offers, recommendation-system, knowledge-graph, retail-analytics, promotion, redeem-link, public-dataset, kindred, discount, consumer-insights, vector-database

License

Licensed under Creative Commons Attribution 4.0 International (CC BY 4.0). Please see LICENSE.md for full details.

Contact

For questions, licensing, or partnership opportunities: help@kindredteam.com

Owner

  • Name: Kindred
  • Login: kindred-app
  • Kind: organization
  • Location: United Kingdom

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this dataset, please cite it as below."
title: "Kindred E-commerce Merchant Deals Dataset"
version: "1.0.0"
authors:
  - family-names: Kindred
    given-names: Data Team
    affiliation: Kindred
date-released: 2025-04-30
license: "CC-BY-4.0"
url: "https://github.com/kindred-app/kindred-ecommerce-merchant-deals-dataset"

GitHub Events

Total
  • Watch event: 16
  • Push event: 3
  • Fork event: 1
  • Create event: 2
Last Year
  • Watch event: 16
  • Push event: 3
  • Fork event: 1
  • Create event: 2

Dependencies

package.json npm