databot

This platform is used to create bots whose job is to answer questions about a specific data source. It allows the automatic generation of a chat/voice bot swarm to attend all the data sources in an Open Data Portal.

https://github.com/besser-pearl/databot

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.3%) to scientific vocabulary

Keywords

chatbot datasets intent-detection llm opendata voicebot
Last synced: 6 months ago · JSON representation ·

Repository

This platform is used to create bots whose job is to answer questions about a specific data source. It allows the automatic generation of a chat/voice bot swarm to attend all the data sources in an Open Data Portal.

Basic Info
  • Host: GitHub
  • Owner: BESSER-PEARL
  • License: mit
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 245 KB
Statistics
  • Stars: 6
  • Watchers: 1
  • Forks: 2
  • Open Issues: 1
  • Releases: 0
Topics
chatbot datasets intent-detection llm opendata voicebot
Created over 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme Contributing Funding License Code of conduct Citation Codeowners Security Governance

README.md

DataBot: Reliable data exploration through chatbots

This platform is used to create bots whose job is to answer questions about a specific data source. It allows the automatic generation of a chat/voice bot swarm to attend all the data sources in an Open Data Portal.

The highlights of DataBot are:

  • 💻 Import data through a friendly UI.
    • 💾 Upload your dataset directly to the platform, or...
    • 🌐 Automatically load all the data sources from an Open Data Portal through its API.
  • 🔎 A data schema is automatically inferred from the data source, and can be enhanced 💪 to improve the bot knowledge about the data (e.g., synonyms or translations). This can be done either manually or using ✨AI.
  • 🤖 Automatically generate a chatbot for each data source. These chatbots are powered by the BESSER Bot Framework. They recognize the user intent and generate the appropriate answer. So, no hallucinations at all.
  • Generation of tabular📅 and graphical📈 answers.
  • 🎙️ Interact with the chatbots either writing or speaking: voice recognition integrated.
  • ✨ For those questions the bot fails to identify, AI can be used to generate the best possible answer.
  • ✨ For the AI components (data schema enhancement and answer generation), we use the OpenAI API.

DataBot Playground Screenshot

Requirements

  • Python 3.11
  • Recommended: Create a virtual environment (e.g. venv, conda)

Installation

bash git clone https://github.com/BESSER-PEARL/databot cd databot pip install -r requirements.txt python main.py

License

This project is licensed under the MIT license

Copyright © 2023 Luxembourg Institute of Science and Technology. All rights reserved.

Owner

  • Name: BESSER-PEARL
  • Login: BESSER-PEARL
  • Kind: organization
  • Email: jordi.cabot@list.lu
  • Location: Luxembourg

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "FamilyNames"
  given-names: "GivenNames"
  orcid: "https://orcid.org/0000-0000-0000-0000"
- family-names: "FamilyNames"
  given-names: "GivenNames"
  orcid: "https://orcid.org/0000-0000-0000-0000"
title: "The title of your paper"
version: 1.0.0
doi: 10.1007/s10664-021-10061-x
date-released: 2021-11-16
url: "https://github.com/SOM-Research/NonCodingRoleAnalysis-NPMPackages"
preferred-citation:
  type: article
  authors:
  - family-names: "FamilyNames"
    given-names: "GivenNames"
    orcid: "https://orcid.org/0000-0000-0000-0000"
  - family-names: "FamilyNames"
    given-names: "GivenNames"
    orcid: "https://orcid.org/0000-0000-0000-0000"
  doi: "10.1007/s00000-000-00000-x"
  journal: "JournalName"
  month: 1
  start: 1 # First page number
  end: 100 # Last page number
  title: "The title of your paper"
  issue: 1
  volume: 29
  year: 2022

GitHub Events

Total
  • Watch event: 5
  • Push event: 3
  • Fork event: 2
Last Year
  • Watch event: 5
  • Push event: 3
  • Fork event: 2

Committers

Last synced: about 2 years ago

All Time
  • Total Commits: 64
  • Total Committers: 2
  • Avg Commits per committer: 32.0
  • Development Distribution Score (DDS): 0.016
Past Year
  • Commits: 64
  • Committers: 2
  • Avg Commits per committer: 32.0
  • Development Distribution Score (DDS): 0.016
Top Committers
Name Email Commits
mgv99 g****9@g****m 63
Marcos Gomez 5****9 1

Issues and Pull Requests

Last synced: about 2 years ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • maurodlt (1)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

docs/requirements.txt pypi
  • sphinx ==7.1.2
  • sphinx-rtd-theme ==1.3.0rc1
requirements.txt pypi
  • chardet ==5.2.0
  • openai ==1.3.8
  • pandasql ==0.7.3
  • streamlit ==1.27.2
  • streamlit-antd-components ==0.2.4
  • streamlit-browser-session-storage ==0.0.3
  • streamlit-chat ==0.1.1
  • streamlit-local-storage ==0.0.12
  • streamlit-screen-stats ==0.0.28