ragxplorer

Open-source tool to visualise your RAG 🔮

https://github.com/gabrielchua/ragxplorer

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • â—‹
    CITATION.cff file
  • ✓
    codemeta.json file
    Found codemeta.json file
  • ✓
    .zenodo.json file
    Found .zenodo.json file
  • â—‹
    DOI references
  • â—‹
    Academic publication links
  • â—‹
    Committers with academic emails
  • â—‹
    Institutional organization owner
  • â—‹
    JOSS paper metadata
  • â—‹
    Scientific vocabulary similarity
    Low similarity (11.3%) to scientific vocabulary

Keywords

interactive llm python rag streamlit visualization
Last synced: 6 months ago · JSON representation

Repository

Open-source tool to visualise your RAG 🔮

Basic Info
  • Host: GitHub
  • Owner: gabrielchua
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 1.07 MB
Statistics
  • Stars: 1,150
  • Watchers: 12
  • Forks: 105
  • Open Issues: 7
  • Releases: 2
Topics
interactive llm python rag streamlit visualization
Created about 2 years ago · Last pushed about 1 year ago
Metadata Files
Readme Contributing License Code of conduct Citation

README.md

RAGxplorer

PyPI version Streamlit App

RAGxplorer is a tool to build Retrieval Augmented Generation (RAG) visualisations.

Quick Start

Installation

bash pip install ragxplorer

Usage

python from ragxplorer import RAGxplorer client = RAGxplorer(embedding_model="thenlper/gte-large") client.load_pdf("presentation.pdf", verbose=True) client.visualize_query("What are the top revenue drivers for Microsoft?")

A quickstart Jupyter notebook tutorial on how to use ragxplorer can be found at https://github.com/gabrielchua/RAGxplorer/blob/main/tutorials/quickstart.ipynb

Or as a Colab notebook:

Open In Colab

Streamlit Demo

The demo can be found here: https://ragxplorer.streamlit.app/

View the project here

Contributing

Contributions to RAGxplorer are welcome. Please read our contributing guidelines (WIP) for details.

License

This project is licensed under the MIT license - see the LICENSE for details.

Acknowledgments

  • DeepLearning.AI and Chroma for the inspiration and code labs in their Advanced Retrival course.
  • The Streamlit community for the support and resources.

Owner

  • Login: gabrielchua
  • Kind: user
  • Location: Singapore

GitHub Events

Total
  • Issues event: 3
  • Watch event: 74
  • Delete event: 1
  • Issue comment event: 1
  • Push event: 4
  • Pull request event: 5
  • Fork event: 10
  • Create event: 2
Last Year
  • Issues event: 3
  • Watch event: 74
  • Delete event: 1
  • Issue comment event: 1
  • Push event: 4
  • Pull request event: 5
  • Fork event: 10
  • Create event: 2

Committers

Last synced: 11 months ago

All Time
  • Total Commits: 145
  • Total Committers: 10
  • Avg Commits per committer: 14.5
  • Development Distribution Score (DDS): 0.248
Past Year
  • Commits: 1
  • Committers: 1
  • Avg Commits per committer: 1.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Gabriel Chua c****b@g****m 109
sweep-ai[bot] 1****] 15
Vince Lam 5****m 6
dsmueller3760 d****m@d****o 5
Ted Stresen-Reuter t****d@s****u 4
ibrahimroshdy i****m@w****i 2
João Galego j****o@a****t 1
Ikko Eltociear Ashimine e****r@g****m 1
David Ruan r****z@g****m 1
Aniruddha Adhikary 9****y 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 21
  • Total pull requests: 40
  • Average time to close issues: 21 days
  • Average time to close pull requests: about 17 hours
  • Total issue authors: 12
  • Total pull request authors: 10
  • Average comments per issue: 2.19
  • Average comments per pull request: 0.73
  • Merged pull requests: 25
  • Bot issues: 0
  • Bot pull requests: 19
Past Year
  • Issues: 1
  • Pull requests: 3
  • Average time to close issues: N/A
  • Average time to close pull requests: 2 minutes
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • gabrielchua (5)
  • vince-lam (2)
  • tedsecretsource (2)
  • chintanckg (2)
  • ibrahimroshdy (1)
  • JGalego (1)
  • deveshXm (1)
  • XinyueZ (1)
  • hhan-wooliesx (1)
  • Luhuanz (1)
  • dsmueller3760 (1)
  • linuaries (1)
Pull Request Authors
  • sweep-ai[bot] (25)
  • gabrielchua (13)
  • tedsecretsource (4)
  • vince-lam (3)
  • dan-s-mueller (3)
  • JGalego (2)
  • eltociear (2)
  • ibrahimroshdy (2)
  • ruanwz (2)
  • dsmueller3760 (1)
  • aniruddha-adhikary (1)
Top Labels
Issue Labels
enhancement (7) help wanted (4) sweep (3) good first issue (2) documentation (1) bug (1)
Pull Request Labels
sweep (25)

Packages

  • Total packages: 3
  • Total downloads:
    • pypi 52 last-month
  • Total dependent packages: 1
    (may contain duplicates)
  • Total dependent repositories: 0
    (may contain duplicates)
  • Total versions: 15
  • Total maintainers: 1
proxy.golang.org: github.com/gabrielchua/ragxplorer
  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 6.5%
Average: 6.7%
Dependent repos count: 7.0%
Last synced: 6 months ago
proxy.golang.org: github.com/gabrielchua/RAGxplorer
  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 6.5%
Average: 6.7%
Dependent repos count: 7.0%
Last synced: 6 months ago
pypi.org: ragxplorer

A open-source tool to to visualise your RAG documents 🔮.

  • Versions: 11
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Downloads: 52 Last month
Rankings
Dependent packages count: 10.0%
Average: 37.8%
Dependent repos count: 65.7%
Maintainers (1)
Last synced: 6 months ago

Dependencies

requirements.txt pypi
  • Deprecated ==1.2.14
  • GitPython ==3.1.41
  • Jinja2 ==3.1.3
  • MarkupSafe ==2.1.3
  • PyPDF2 ==3.0.1
  • PyPika ==0.48.9
  • PyYAML ==6.0.1
  • Pygments ==2.17.2
  • SQLAlchemy ==2.0.25
  • aiohttp ==3.9.1
  • aiosignal ==1.3.1
  • altair ==5.2.0
  • annotated-types ==0.6.0
  • anyio ==4.2.0
  • asgiref ==3.7.2
  • attrs ==23.2.0
  • backoff ==2.2.1
  • bcrypt ==4.1.2
  • blinker ==1.7.0
  • build ==1.0.3
  • cachetools ==5.3.2
  • certifi ==2023.11.17
  • charset-normalizer ==3.3.2
  • chroma-hnswlib ==0.7.3
  • chromadb ==0.4.22
  • click ==8.1.7
  • coloredlogs ==15.0.1
  • dataclasses-json ==0.6.3
  • distro ==1.9.0
  • fastapi ==0.108.0
  • filelock ==3.13.1
  • flatbuffers ==23.5.26
  • frozenlist ==1.4.1
  • fsspec ==2023.12.2
  • gitdb ==4.0.11
  • google-auth ==2.26.2
  • googleapis-common-protos ==1.62.0
  • grpcio ==1.60.0
  • h11 ==0.14.0
  • httpcore ==1.0.2
  • httptools ==0.6.1
  • httpx ==0.26.0
  • huggingface-hub ==0.20.2
  • humanfriendly ==10.0
  • idna ==3.6
  • importlib-metadata ==6.11.0
  • importlib-resources ==6.1.1
  • joblib ==1.3.2
  • jsonpatch ==1.33
  • jsonpointer ==2.4
  • jsonschema ==4.20.0
  • jsonschema-specifications ==2023.12.1
  • kubernetes ==29.0.0
  • langchain ==0.1.0
  • langchain-community ==0.0.11
  • langchain-core ==0.1.9
  • langsmith ==0.0.79
  • llvmlite ==0.41.1
  • markdown-it-py ==3.0.0
  • marshmallow ==3.20.2
  • mdurl ==0.1.2
  • mmh3 ==4.1.0
  • monotonic ==1.6
  • mpmath ==1.3.0
  • multidict ==6.0.4
  • mypy-extensions ==1.0.0
  • networkx ==3.2.1
  • nltk ==3.8.1
  • numba ==0.58.1
  • numpy ==1.26.3
  • oauthlib ==3.2.2
  • onnxruntime ==1.16.3
  • openai ==1.7.1
  • opentelemetry-api ==1.22.0
  • opentelemetry-exporter-otlp-proto-common ==1.22.0
  • opentelemetry-exporter-otlp-proto-grpc ==1.22.0
  • opentelemetry-instrumentation ==0.43b0
  • opentelemetry-instrumentation-asgi ==0.43b0
  • opentelemetry-instrumentation-fastapi ==0.43b0
  • opentelemetry-proto ==1.22.0
  • opentelemetry-sdk ==1.22.0
  • opentelemetry-semantic-conventions ==0.43b0
  • opentelemetry-util-http ==0.43b0
  • overrides ==7.4.0
  • packaging ==23.2
  • pandas ==2.1.4
  • pillow ==10.2.0
  • plotly ==5.18.0
  • posthog ==3.3.1
  • protobuf ==4.25.2
  • pulsar-client ==3.4.0
  • pyarrow ==14.0.2
  • pyasn1 ==0.5.1
  • pyasn1-modules ==0.3.0
  • pydantic ==2.5.3
  • pydantic_core ==2.14.6
  • pydeck ==0.8.1b0
  • pynndescent ==0.5.11
  • pypdf ==3.17.4
  • pyproject_hooks ==1.0.0
  • pysqlite3-binary *
  • python-dateutil ==2.8.2
  • python-dotenv ==1.0.0
  • pytz ==2023.3.post1
  • referencing ==0.32.1
  • regex ==2023.12.25
  • requests ==2.31.0
  • requests-oauthlib ==1.3.1
  • rich ==13.7.0
  • rpds-py ==0.16.2
  • rsa ==4.9
  • safetensors ==0.4.1
  • scikit-learn ==1.3.2
  • scipy ==1.11.4
  • sentence-transformers ==2.2.2
  • sentencepiece ==0.1.99
  • six ==1.16.0
  • smmap ==5.0.1
  • sniffio ==1.3.0
  • starlette ==0.32.0.post1
  • streamlit ==1.29.0
  • sympy ==1.12
  • tenacity ==8.2.3
  • threadpoolctl ==3.2.0
  • tokenizers ==0.15.0
  • toml ==0.10.2
  • toolz ==0.12.0
  • torch ==2.1.2
  • torchvision ==0.16.2
  • tornado ==6.4
  • tqdm ==4.66.1
  • transformers ==4.36.2
  • typer ==0.9.0
  • typing-inspect ==0.9.0
  • typing_extensions ==4.9.0
  • tzdata ==2023.4
  • tzlocal ==5.2
  • umap-learn ==0.5.5
  • urllib3 ==2.1.0
  • uvicorn ==0.25.0
  • uvloop ==0.19.0
  • validators ==0.22.0
  • watchfiles ==0.21.0
  • websocket-client ==1.7.0
  • websockets ==12.0
  • wrapt ==1.16.0
  • yarl ==1.9.4
  • zipp ==3.17.0