Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
â—‹CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
â—‹DOI references
-
â—‹Academic publication links
-
â—‹Committers with academic emails
-
â—‹Institutional organization owner
-
â—‹JOSS paper metadata
-
â—‹Scientific vocabulary similarity
Low similarity (11.3%) to scientific vocabulary
Keywords
Repository
Open-source tool to visualise your RAG 🔮
Basic Info
Statistics
- Stars: 1,150
- Watchers: 12
- Forks: 105
- Open Issues: 7
- Releases: 2
Topics
Metadata Files
README.md
RAGxplorer

RAGxplorer is a tool to build Retrieval Augmented Generation (RAG) visualisations.
Quick Start
Installation
bash
pip install ragxplorer
Usage
python
from ragxplorer import RAGxplorer
client = RAGxplorer(embedding_model="thenlper/gte-large")
client.load_pdf("presentation.pdf", verbose=True)
client.visualize_query("What are the top revenue drivers for Microsoft?")
A quickstart Jupyter notebook tutorial on how to use ragxplorer can be found at https://github.com/gabrielchua/RAGxplorer/blob/main/tutorials/quickstart.ipynb
Or as a Colab notebook:
Streamlit Demo
The demo can be found here: https://ragxplorer.streamlit.app/

View the project here
Contributing
Contributions to RAGxplorer are welcome. Please read our contributing guidelines (WIP) for details.
License
This project is licensed under the MIT license - see the LICENSE for details.
Acknowledgments
- DeepLearning.AI and Chroma for the inspiration and code labs in their Advanced Retrival course.
- The Streamlit community for the support and resources.
Owner
- Login: gabrielchua
- Kind: user
- Location: Singapore
- Repositories: 1
- Profile: https://github.com/gabrielchua
GitHub Events
Total
- Issues event: 3
- Watch event: 74
- Delete event: 1
- Issue comment event: 1
- Push event: 4
- Pull request event: 5
- Fork event: 10
- Create event: 2
Last Year
- Issues event: 3
- Watch event: 74
- Delete event: 1
- Issue comment event: 1
- Push event: 4
- Pull request event: 5
- Fork event: 10
- Create event: 2
Committers
Last synced: 11 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Gabriel Chua | c****b@g****m | 109 |
| sweep-ai[bot] | 1****] | 15 |
| Vince Lam | 5****m | 6 |
| dsmueller3760 | d****m@d****o | 5 |
| Ted Stresen-Reuter | t****d@s****u | 4 |
| ibrahimroshdy | i****m@w****i | 2 |
| João Galego | j****o@a****t | 1 |
| Ikko Eltociear Ashimine | e****r@g****m | 1 |
| David Ruan | r****z@g****m | 1 |
| Aniruddha Adhikary | 9****y | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 21
- Total pull requests: 40
- Average time to close issues: 21 days
- Average time to close pull requests: about 17 hours
- Total issue authors: 12
- Total pull request authors: 10
- Average comments per issue: 2.19
- Average comments per pull request: 0.73
- Merged pull requests: 25
- Bot issues: 0
- Bot pull requests: 19
Past Year
- Issues: 1
- Pull requests: 3
- Average time to close issues: N/A
- Average time to close pull requests: 2 minutes
- Issue authors: 1
- Pull request authors: 1
- Average comments per issue: 0.0
- Average comments per pull request: 0.0
- Merged pull requests: 2
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- gabrielchua (5)
- vince-lam (2)
- tedsecretsource (2)
- chintanckg (2)
- ibrahimroshdy (1)
- JGalego (1)
- deveshXm (1)
- XinyueZ (1)
- hhan-wooliesx (1)
- Luhuanz (1)
- dsmueller3760 (1)
- linuaries (1)
Pull Request Authors
- sweep-ai[bot] (25)
- gabrielchua (13)
- tedsecretsource (4)
- vince-lam (3)
- dan-s-mueller (3)
- JGalego (2)
- eltociear (2)
- ibrahimroshdy (2)
- ruanwz (2)
- dsmueller3760 (1)
- aniruddha-adhikary (1)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 3
-
Total downloads:
- pypi 52 last-month
-
Total dependent packages: 1
(may contain duplicates) -
Total dependent repositories: 0
(may contain duplicates) - Total versions: 15
- Total maintainers: 1
proxy.golang.org: github.com/gabrielchua/ragxplorer
- Documentation: https://pkg.go.dev/github.com/gabrielchua/ragxplorer#section-documentation
- License: mit
-
Latest release: v0.1.8
published about 2 years ago
Rankings
proxy.golang.org: github.com/gabrielchua/RAGxplorer
- Documentation: https://pkg.go.dev/github.com/gabrielchua/RAGxplorer#section-documentation
- License: mit
-
Latest release: v0.1.8
published about 2 years ago
Rankings
pypi.org: ragxplorer
A open-source tool to to visualise your RAG documents 🔮.
- Homepage: https://github.com/gabrielchua/ragxplorer
- Documentation: https://ragxplorer.readthedocs.io/
- License: MIT License
-
Latest release: 0.1.10
published almost 2 years ago
Rankings
Maintainers (1)
Dependencies
- Deprecated ==1.2.14
- GitPython ==3.1.41
- Jinja2 ==3.1.3
- MarkupSafe ==2.1.3
- PyPDF2 ==3.0.1
- PyPika ==0.48.9
- PyYAML ==6.0.1
- Pygments ==2.17.2
- SQLAlchemy ==2.0.25
- aiohttp ==3.9.1
- aiosignal ==1.3.1
- altair ==5.2.0
- annotated-types ==0.6.0
- anyio ==4.2.0
- asgiref ==3.7.2
- attrs ==23.2.0
- backoff ==2.2.1
- bcrypt ==4.1.2
- blinker ==1.7.0
- build ==1.0.3
- cachetools ==5.3.2
- certifi ==2023.11.17
- charset-normalizer ==3.3.2
- chroma-hnswlib ==0.7.3
- chromadb ==0.4.22
- click ==8.1.7
- coloredlogs ==15.0.1
- dataclasses-json ==0.6.3
- distro ==1.9.0
- fastapi ==0.108.0
- filelock ==3.13.1
- flatbuffers ==23.5.26
- frozenlist ==1.4.1
- fsspec ==2023.12.2
- gitdb ==4.0.11
- google-auth ==2.26.2
- googleapis-common-protos ==1.62.0
- grpcio ==1.60.0
- h11 ==0.14.0
- httpcore ==1.0.2
- httptools ==0.6.1
- httpx ==0.26.0
- huggingface-hub ==0.20.2
- humanfriendly ==10.0
- idna ==3.6
- importlib-metadata ==6.11.0
- importlib-resources ==6.1.1
- joblib ==1.3.2
- jsonpatch ==1.33
- jsonpointer ==2.4
- jsonschema ==4.20.0
- jsonschema-specifications ==2023.12.1
- kubernetes ==29.0.0
- langchain ==0.1.0
- langchain-community ==0.0.11
- langchain-core ==0.1.9
- langsmith ==0.0.79
- llvmlite ==0.41.1
- markdown-it-py ==3.0.0
- marshmallow ==3.20.2
- mdurl ==0.1.2
- mmh3 ==4.1.0
- monotonic ==1.6
- mpmath ==1.3.0
- multidict ==6.0.4
- mypy-extensions ==1.0.0
- networkx ==3.2.1
- nltk ==3.8.1
- numba ==0.58.1
- numpy ==1.26.3
- oauthlib ==3.2.2
- onnxruntime ==1.16.3
- openai ==1.7.1
- opentelemetry-api ==1.22.0
- opentelemetry-exporter-otlp-proto-common ==1.22.0
- opentelemetry-exporter-otlp-proto-grpc ==1.22.0
- opentelemetry-instrumentation ==0.43b0
- opentelemetry-instrumentation-asgi ==0.43b0
- opentelemetry-instrumentation-fastapi ==0.43b0
- opentelemetry-proto ==1.22.0
- opentelemetry-sdk ==1.22.0
- opentelemetry-semantic-conventions ==0.43b0
- opentelemetry-util-http ==0.43b0
- overrides ==7.4.0
- packaging ==23.2
- pandas ==2.1.4
- pillow ==10.2.0
- plotly ==5.18.0
- posthog ==3.3.1
- protobuf ==4.25.2
- pulsar-client ==3.4.0
- pyarrow ==14.0.2
- pyasn1 ==0.5.1
- pyasn1-modules ==0.3.0
- pydantic ==2.5.3
- pydantic_core ==2.14.6
- pydeck ==0.8.1b0
- pynndescent ==0.5.11
- pypdf ==3.17.4
- pyproject_hooks ==1.0.0
- pysqlite3-binary *
- python-dateutil ==2.8.2
- python-dotenv ==1.0.0
- pytz ==2023.3.post1
- referencing ==0.32.1
- regex ==2023.12.25
- requests ==2.31.0
- requests-oauthlib ==1.3.1
- rich ==13.7.0
- rpds-py ==0.16.2
- rsa ==4.9
- safetensors ==0.4.1
- scikit-learn ==1.3.2
- scipy ==1.11.4
- sentence-transformers ==2.2.2
- sentencepiece ==0.1.99
- six ==1.16.0
- smmap ==5.0.1
- sniffio ==1.3.0
- starlette ==0.32.0.post1
- streamlit ==1.29.0
- sympy ==1.12
- tenacity ==8.2.3
- threadpoolctl ==3.2.0
- tokenizers ==0.15.0
- toml ==0.10.2
- toolz ==0.12.0
- torch ==2.1.2
- torchvision ==0.16.2
- tornado ==6.4
- tqdm ==4.66.1
- transformers ==4.36.2
- typer ==0.9.0
- typing-inspect ==0.9.0
- typing_extensions ==4.9.0
- tzdata ==2023.4
- tzlocal ==5.2
- umap-learn ==0.5.5
- urllib3 ==2.1.0
- uvicorn ==0.25.0
- uvloop ==0.19.0
- validators ==0.22.0
- watchfiles ==0.21.0
- websocket-client ==1.7.0
- websockets ==12.0
- wrapt ==1.16.0
- yarl ==1.9.4
- zipp ==3.17.0