https://github.com/azurelotus06/pdfchatbot
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.0%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: azurelotus06
- Language: Python
- Default Branch: master
- Size: 7.81 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
PDFChatbot
PDFChatbot is a Python application designed to extract and provide relevant information from a set of documents using OpenAI's GPT models and Pinecone for vector similarity search. This project leverages embeddings to find the most relevant context for a given query and uses a chat-based completion model to generate answers with citations.
Features
- Vector Similarity Search: Uses Pinecone to find relevant document sections based on query embeddings.
- Chat-based Completion: Utilizes OpenAI's GPT models to generate answers from the extracted context.
- Citations: Provides source citations for the generated answers.
Requirements
- Python 3.7+
- OpenAI Python SDK
- Pinecone Python SDK
- dotenv
Setup
Clone the repository:
bash git clone https://github.com/azurelotus0926/PDFChatbot.git cd PDFChatbotInstall dependencies:
bash pip install -r requirements.txtEnvironment Variables: Create a
.envfile in the root directory and add your API keys and other configurations:OPENAI_API_KEY=your_openai_api_key PINECONE_API_KEY=your_pinecone_api_key PINECONE_INDEX_NAME=your_pinecone_index_name PINECONE_NAMESPACE=your_pinecone_namespace
Owner
- Login: azurelotus06
- Kind: user
- Repositories: 1
- Profile: https://github.com/azurelotus06
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1