Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (15.4%) to scientific vocabulary
Repository
官方示例
Basic Info
- Host: GitHub
- Owner: mason0510
- License: mit
- Language: Python
- Default Branch: main
- Size: 6.74 MB
Statistics
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
🗂️ LlamaIndex 🦙 (GPT Index)
⚠️ NOTE: We are rebranding GPT Index as LlamaIndex! We will carry out this transition gradually.
2/25/2023: By default, our docs/notebooks/instructions now reference "LlamaIndex" instead of "GPT Index".
2/19/2023: By default, our docs/notebooks/instructions now use the
llama-indexpackage. However thegpt-indexpackage still exists as a duplicate!2/16/2023: We have a duplicate
llama-indexpip package. Simply replace all imports ofgpt_indexwithllama_indexif you choose topip install llama-index.
LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data.
PyPi: - LlamaIndex: https://pypi.org/project/llama-index/. - GPT Index (duplicate): https://pypi.org/project/gpt-index/.
Documentation: https://gpt-index.readthedocs.io/en/latest/.
Twitter: https://twitter.com/gpt_index.
Discord: https://discord.gg/dGcwcsnxhU.
LlamaHub (community library of data loaders): https://llamahub.ai
🚀 Overview
NOTE: This README is not updated as frequently as the documentation. Please check out the documentation above for the latest updates!
Context
- LLMs are a phenomenonal piece of technology for knowledge generation and reasoning.
- A big limitation of LLMs is context size (e.g. Davinci's limit is 4096 tokens. Large, but not infinite).
- The ability to feed "knowledge" to LLMs is restricted to this limited prompt size and model weights.
Proposed Solution
At its core, LlamaIndex contains a toolkit designed to easily connect LLM's with your external data. LlamaIndex helps to provide the following: - A set of data structures that allow you to index your data for various LLM tasks, and remove concerns over prompt size limitations. - Data connectors to your common data sources (Google Docs, Slack, etc.). - Cost transparency + tools that reduce cost while increasing performance.
Each data structure offers distinct use cases and a variety of customizable parameters. These indices can then be queried in a general purpose manner, in order to achieve any task that you would typically achieve with an LLM: - Question-Answering - Summarization - Text Generation (Stories, TODO's, emails, etc.) - and more!
💡 Contributing
Interesting in contributing? See our Contribution Guide for more details.
📄 Documentation
Full documentation can be found here: https://gpt-index.readthedocs.io/en/latest/.
Please check it out for the most up-to-date tutorials, how-to guides, references, and other resources!
💻 Example Usage
pip install llama-index
Examples are in the examples folder. Indices are in the indices folder (see list of indices below).
To build a simple vector store index: ```python import os os.environ["OPENAIAPIKEY"] = 'YOUROPENAIAPI_KEY'
from llamaindex import GPTSimpleVectorIndex, SimpleDirectoryReader documents = SimpleDirectoryReader('data').loaddata() index = GPTSimpleVectorIndex(documents) ```
To save to and load from disk: ```python
save to disk
index.savetodisk('index.json')
load from disk
index = GPTSimpleVectorIndex.loadfromdisk('index.json') ```
To query:
python
index.query("<question_text>?")
🔧 Dependencies
The main third-party package requirements are tiktoken, openai, and langchain.
All requirements should be contained within the setup.py file. To run the package locally without building the wheel, simply run pip install -r requirements.txt.
📖 Citation
Reference to cite if you use LlamaIndex in a paper:
@software{Liu_LlamaIndex_2022,
author = {Liu, Jerry},
doi = {10.5281/zenodo.1234},
month = {11},
title = {{LlamaIndex}},
url = {https://github.com/jerryjliu/gpt_index},
year = {2022}
}
Owner
- Name: mason
- Login: mason0510
- Kind: user
- Location: ShangHai,China
- Repositories: 1,019
- Profile: https://github.com/mason0510
Coding changes the world.
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Liu" given-names: "Jerry" orcid: "https://orcid.org/0000-0002-6694-3517" title: "LlamaIndex" doi: 10.5281/zenodo.1234 date-released: 2022-11-1 url: "https://github.com/jerryjliu/gpt_index"
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v2 composite
- actions/create-release v1 composite
- actions/setup-python v2 composite
- actions/upload-release-asset v1 composite
- pypa/gh-action-pypi-publish master composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- discord.py *
- google-api-python-client *
- google-auth-httplib2 *
- google-auth-oauthlib *
- pymongo *
- slack_sdk *
- wikipedia *
- docutils <0.17
- myst-parser *
- sphinx >=4.3.0
- sphinx_rtd_theme >=0.5.1
- black ==22.12.0
- flake8 ==6.0.0
- flake8-docstrings ==1.6.0
- ipython ==8.10.0
- isort ==5.11.4
- mypy ==0.991
- pylint ==2.15.10
- pytest ==7.2.1
- pytest-dotenv ==0.5.2
- rake_nltk ==1.0.6
- types-requests ==2.28.11.8
- types-setuptools ==67.1.0.0
- dataclasses_json *
- langchain *
- numpy *
- openai >=0.26.4
- pandas *
- tenacity >=8.2.0,<9.0.0