πŸš€ Template Haystack Search Application with Streamlit

https://github.com/deepset-ai/haystack-search-pipeline-streamlit

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • β—‹ CITATION.cff file
  • βœ“ codemeta.json file (found)
  • β—‹ .zenodo.json file
  • β—‹ DOI references
  • β—‹ Academic publication links
  • β—‹ Committers with academic emails
  • β—‹ Institutional organization owner
  • β—‹ JOSS paper metadata
  • β—‹ Scientific vocabulary similarity (low similarity: 12.1%)

Keywords

haystack nlp python streamlit

Keywords from Contributors

transformers mlops
Last synced: 4 months ago

Repository

πŸš€ Template Haystack Search Application with Streamlit

Basic Info
Statistics
  • Stars: 27
  • Watchers: 4
  • Forks: 10
  • Open Issues: 0
  • Releases: 0
Topics
haystack nlp python streamlit
Created about 3 years ago · Last pushed about 1 year ago
Metadata Files
Readme

README.md


---
title: Haystack Search Pipeline with Streamlit
emoji: πŸ‘‘
colorFrom: indigo
colorTo: indigo
sdk: streamlit
sdk_version: 1.23.0
app_file: app.py
pinned: false
---

Template Streamlit App for Haystack Search Pipelines

[!WARNING] This template is for Haystack version 1.x. For Haystack 2.x applications, use this template instead: Haystack Streamlit App.

This is a template Streamlit app set up for simple Haystack search applications. The template is ready to do QA with Retrieval Augmented Generation (RAG) or Extractive QA.

See the 'How to use this template' instructions below to create a simple UI for your own Haystack search pipelines.

Below you will also find instructions on how you could push this to Hugging Face Spaces πŸ€—.

Installation and Running

To run the bare application, which does nothing:
  1. Install requirements: pip install -r requirements.txt
  2. Run the Streamlit app: streamlit run app.py

This will start up the app on localhost:8501 where you will find a simple search bar. Before you start editing, you'll notice that the app will only show you instructions on what to edit.

Optional Configurations

You can set optional configurations for:
  • the --task you want to start the app with: rag or extractive (default: rag)
  • the --store you want to use: inmemory, opensearch, weaviate or milvus (default: inmemory)
  • the --name you want to have for the app (default: 'My Search App')

E.g.:

```bash
streamlit run app.py -- --store opensearch --task extractive --name 'My Opensearch Documentation Search'
```

In a .env file, include all the config settings that you would like to use based on:
  • the DocumentStore of your choice
  • the Extractive/Generative model of your choice

While utils/config.py will create default values for some configurations, others, such as OPENAI_KEY, have to be set in the .env file.

Example .env

OPENAI_KEY=YOUR_KEY
EMBEDDING_MODEL=sentence-transformers/all-MiniLM-L12-v2
GENERATIVE_MODEL=text-davinci-003
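For orientation, a minimal config loader in the spirit of utils/config.py could look like the sketch below. This is an illustration only, not the repository's actual code, and it assumes python-dotenv is installed to read the .env file.

```python
# Hypothetical sketch of a config loader; the real utils/config.py may differ.
import os

from dotenv import load_dotenv  # assumes python-dotenv is available

load_dotenv()  # load key=value pairs from a local .env file into the environment

# Values without safe defaults (e.g. API keys) must come from .env
OPENAI_KEY = os.getenv("OPENAI_KEY")

# Values with defaults fall back when .env does not set them
EMBEDDING_MODEL = os.getenv("EMBEDDING_MODEL", "sentence-transformers/all-MiniLM-L12-v2")
GENERATIVE_MODEL = os.getenv("GENERATIVE_MODEL", "text-davinci-003")

if OPENAI_KEY is None:
    raise ValueError("OPENAI_KEY is not set; add it to your .env file")
```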

How to use this template

  1. Create a new repository from this template or simply open it in a codespace to start playing around πŸ’™
  2. Make sure your requirements.txt file includes the Haystack and Streamlit versions you would like to use.
  3. Change the code in utils/haystack.py if you would like a different pipeline.
  4. Create a .env file with all of your configuration settings.
  5. Make any UI edits you'd like to and share with the Haystack community
  6. Run the app as shown in Installation and Running.

Repo structure

  • ./utils: This is where we have 3 files:
    • config.py: This file extracts all of the configuration settings from a .env file. For some config settings, it uses default values. An example of this is in this demo project.
    • haystack.py: Here you will find functions already set up for you to start creating your Haystack search pipeline. It includes two main functions: start_haystack(), which creates the pipeline and caches it, and query(), which app.py calls once a user query is received (see the sketch after this list).
    • ui.py: Use this file for any UI and initial value setups.
  • app.py: This is the main Streamlit application file that we will run. In its current state it has a simple search bar, a 'Run' button, and a response that you can highlight answers with.
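To make the division of labor concrete, a minimal start_haystack()/query() pair for the rag task might look roughly like the sketch below. It uses the Haystack 1.x API and Streamlit's cache_resource decorator; the model names and parameters are assumptions, and the repository's utils/haystack.py may differ.

```python
# Hypothetical sketch; the actual utils/haystack.py may differ in detail.
import os

import streamlit as st
from haystack import Pipeline
from haystack.document_stores import InMemoryDocumentStore
from haystack.nodes import AnswerParser, EmbeddingRetriever, PromptNode, PromptTemplate


@st.cache_resource(show_spinner=False)
def start_haystack():
    """Build the RAG pipeline once and cache it across Streamlit reruns."""
    document_store = InMemoryDocumentStore(embedding_dim=384)
    retriever = EmbeddingRetriever(
        document_store=document_store,
        embedding_model="sentence-transformers/all-MiniLM-L12-v2",
    )
    prompt_node = PromptNode(
        model_name_or_path="text-davinci-003",
        api_key=os.getenv("OPENAI_KEY"),
        default_prompt_template=PromptTemplate(
            prompt="Answer the question using the context.\n"
                   "Context: {join(documents)}\nQuestion: {query}\nAnswer:",
            output_parser=AnswerParser(),
        ),
    )
    pipeline = Pipeline()
    pipeline.add_node(component=retriever, name="Retriever", inputs=["Query"])
    pipeline.add_node(component=prompt_node, name="PromptNode", inputs=["Retriever"])
    return pipeline


def query(pipeline, question):
    """Called by app.py when the user submits a query; returns the pipeline output."""
    return pipeline.run(query=question, params={"Retriever": {"top_k": 3}})
```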

What to edit?

There are default pipelines in both start_haystack_extractive() and start_haystack_rag().

  • Change the pipelines to use the embedding models, extractive or generative models as you need.
  • If using the rag task, change the default_prompt_template to use one of the templates available on PromptHub, or create your own PromptTemplate (a small example follows this list).
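As a rough illustration of the second option, a custom PromptTemplate can be defined with the Haystack 1.x API as below; the prompt text is made up for this example and is not the template shipped with this repo.

```python
from haystack.nodes import AnswerParser, PromptTemplate

# Illustrative custom template; adapt the prompt text to your own use case.
custom_template = PromptTemplate(
    prompt="Using only the documents below, answer the question.\n"
           "Documents: {join(documents)}\n"
           "Question: {query}\n"
           "Answer:",
    output_parser=AnswerParser(),
)

# Hand it to the PromptNode when building the pipeline, e.g.
# PromptNode(model_name_or_path=..., default_prompt_template=custom_template)
```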

Pushing to Hugging Face Spaces πŸ€—

Below is an example GitHub action that will let you push your Streamlit app straight to the Hugging Face Hub as a Space.

A few things to pay attention to:

  1. Create a New Space on Hugging Face with the Streamlit SDK.
  2. Create a Hugging Face token on your HF account.
  3. Create a secret on your GitHub repo called HF_TOKEN and put your Hugging Face token in it.
  4. If you're using DocumentStores or APIs that require some keys/tokens, make sure these are provided as a secret for your HF Space too!
  5. This README's frontmatter tells HF Spaces that the app uses the Streamlit SDK and runs from app.py; change the frontmatter to set the title, emoji, etc. you desire.
  6. Create a file at .github/workflows/hf_sync.yml. Below is an example that you can adapt with your own information; there is also a working example workflow in the Should I Follow demo.

```yaml
name: Sync to Hugging Face hub
on:
  push:
    branches: [main]

  # to run this workflow manually from the Actions tab
  workflow_dispatch:

jobs:
  sync-to-hub:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v2
        with:
          fetch-depth: 0
          lfs: true
      - name: Push to hub
        env:
          HF_TOKEN: ${{ secrets.HF_TOKEN }}
        run: git push --force https://{YOUR_HF_USERNAME}:$HF_TOKEN@{YOUR_HF_SPACE_REPO} main
```

Owner

  • Name: deepset
  • Login: deepset-ai
  • Kind: organization
  • Email: hello@deepset.ai
  • Location: Berlin, Germany

Building enterprise search systems powered by latest NLP & open-source.

GitHub Events

Total
  • Issues event: 1
  • Watch event: 5
  • Issue comment event: 1
  • Pull request event: 1
  • Fork event: 1
Last Year
  • Issues event: 1
  • Watch event: 5
  • Issue comment event: 1
  • Pull request event: 1
  • Fork event: 1

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 25
  • Total Committers: 3
  • Avg Commits per committer: 8.333
  • Development Distribution Score (DDS): 0.16
Past Year
  • Commits: 1
  • Committers: 1
  • Avg Commits per committer: 1.0
  • Development Distribution Score (DDS): 0.0
Top Committers
| Name | Email | Commits |
| --- | --- | --- |
| Tuana Γ‡elik | t****k@d****i | 21 |
| Bilge YΓΌcel | b****l@d****i | 3 |
| Stefano Fiorucci | 4****7 | 1 |

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 2
  • Total pull requests: 4
  • Average time to close issues: 5 months
  • Average time to close pull requests: 9 months
  • Total issue authors: 2
  • Total pull request authors: 3
  • Average comments per issue: 1.5
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: 3 months
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 1.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • smach (1)
  • TuanaCelik (1)
Pull Request Authors
  • leyluj (2)
  • anakin87 (1)
  • TuanaCelik (1)