glide

🐦 A open blazing-fast simple model gateway for rapid development of production GenAI apps

https://github.com/einstack/glide

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (16.0%) to scientific vocabulary

Keywords

ai gateway gateway-api genai generative-ai glide go llm llmops ml mlops router

Last synced: 6 months ago · JSON representation ·

Repository

🐦 A open blazing-fast simple model gateway for rapid development of production GenAI apps

Basic Info

Host: GitHub
Owner: EinStack
License: apache-2.0
Language: Go
Default Branch: develop
Homepage: https://docs.einstack.ai/glide/
Size: 3.43 MB

Statistics

Stars: 144
Watchers: 5
Forks: 20
Open Issues: 68
Releases: 10

Topics

ai gateway gateway-api genai generative-ai glide go llm llmops ml mlops router

Created about 2 years ago · Last pushed over 1 year ago

Metadata Files

Readme Changelog Contributing License Code of conduct Citation Security Roadmap

Glide: Cloud-Native LLM Gateway for Seamless LLMOps

Glide is your go-to cloud-native LLM gateway, delivering high-performance LLMOps in a lightweight, all-in-one package.

We take all problems of managing and communicating with external providers out of your applications, so you can dive into tackling your core challenges.

[!Important] Glide is under active development right now 🛠️

Give us a star⭐ to support the project and watch👀 our repositories not to miss any update. Appreciate your interest 🙏

Glide sits between your application and model providers to seamlessly handle various LLMOps tasks like model failover, caching, key management, etc.

Check out our documentation!

Features

Unified REST API across providers. Avoid vendor lock-in and changes in your applications when you swap model providers.
High availability and resiliency when working with external model providers. Automatic fallbacks on provider failures, rate limits, transient errors. Smart retries to reduce communication latency.
Support popular LLM providers.
High performance. Performance is our priority. We want to keep Glide "invisible" for your latency-wise, while providing rich functionality.
Production-ready observability via OpenTelemetry, emit metrics on models health, allows whitebox monitoring (coming soon)
Straightforward and simple maintenance and configuration, centralized API key control & management & rotation, etc.

Large Language Models

| Provider | Supported Capabilities | |-----------------------------------------------------------------------|-------------------------------------------| | OpenAI | ✅ Chat
✅ Streaming Chat | | Anthropic | ✅ Chat
🏗️ Streaming Chat (coming soon) | | Azure OpenAI | ✅ Chat
✅ Streaming Chat | | AWS Bedrock (Titan) | ✅ Chat | | Cohere | ✅ Chat
✅ Streaming Chat | | Google Gemini | 🏗️ Chat (coming soon) | | OctoML | ✅ Chat | | Ollama | ✅ Chat |

Get Started

Installation

[!Note] Windows users should follow an instruction right from the demo README file that specifies how to do the steps without the make command as Windows doesn't come with it by default.

The easiest way to deploy Glide is to our demo repository and docker-compose.

1. Clone the demo repository

bash git clone https://github.com/EinStack/glide-demo.git

2. Init Configs

The demo repository comes with a basic config. Additionally, you need to init your secrets by running:

bash make init # from the demo root

This will create the secrets directory with one .OPENAI_API_KEY file that you need to put your key to.

3. Start Glide

After that, just use docker compose via this command to start your demo environment:

bash make up

4. Sample API Request to `/chat` endpoint

See API Reference for more details.

json { "message": { "role": "user", "content": "Where was it played?" }, "message_history": [ {"role": "system", "content": "You are a helpful assistant."}, {"role": "user", "content": "Who won the world series in 2020?"}, {"role": "assistant", "content": "The Los Angeles Dodgers won the World Series in 2020."} ] }

API Docs

Finally, Glide comes with OpenAPI documentation that is accessible via http://127.0.0.1:9099/v1/swagger

That's it 🙌

Use our documentation to further learn about Glide capabilities and configs.

Other ways to install Glide are available:

Homebrew (MacOS)

bash brew tap einstack/tap brew install einstack/tap/glide

Snapcraft (Linux)

bash snap install glide

To upgrade the already installed package, you just need to run:

bash snap refresh glide

Detailed instruction on Snapcraft installation for different Linux distos:

Docker Images

Glide provides official images in our GHCR & DockerHub:

Alpine 3.19: bash docker pull ghcr.io/einstack/glide:latest-alpine
Ubuntu 22.04 LTS: bash docker pull ghcr.io/einstack/glide:latest-ubuntu
Google Distroless (non-root) bash docker pull ghcr.io/einstack/glide:latest-distroless
RedHat UBI 8.9 Micro bash docker pull ghcr.io/einstack/glide:latest-redhat

Helm Chart

Add the EinStack repository:

bash helm repo add einstack https://einstack.github.io/charts helm repo update

Before installing the Helm chart, you need to create a Kubernetes secret with your API keys like:

bash kubectl create secret generic api-keys --from-literal=OPENAI_API_KEY=sk-abcdXYZ

Then, you need to create a custom values.yaml file to override the secret name like:

```yaml

save as custom.values.yaml, for example

glide: apiKeySecret: "api-keys" ```

Finally, you should be able to install Glide's chart via:

helm upgrade glide-gateway einstack/glide --values custom.values.yaml --install

SDKs

To let you work with Glide's API with ease, we are going to provide you with SDKs that fits your tech stack:

Python (coming soon)
NodeJS (coming soon)

Core Concepts

Routers

Routers are a core functionality of Glide. Think of routers as a group of models with some predefined logic. For example, the resilience router allows a user to define a set of backup models should the initial model fail. Another example, would be to leverage the least-latency router to make latency sensitive LLM calls in the most efficient manner.

Detailed info on routers can be found here.

Available Routers

| Router | Description | |---------------|-----------------| | Priority | When the target model fails the request is sent to the secondary model. The entire service instance keeps track of the number of failures for a specific model reducing latency upon model failure | | Least Latency | This router selects the model with the lowest average latency over time. If the least latency model becomes unhealthy, it will pick the second the best, etc. | | Round Robin | Split traffic equally among specified models. Great for A/B testing. | | Weighted Round Robin | Split traffic based on weights. For example, 70% of traffic to Model A and 30% of traffic to Model B. |

Community

Join Discord for real-time discussion

Open an issue or start a discussion if there is a feature or an enhancement you'd like to see in Glide.

Contribute

Maintainers
- Roman Hlushko, Software Engineer, Distributed Systems & MLOps
- Max Krueger, Data & ML Engineer

Thanks everyone for already put their effort to make Glide better and more feature-rich:

License

Apache 2.0

Owner

Name: EinStack
Login: EinStack
Kind: organization

Repositories: 1
Profile: https://github.com/EinStack

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: Glide
message: 'A lightweight, cloud-Native model gateway'
type: software
authors:
  - given-names: Roman
    family-names: Hlushko
    email: roman.glushko.m@gmail.com
  - given-names: Max
    family-names: Krueger
repository-code: 'https://github.com/EinStack/glide'
repository-artifact: 'https://github.com/EinStack/glide/packages'
abstract: >-
  Glide is your go-to cloud-native model gateway, delivering
  high-performance production-ready LLMLOps in a
  lightweight, all-in-one service.
keywords:
  - generative-ai
  - llmops
  - mlops
  - gateway
  - infrastructure
  - distributed-system
  - llms
license: Apache-2.0

GitHub Events

Total

Watch event: 26
Fork event: 3

Last Year

Watch event: 26
Fork event: 3

Packages

Total packages: 1
Total downloads: unknown

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 0

proxy.golang.org: github.com/EinStack/glide

Homepage: https://github.com/EinStack/glide
Documentation: https://pkg.go.dev/github.com/EinStack/glide#section-documentation
License: Apache-2.0

Versions: 0
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Dependent packages count: 5.8%

Average: 6.0%

Dependent repos count: 6.2%

Last synced: 10 months ago