https://github.com/airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
libre-chat
🦙 Free and Open Source Large Language Model (LLM) chatbot web UI and API. Self-hosted, offline capable and easy to set up.
self-hosted-datamanagement
Teaching materials for the course "Your Own Nextcloud with the Raspberry Pi: Self-hosted Data Management"
https://github.com/khoj-ai/khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
https://github.com/learningcircuit/local-deep-research
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and your private documents. Everything Local.
ghost-ssg
A Docker-based pipeline to publish the content of a local Ghost 4 server as static pages.
https://github.com/av/harbor
Effortlessly run LLM backends, APIs, frontends, and services with one command.
datahub
Self-hostable, open-source engine for reproducible data harmonization, dataset building & exploration
https://github.com/amirzenoozi/git-hook-listener
Connect your GitHub projects to Telegram, easy peasy
https://github.com/msgbyte/tianji
Tianji: Insight into everything. Website Analytics + Uptime Monitor + Server Status. Not just another GA alternative
tekst
A collaborative research platform for resources on natural language texts
https://github.com/prismelabs/analytics
High-performance, self-hosted and privacy-focused web analytics service.