alembica
Open science package for LLM-powered semantic synthesis and precise extraction of information from unstructured texts.
Science Score: 67.0%
This score indicates how likely this project is to be science-related, based on the following indicators:
- ✓ CITATION.cff file: found
- ✓ codemeta.json file: found
- ✓ .zenodo.json file: found
- ✓ DOI references: 7 DOI reference(s) found in README
- ✓ Academic publication links: links to zenodo.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (12.1%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: open-and-sustainable
- License: agpl-3.0
- Language: Go
- Default Branch: main
- Homepage: https://open-and-sustainable.github.io/alembica/
- Size: 194 KB
Statistics
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 5
- Releases: 8
Metadata Files
README.md
alembica
Open Science Software for Semantic Synthesis and Extraction of Information from Unstructured Sources.
About
alembica simplifies the use of Large Language Models (LLMs) to extract structured datasets from unstructured corpora of text.
It provides a flexible and scalable framework to process, synthesize, and transform textual information into structured formats suitable for analysis and further processing.
Installation (Go)
To install alembica in Go, run:
    go get github.com/open-and-sustainable/alembica
If you want to use alembica in other programming languages, check out the C-Shared Library in the User Guide.
Documentation
User Guide – Learn how to use alembica in different programming languages.
API Reference – Explore the Go package documentation.
Features
- Validation of Input – Ensures that queries are correctly formatted to support proper interaction with models.
- Cost Assessment – Calculates token costs based on the requested extraction and different model pricing.
- Data Extraction – Processes unstructured text and transforms it into structured datasets for further analysis.
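The first two features above can be illustrated with a minimal, self-contained sketch. It does not use alembica's API: the `Query` type, the model names, the prices, and the helper functions are all hypothetical, and the token count is a crude heuristic rather than a real tokenizer. It only shows the general idea of rejecting a malformed query and then estimating its input cost from a per-model price.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// Query is a hypothetical request shape, not alembica's actual input schema.
type Query struct {
	Model  string // model identifier, looked up in the price table
	Prompt string // instruction sent to the model
	Text   string // unstructured text to extract from
}

// pricePerMillionTokens holds made-up input prices in USD, for illustration only.
var pricePerMillionTokens = map[string]float64{
	"example-small": 0.50,
	"example-large": 5.00,
}

// validateQuery rejects queries that could not sensibly be sent to a model.
func validateQuery(q Query) error {
	if strings.TrimSpace(q.Prompt) == "" {
		return errors.New("prompt must not be empty")
	}
	if strings.TrimSpace(q.Text) == "" {
		return errors.New("text must not be empty")
	}
	if _, ok := pricePerMillionTokens[q.Model]; !ok {
		return fmt.Errorf("unknown model %q", q.Model)
	}
	return nil
}

// estimateTokens assumes roughly four bytes per token, rounded up.
// Real tokenizers are model-specific and will give different counts.
func estimateTokens(s string) int {
	return (len(s) + 3) / 4
}

// estimateCost returns the approximate input cost in USD for one query.
func estimateCost(q Query) (float64, error) {
	if err := validateQuery(q); err != nil {
		return 0, err
	}
	tokens := estimateTokens(q.Prompt) + estimateTokens(q.Text)
	return float64(tokens) * pricePerMillionTokens[q.Model] / 1_000_000, nil
}

func main() {
	q := Query{
		Model:  "example-small",
		Prompt: "List the authors.",
		Text:   "Written by A. Smith and B. Jones.",
	}
	cost, err := estimateCost(q)
	if err != nil {
		fmt.Println("invalid query:", err)
		return
	}
	fmt.Printf("estimated input cost: $%.8f\n", cost)
}
```

Validating before pricing means a malformed query never reaches the cost step. For the actual input format and function names, refer to the User Guide and API Reference.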
Authors & Contributions
Author: Riccardo Boero - ribo@nilu.no
Contributions are welcome!
License
alembica is licensed under the GNU AFFERO GENERAL PUBLIC LICENSE, Version 3.

Citation
Boero, R. (2025). alembica - Open Science Software for Semantic Synthesis and Extraction of Information from Unstructured Sources. Zenodo. https://doi.org/10.5281/zenodo.14899666
Owner
- Name: Open and Sustainable
- Login: Open-and-Sustainable
- Kind: organization
- Repositories: 1
- Profile: https://github.com/Open-and-Sustainable
Citation (CITATION.cff)
cff-version: 1.2.0
title: alembica - Open Science Software for Semantic Synthesis and Extraction of Information from Unstructured Sources.
message: >-
  v0.0.8 - formatting and typos.
  Fixed formatting issues and spelling errors, filled in docstrings and tests.
type: software
doi: 10.5281/zenodo.14899666
authors:
  - given-names: Riccardo
    family-names: Boero
    email: ribo@nilu.no
    orcid: 'https://orcid.org/0000-0002-7468-9096'
    affiliation: NILU
identifiers:
  - type: doi
    value: 10.5281/zenodo.14899666
    description: This DOI represents all software versions.
repository-code: 'https://github.com/open-and-sustainable/alembica'
url: 'https://open-and-sustainable.github.io/alembica/'
abstract: >-
  `alembica` simplifies the use of Large Language Models (LLMs) to extract structured data sets from unstructured corpora of text. It provides a flexible and scalable framework to process, synthesize, and transform textual information of any type. The output is structured into formats suitable for analysis and further processing.
keywords:
  - Generative AI
  - Large Language Models
  - Open Source
  - Open Science
  - Semantic Synthesis
  - Information Extraction
license: AGPL-3.0-only
GitHub Events
Total
- Create event: 12
- Release event: 6
- Issues event: 24
- Watch event: 1
- Delete event: 6
- Issue comment event: 3
- Push event: 86
- Pull request event: 13
Last Year
- Create event: 12
- Release event: 6
- Issues event: 24
- Watch event: 1
- Delete event: 6
- Issue comment event: 3
- Push event: 86
- Pull request event: 13
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 14
- Total pull requests: 8
- Average time to close issues: 18 days
- Average time to close pull requests: about 1 hour
- Total issue authors: 1
- Total pull request authors: 1
- Average comments per issue: 0.07
- Average comments per pull request: 0.0
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 14
- Pull requests: 8
- Average time to close issues: 18 days
- Average time to close pull requests: about 1 hour
- Issue authors: 1
- Pull request authors: 1
- Average comments per issue: 0.07
- Average comments per pull request: 0.0
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- ricboer0 (13)
Pull Request Authors
- ricboer0 (6)
Packages
- Total packages: 1
- Total downloads: unknown
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 8
proxy.golang.org: github.com/open-and-sustainable/alembica
- Homepage: https://github.com/open-and-sustainable/alembica
- Documentation: https://pkg.go.dev/github.com/open-and-sustainable/alembica#section-documentation
- License: AGPL-3.0
- Latest release: v0.0.8 (published 9 months ago)