awesome-ai4lam

A list of awesome AI in libraries, archives, and museum collections from around the world πŸ•ΆοΈ

https://github.com/ai4lam/awesome-ai4lam

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • β—‹
    CITATION.cff file
  • βœ“
    codemeta.json file
    Found codemeta.json file
  • βœ“
    .zenodo.json file
    Found .zenodo.json file
  • βœ“
    DOI references
    Found 8 DOI reference(s) in README
  • βœ“
    Academic publication links
    Links to: sciencedirect.com, springer.com, wiley.com, acm.org, zenodo.org
  • β—‹
    Academic email domains
  • β—‹
    Institutional organization owner
  • β—‹
    JOSS paper metadata
  • β—‹
    Scientific vocabulary similarity
    Low similarity (6.9%) to scientific vocabulary

Keywords

ai ai-in-libraries artificial-intelligence awesome awesome-list awesome-lists awesome-resources libraries library-automation library-systems machine-learning machine-learning-projects machinelearning ml-in-libraries natural-language-processing
Last synced: 4 months ago · JSON representation

Repository

A list of awesome AI in libraries, archives, and museum collections from around the world πŸ•ΆοΈ

Basic Info
Statistics
  • Stars: 133
  • Watchers: 9
  • Forks: 11
  • Open Issues: 43
  • Releases: 0
Topics
ai ai-in-libraries artificial-intelligence awesome awesome-list awesome-lists awesome-resources libraries library-automation library-systems machine-learning machine-learning-projects machinelearning ml-in-libraries natural-language-processing
Created about 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme Contributing License Code of conduct Support Codemeta

README.md

Awesome AI for LAM Awesome

A curated list of resources, projects, and tools for using Artificial Intelligence in Libraries, Archives, and Museums.

License Maintained? Last commit GitHub contributors Mastodon Slack

Contents

Introduction

This list is a collection of resources, tools, projects, and other materials for professionals and enthusiasts in the Libraries, Archives, and Museums (LAM) sector. You might also know this as the GLAM (galleries, libraries, archives and museums) or CHI (cultural heritage institutions) sector, or be more familiar with the term 'memory institutions'. However you describe the field, if you know of an AI, machine learning, big data or data science project, event or resource related to collections, please share it here!

This list is maintained by the AI4LAM community. Its aim is to support knowledge sharing, innovation, and collaboration in applying AI to LAM.

Learning ResourcesClick button to suggest an addition

Please note: the appearance of a resource on this list does not constitute an official endorsement by AI4LAM.

Introductions to AI

Computer vision

Natural language processing

Generative AI

AI in galleries, libraries, archives and museums

Other "awesome" lists in AI and ML

Tools and FrameworksClick button to suggest an addition

Note: datasets for training and testing are listed in a separate section of this document.

Document analysis, transcription, and labeling

  • Arkindex – open-source platform for managing & processing collections of digitized documents
  • Callico – open-source web platform for document annotation
  • Coconut Libtool – web-based textual analysis tool designed to assist social scientists, librarians, or anyone in data analysis
  • Distributed Annotation 'n' Enrichment (DANE) – compute task assignment & file storage for automatic annotation of content (CLARIAH, Norway)
  • HTRFLOW demo and associated GitHub repo – explore AI models for Handwritten Text Recogntion (Swedish National Archives)
  • Label Studio – data labeling platform to fine-tune LLMs, prepare training data, or validate AI models
  • OCR correction – OCR correction tools (BibliothΓ¨que nationale, Luxembourg)
  • Surya – multilingual document OCR toolkit with line-level text detection
  • Text models from the National Library of Sweden – available on Hugging Face
  • Transkribus – transcription, recognition, & searching of historical documents

Audio and video analysis, transcription, and labeling

  • Acoustic models from the National Library of Sweden – available on Hugging Face
  • Annotorious – JavaScript image annotation library
  • Audiovisual Metadata Platform (AMP) – generation of metadata for discovery & use of digital audio & video collections (Indiana U., USA)
  • CAMPI – Computer-Aided Metadata Generation for Photo archives Initiative (Carnegie Mellonw U., USA)
  • ELAN – addS textual annotations to audio and/or video recordings (Max Planck Institute for Psycholinguistics, The Netherlands)
  • inaFaceAnalyzer – Python toolbox for face-based description of gender representation in media (Institut National de l'Audiovisuel, France)
  • Newspaper Navigator – explore visual & textual content in the Chronicling America digitized newspaper collection (Library of Congress, USA)
  • Oodi – virtual information assistant (Helsinki Central Library)
  • ReTV – video analysis & summarization (Modul Univesrity, Austria)
  • VGG Image Annotator – manual annotation software for image, audio and video

Indexing and classification

  • Annif and associated tutorial – tool for automated subject indexing and classification (National Library of Finland)

Search and retrieval

Applications of Transformers, LLMs, and GPT

DatasetsClick button to suggest an addition

Datasets available on Hugging Face

There are many (G)LAM-related datasets on Hugging Face. The following links will perform live searches directly in Hugging Face for datasets tagged with the given terms:

Datasets available elsewhere

Projects, Initiatives, and Case StudiesClick button to suggest an addition

Project lists & directories

Select individual projects

Policies and recommendationsClick button to suggest an addition

Statements by organizations and government bodies

Surveys of policies and recommendations

Frameworks

Conferences and WorkshopsClick button to suggest an addition

The annual Fantastic Futures conference is the main conference series for the AI4LAM community. Various other conferences and workshops are relevant to the community and may be included in the list below.

Upcoming Conferences and Workshops

πŸ‘‹πŸ» Note: AI4LAM's conferences tracker Google sheet has a more complete list of events. The following is a list of larger and/or especially relevant events for AI4LAM.

Past Conferences and Workshops

Publications and News SourcesClick button to suggest an addition

Journals and Magazines

News sources

Community

The AI4LAM community's home page is https://ai4lam.org. The secretariat and other contact addresses can be found at the About page.

Contributions

Your help and participation in enhancing this awesome list are very much welcome! Please use the issue ticket system to request additions or changes, or to make other contributions to this repository. For more information, please visit the guidelines for contributing.

Click button to suggest an addition

License

CC0 Logo

The contents of this page are licensed under the Creative Commons CC0 1.0 Universal license. CC0 is a “no rights reserved” license; the authors relinquish copyright and similar rights to the contents of the Awesome AI for LAM list.

Owner

  • Name: AI for Libraries, Archives, and Museums
  • Login: AI4LAM
  • Kind: organization
  • Email: contact@ai4lam.org

AI4LAM is an international, participatory community focused on advancing the use of artificial intelligence in, for and by libraries, archives and museums.

CodeMeta (codemeta.json)

{
  "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
  "@type": "Dataset",
  "name": "Awesome AI for LAM",
  "identifier": "awesome-ai4lam",
  "description": "A curated list of resources, projects, and tools for using Artificial Intelligence in Libraries, Archives, and Museums",
  "datePublished": "2024-02-21",
  "dateCreated": "2023-11-17",
  "author": [
    {
      "@type": "Person",
      "givenName": "Michael",
      "familyName": "Hucka",
      "affiliation": {
        "@type": "Organization",
        "name": "California Institute of Technology Library"
      },
      "email": "mhucka@caltech.edu",
      "@id": "https://orcid.org/0000-0001-9105-5960"
    },
    {
      "@type": "Person",
      "givenName": "Mia",
      "familyName": "Ridge",
      "affiliation": {
        "@type": "Organization",
        "name": "British Library"
      },
      "email": "digitalresearch@bl.uk",
      "@id": "https://orcid.org/0000-0003-3733-8120"
    },
    {
      "@type": "Person",
      "givenName": "Jean-Philippe",
      "familyName": "Moreux",
      "affiliation": {
        "@type": "Organization",
        "name": "Bibliothque nationale de France"
      },
      "email": "jean-philippe.moreux@bnf.fr",
      "@id": "https://orcid.org/0000-0001-6335-0903"
    },
    {
      "@type": "Person",
      "givenName": "Hannah",
      "familyName": "Moutran",
      "affiliation": {
        "@type": "Organization",
        "name": "University of Texas at Austin"
      }
    }
  ],
  "maintainer": [
    {
      "@type": "Person",
      "givenName": "Michael",
      "familyName": "Hucka",
      "affiliation": {
        "@type": "Organization",
        "name": "California Institute of Technology Library"
      },
      "email": "mhucka@caltech.edu",
      "@id": "https://orcid.org/0000-0001-9105-5960"
    }
  ],
  "license": "https://github.com/mhucka/awesome-ai4lam/blob/main/LICENSE",
  "isAccessibleForFree": true,
  "url": "https://ai4lam.github.io/awesome-ai4lam/",
  "codeRepository": "https://github.com/mhucka/awesome-ai4lam",
  "readme": "https://github.com/mhucka/awesome-ai4lam/blob/main/README.md",
  "issueTracker": "https://github.com/mhucka/awesome-ai4lam/issues",
  "keywords": [
    "archives",
    "artificial intelligence",
    "automation",
    "computer vision",
    "handwritten text recognition",
    "HTR",
    "libraries",
    "library information management systems",
    "library systems",
    "machine learning",
    "ML",
    "natural language processing",
    "NLP",
    "OCR",
    "optical character recognition",
    "software"
  ],
  "developmentStatus": "active"
}

GitHub Events

Total
  • Issues event: 27
  • Watch event: 41
  • Fork event: 3
Last Year
  • Issues event: 27
  • Watch event: 41
  • Fork event: 3

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 55
  • Total pull requests: 12
  • Average time to close issues: 5 days
  • Average time to close pull requests: 1 day
  • Total issue authors: 2
  • Total pull request authors: 5
  • Average comments per issue: 0.42
  • Average comments per pull request: 0.58
  • Merged pull requests: 9
  • Bot issues: 54
  • Bot pull requests: 0
Past Year
  • Issues: 19
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 19
  • Bot pull requests: 0
Top Authors
Issue Authors
  • github-actions[bot] (58)
  • mialondon (1)
Pull Request Authors
  • mhucka (5)
  • mialondon (4)
  • altomator (2)
  • cengel (1)
  • hannahmoutran (1)
Top Labels
Issue Labels
bug (58) addition (1) invalid (1)
Pull Request Labels

Dependencies

.github/workflows/markdown-link-tester.yml actions
  • actions/checkout v3 composite
  • lycheeverse/lychee-action v1.8.0 composite
  • peter-evans/create-issue-from-file v4 composite
.github/workflows/markdown-linter.yml actions
  • DavidAnson/markdownlint-cli2-action v13 composite
  • actions/checkout v4 composite