frog

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

https://github.com/languagemachines/frog

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (3.0%) to scientific vocabulary

Keywords

computational-linguistics dependency-parser dutch folia lemmatiser morphological-analyser morphology named-entity-recognition natural-language-processing nlp pos-tagger syntax text-processing

Keywords from Contributors

learning-algorithm decision-tree ib1 ib1-ig igtree k-nearest-neighbours knn nearest-neighbours timbl punctuation
Last synced: 4 months ago · JSON representation

Repository

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

Basic Info
Statistics
  • Stars: 78
  • Watchers: 15
  • Forks: 10
  • Open Issues: 13
  • Releases: 39
Topics
computational-linguistics dependency-parser dutch folia lemmatiser morphological-analyser morphology named-entity-recognition natural-language-processing nlp pos-tagger syntax text-processing
Created over 11 years ago · Last pushed 5 months ago
Metadata Files
Readme Changelog License Authors Codemeta

README

Please see README.md for more information.

Owner

  • Name: Language Machines
  • Login: LanguageMachines
  • Kind: organization
  • Email: proycon@anaproy.nl
  • Location: Nijmegen, The Netherlands

NLP Research group at Centre for Language Studies, Radboud University Nijmegen

CodeMeta (codemeta.json)

{
  "@context": [
    "https://doi.org/10.5063/schema/codemeta-2.0",
    "http://schema.org",
    "https://w3id.org/software-types",
    "https://w3id.org/software-iodata"
  ],
  "@type": "SoftwareSourceCode",
  "identifier": "frog",
  "name": "Frog",
  "version": "0.34",
  "description": "Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It performs automatic linguistic enrichment such as part of speech tagging, lemmatisation, named entity recognition, shallow parsing, dependency parsing and morphological analysis. All NLP modules are based on TiMBL.",
  "license": "https://spdx.org/licenses/GPL-3.0-only",
  "url": "https://languagemachines.github.io/frog",
  "thumbnailUrl": "https://raw.githubusercontent.com/LanguageMachines/frog/master/logo.svg",
  "producer": {
    "@id": "https://huc.knaw.nl",
    "@type": "Organization",
    "name": "KNAW Humanities Cluster",
    "url": "https://huc.knaw.nl",
    "parentOrganization": {
      "@id": "https://knaw.nl",
      "@type": "Organization",
      "name": "KNAW",
      "url": "https://knaw.nl",
      "location": {
        "@type": "Place",
        "name": "Amsterdam"
      }
    }
  },
  "author": [
    {
      "@type": "Person",
      "givenName": "Ko",
      "familyName": "van der Sloot",
      "email": "ko.vandersloot@let.ru.nl",
      "affiliation": {
        "@id": "https://www.ru.nl/clst",
        "@type": "Organization",
        "name": "Centre for Language and Speech Technology",
        "url": "https://www.ru.nl/clst",
        "parentOrganization": {
          "@id": "https://www.ru.nl/cls",
          "@type": "Organization",
          "name": "Centre for Language Studies",
          "url": "https://www.ru.nl/cls",
          "parentOrganization": {
            "@id": "https://www.ru.nl",
            "name": "Radboud University",
            "@type": "Organization",
            "url": "https://www.ru.nl",
            "location": {
              "@type": "Place",
              "name": "Nijmegen"
            }
          }
        }
      }
    },
    {
      "@id": "https://orcid.org/0000-0003-2493-656X",
      "@type": "Person",
      "givenName": "Antal",
      "familyName": "van den Bosch",
      "email": "antal.vandenbosch@let.ru.nl"
    },
    {
      "@id": "https://orcid.org/0000-0002-1046-0006",
      "@type": "Person",
      "givenName": "Maarten",
      "familyName": "van Gompel",
      "email": "proycon@anaproy.nl",
      "affiliation": [
        {
          "@id": "https://huc.knaw.nl"
        },
        {
          "@id": "https://www.ru.nl/clst"
        }
      ]
    }
  ],
  "sourceOrganization": {
    "@id": "https://www.ru.nl/clst"
  },
  "programmingLanguage": {
    "@type": "ComputerLanguage",
    "identifier": "c++",
    "name": "C++"
  },
  "operatingSystem": [
    "Linux",
    "BSD",
    "macOS"
  ],
  "codeRepository": "https://github.com/LanguageMachines/frog",
  "softwareRequirements": [
    {
      "@type": "SoftwareApplication",
      "identifier": "icu",
      "name": "icu"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "libxml2",
      "name": "libxml2"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "ticcutils",
      "name": "ticcutils"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "timbl",
      "name": "timbl"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "libfolia",
      "name": "libfolia"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "mbt",
      "name": "mbt"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "ucto",
      "name": "ucto"
    }
  ],
  "funding": [
    {
      "@type": "Grant",
      "name": "CLARIN-NL (NWO grant 184.021.003)",
      "url": "https://www.clariah.nl",
      "funder": {
        "@type": "Organization",
        "name": "NWO",
        "url": "https://www.nwo.nl"
      }
    },
    {
      "@type": "Grant",
      "name": "CLARIAH-CORE (NWO grant 184.033.101)",
      "url": "https://www.clariah.nl",
      "funder": {
        "@type": "Organization",
        "name": "NWO",
        "url": "https://www.nwo.nl"
      }
    },
    {
      "@type": "Grant",
      "name": "CLARIAH-PLUS (NWO grant 184.034.023)",
      "funder": {
        "@type": "Organization",
        "name": "NWO",
        "url": "https://www.nwo.nl"
      }
    }
  ],
  "readme": "https://github.com/LanguageMachines/frog/blob/master/README.md",
  "softwareHelp": [
    {
      "@id": "https://frognlp.readthedocs.io",
      "@type": "WebSite",
      "name": "Introduction  frog  documentation",
      "url": "https://frognlp.readthedocs.io"
    }
  ],
  "issueTracker": "https://github.com/LanguageMachines/frog/issues",
  "contIntegration": "https://github.com/LanguageMachines/frog/actions/workflows/frog.yml",
  "releaseNotes": "https://github.com/LanguageMachines/frog/releases",
  "developmentStatus": [
    "https://www.repostatus.org/#active",
    "https://w3id.org/research-technology-readiness-levels#Level9Proven"
  ],
  "keywords": [
    "nlp",
    "natural language processing",
    "pos",
    "lemma",
    "ner",
    "parser",
    "tagger",
    "part-of-speech tagging",
    "lemmatisation",
    "dependency parsing",
    "shallow parsing",
    "dutch"
  ],
  "applicationCategory": [
    "https://vocabs.dariah.eu/tadirah/annotating",
    "https://vocabs.dariah.eu/tadirah/tagging",
    "https://vocabs.dariah.eu/tadirah/namedEntityRecognition",
    "https://vocabs.dariah.eu/tadirah/posTagging",
    "https://vocabs.dariah.eu/tadirah/segmenting",
    "https://vocabs.dariah.eu/tadirah/treeTagging",
    "https://vocabs.dariah.eu/tadirah/contextualizing",
    "https://w3id.org/nwo-research-fields#Linguistics",
    "https://w3id.org/nwo-research-fields#TextualAndContentAnalysis"
  ],
  "referencePublication": [
    {
      "@type": "TechArticle",
      "name": "Frog: A Natural Language Processing Suite for Dutch",
      "author": [
        "Iris Hendrickx",
        "Antal van den Bosch",
        "Maarten van Gompel",
        "Ko van der Sloot",
        "Walter Daelemans"
      ],
      "pageStart": "99",
      "pageEnd": 114,
      "isPartOf": {
        "@type": "PublicationIssue",
        "datePublished": "2016",
        "name": "CLST Technical Report",
        "issue": "16-02",
        "location": "Nijmegen, the Netherlands"
      },
      "url": "https://github.com/LanguageMachines/frog/raw/master/docs/frogmanual.pdf"
    },
    {
      "@type": "ScholarlyArticle",
      "name": "An efficient memory-based morphosyntactic tagger and parser for Dutch",
      "author": [
        "Antal van den Bosch",
        "Bertjan Busser",
        "Sander Canisius",
        "Walter Daelemans"
      ],
      "pageStart": "99",
      "pageEnd": 114,
      "isPartOf": {
        "@type": "PublicationIssue",
        "datePublished": "2007",
        "name": "Selected Papers of the 17th Computational Linguistics in the Netherlands Meeting",
        "location": "Leuven, Belgium"
      },
      "url": "https://www.clinjournal.org/CLIN_proceedings/XVII/vandenbosch.pdf"
    }
  ],
  "dateCreated": "2011-03-31T12:35:01Z+0000",
  "dateModified": "2023-12-05T15:43:06Z+0100",
  "targetProduct": [
    {
      "@type": "SoftwareLibrary",
      "executableName": "libfrog",
      "name": "libfrog",
      "runtimePlatform": [
        "Linux",
        "BSD",
        "macOS"
      ],
      "description": "Frog Library with API for C++"
    },
    {
      "@type": "CommandLineApplication",
      "executableName": "frog",
      "name": "frog",
      "runtimePlatform": [
        "Linux",
        "BSD",
        "macOS"
      ],
      "description": "Command-line interface to the full NLP suite",
      "consumesData": [
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "text/plain",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld",
            "@type": "Language",
            "name": "Dutch",
            "identifier": "nld"
          }
        },
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "application/folia+xml",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld",
            "@type": "Language",
            "name": "Dutch",
            "identifier": "nld"
          }
        }
      ],
      "producesData": [
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "text/plain",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld",
            "@type": "Language",
            "name": "Dutch",
            "identifier": "nld"
          }
        },
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "application/folia+xml",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld",
            "@type": "Language",
            "name": "Dutch",
            "identifier": "nld"
          }
        }
      ]
    },
    {
      "@type": "CommandLineApplication",
      "executableName": "mbma",
      "runtimePlatform": [
        "Linux",
        "BSD",
        "macOS"
      ],
      "name": "mbma",
      "description": "Memory-based Morphological Analysis (standalone)",
      "consumesData": [
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "text/plain",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld"
          }
        }
      ],
      "producesData": [
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "text/plain",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld"
          }
        }
      ]
    },
    {
      "@type": "CommandLineApplication",
      "executableName": "mblem",
      "name": "mblem",
      "description": "Memory-based Lemmatiser (standalone)",
      "consumesData": [
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "text/plain",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld"
          }
        }
      ],
      "producesData": [
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "text/plain",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld"
          }
        }
      ]
    },
    {
      "@type": "CommandLineApplication",
      "executableName": "ner",
      "name": "ner",
      "description": "Named Entity Recogniser (standalone)",
      "consumesData": [
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "text/plain",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld"
          }
        }
      ],
      "producesData": [
        {
          "@type": "TextDigitalDocument",
          "encodingFormat": "text/plain",
          "inLanguage": {
            "@id": "https://iso639-3.sil.org/code/nld"
          }
        }
      ]
    }
  ]
}

GitHub Events

Total
  • Create event: 1
  • Release event: 1
  • Issues event: 2
  • Watch event: 2
  • Issue comment event: 2
  • Push event: 13
  • Fork event: 2
Last Year
  • Create event: 1
  • Release event: 1
  • Issues event: 2
  • Watch event: 2
  • Issue comment event: 2
  • Push event: 13
  • Fork event: 2

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 1,942
  • Total Committers: 7
  • Avg Commits per committer: 277.429
  • Development Distribution Score (DDS): 0.535
Past Year
  • Commits: 23
  • Committers: 1
  • Avg Commits per committer: 23.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
sloot s****t@1****3 903
Ko van der Sloot K****t@l****l 897
Maarten van Gompel p****n@a****l 123
antalb a****b@1****3 10
Irishx i****s@i****l 5
joostvb j****b@1****3 3
Helmut Grohne h****t@s****e 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 5 months ago

All Time
  • Total issues: 184
  • Total pull requests: 0
  • Average time to close issues: 7 months
  • Average time to close pull requests: N/A
  • Total issue authors: 25
  • Total pull request authors: 0
  • Average comments per issue: 3.45
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: 1 day
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 2.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • proycon (42)
  • kosloot (34)
  • gmjonker (2)
  • barthr (1)
  • Duchadian (1)
  • PanderMusubi (1)
  • sanmai-NL (1)
  • jelmervdl (1)
  • fkunneman (1)
  • matjemeisje (1)
  • olix20 (1)
  • cdfa (1)
  • hannomuller (1)
  • jwijffels (1)
  • tisvonkje (1)
Pull Request Authors
Top Labels
Issue Labels
bug (36) enhancement (25) question (11) MBMA (9) ready (5) low priority (4) testing (4) packaging (3) NER (2) help wanted (2) invalid (2) UCTO (1) wontfix (1) TAGGER (1) waiting (1) PRIORITY (1)
Pull Request Labels

Packages

  • Total packages: 16
  • Total downloads: unknown
  • Total dependent packages: 1
    (may contain duplicates)
  • Total dependent repositories: 0
    (may contain duplicates)
  • Total versions: 70
  • Total maintainers: 1
proxy.golang.org: github.com/LanguageMachines/frog
  • Versions: 19
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 7.0%
Average: 8.2%
Dependent repos count: 9.3%
Last synced: 5 months ago
alpine-edge: frog

Integration of natural language processing models for Dutch

  • Versions: 13
  • Dependent Packages: 1
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 6.0%
Average: 13.3%
Stargazers count: 22.7%
Forks count: 24.4%
Maintainers (1)
Last synced: 4 months ago
alpine-edge: frog-doc

Integration of natural language processing models for Dutch (documentation)

  • Versions: 13
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 14.6%
Average: 15.4%
Stargazers count: 22.7%
Forks count: 24.4%
Maintainers (1)
Last synced: 4 months ago
alpine-edge: frog-dev

Integration of natural language processing models for Dutch (development files)

  • Versions: 13
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 14.6%
Average: 15.4%
Stargazers count: 22.7%
Forks count: 24.4%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.22: frog

Integration of natural language processing models for Dutch

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.19: frog

Integration of natural language processing models for Dutch

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.20: frog-doc

Integration of natural language processing models for Dutch (documentation)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.22: frog-doc

Integration of natural language processing models for Dutch (documentation)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.19: frog-dev

Integration of natural language processing models for Dutch (development files)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.20: frog-dev

Integration of natural language processing models for Dutch (development files)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.22: frog-dev

Integration of natural language processing models for Dutch (development files)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.21: frog-doc

Integration of natural language processing models for Dutch (documentation)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.21: frog

Integration of natural language processing models for Dutch

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.21: frog-dev

Integration of natural language processing models for Dutch (development files)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.20: frog

Integration of natural language processing models for Dutch

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago
alpine-v3.19: frog-doc

Integration of natural language processing models for Dutch (documentation)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 100%
Maintainers (1)
Last synced: 4 months ago

Dependencies

.github/workflows/frog.yml actions
  • Gottox/irc-message-action v2 composite
  • actions/checkout v2 composite
  • styfle/cancel-workflow-action 0.11.0 composite
Dockerfile docker
  • alpine latest build
.github/workflows/cleanup.yml actions
  • Mattraks/delete-workflow-runs v2 composite