apertium-sme-nob

Apertium translation pair for Northern Sámi and Norwegian Bokmål

https://github.com/apertium/apertium-sme-nob

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.7%) to scientific vocabulary

Keywords

apertium-trunk

Keywords from Contributors

apertium-nursery
Last synced: 7 months ago · JSON representation ·

Repository

Apertium translation pair for Northern Sámi and Norwegian Bokmål

Basic Info
Statistics
  • Stars: 4
  • Watchers: 12
  • Forks: 0
  • Open Issues: 2
  • Releases: 0
Topics
apertium-trunk
Created about 8 years ago · Last pushed 7 months ago
Metadata Files
Readme Changelog License Citation Authors

README

Northern Sámi and Norwegian Bokmål

                            apertium-sme-nob
===============================================================================

This is an Apertium language pair for translating between Northern
Sámi and Norwegian Bokmål. What you can use this language package for:

* Translating from Northern Sámi to Norwegian Bokmål
* Morphological analysis of Northern Sámi
* Part-of-speech tagging of Northern Sámi

For information on the latter two points, see subheading "For more
information" below. For analysis and POS-tagging of Bokmål, see
https://wiki.apertium.org/wiki/apertium-nno-nob

Requirements
===============================================================================

You will need the following software installed:

* lttoolbox (>= 3.1.2)
* apertium (>= 3.1.1)
* vislcg3 (>= 0.9.7.8188)
* foma (last tested with SVN revision 49)
* hfst3 (last tested with SVN revision 2174, configured with
  ./configure --enable-lexc --enable-proc --with-foma)

If this does not make any sense, we recommend you look at: wiki.apertium.org

Compiling
===============================================================================

Given the requirements being installed, you should be able to just run:

$ ./configure 
$ make
# make install

You can use ./autogen.sh instead of ./configure if you're compiling
from SVN. If you're using a --prefix to ./configure or ./autogen.sh,
make sure it's the same one you used to install apertium itself.

Testing
===============================================================================

If you are in the source directory after running make, the following
commands should work:

$ echo "Mus lea oahpahus gaskkal guovtti ja njealji" | apertium -d . sme-nob
Jeg har undervisning mellom to og fire 

The following commands run tests which are on the Apertium wiki page:

$ sh regression-tests.sh 

$ sh pending-tests.sh 

Files and data
===============================================================================

Bilingual files:

* sme-nob.prob                         - Tagger model for Sámi
* apertium-sme-nob.sme-nob.lex         - Constraint Grammar WSD rules for Sámi
* apertium-sme-nob.sme-nob.dix         - Bilingual dictionary 
* apertium-sme-nob.sme-nob.t1x         - Chunking rules for translating into Bokmål
* apertium-sme-nob.sme-nob.t2x         - Interchunk1 rules for translating into Bokmål
* apertium-sme-nob.sme-nob.t3x         - Interchunk2 rules for translating into Bokmål
* apertium-sme-nob.sme-nob.t4x         - Postchunk rules for translating into Bokmål
* apertium-sme-nob.sme-nob.val         - Valency rules for Sámi
* apertium-sme-nob.nob.dix             - Monolingual dictionary for Bokmål
* modes.xml                            - Translation modes

Monolingual files:

* The nob generator is found in apertium-nob in this github
* The sme analyser is found in a repository at UiT The Arctic University of Norway, see:
  https://wiki.apertium.org/wiki/Northern_Sámi_and_Norwegian/Installation
  http://giellatekno.uit.no/doc/infra/GettingStarted.html
  

For more information
===============================================================================

* https://wiki.apertium.org/wiki/Installation
* https://wiki.apertium.org/wiki/apertium-sme-nob
* https://wiki.apertium.org/wiki/Using_an_lttoolbox_dictionary
* https://wiki.apertium.org/wiki/HFST
* https://wiki.apertium.org/wiki/Constraint_Grammar

Citing
===============================================================================

Academic users of this package are requested to cite the following article:

@inproceedings{trosterud2012evaluating,
  address = {Gothenburg, Sweden},
  author = {Trosterud, Trond and Unhammer, Kevin Brubeck},
  booktitle = {Proceedings of the Third International Workshop on Free/Open-Source Rule-Based Machine Translation (FreeRBMT 2012)},
  editor = {España-Bonet, Cristina and Ranta, Aarne},
  month = {June},
  number = {2013:03},
  pages = {13--26},
  publisher = {Chalmers University of Technology},
  title = {{Evaluating North Sámi to Norwegian assimilation RBMT}},
  url = {http://www.molto-project.eu/sites/default/files/FreeRBMT-2012.pdf#19},
  year = 2012
}

The nob resources used were adapted from the apertium-nno-nob package;
to cite that, please use:

@inproceedings{unhammer2009rfr,
  address = {Alicante},
  author = {Unhammer, Kevin Brubeck and Trosterud, Trond},
  booktitle = {{Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation}},
  editor = {Pérez-Ortiz, Juan Antonio and Sánchez-Martínez, Felipe and Tyers, Francis M.},
  pages = {35--42},
  publisher = {Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos},
  title = {{Reuse of Free Resources in Machine Translation between Nynorsk and Bokm{\aa}l}},
  url = {http://hdl.handle.net/10045/12025},
  year = 2009
}

To cite Apertium, please use the following:

@article{apertium,year={2011},
issn={0922-6567},
journal={Machine Translation},
volume={25},
number={2},
doi={10.1007/s10590-011-9090-0},
title={Apertium: a free/open-source platform for rule-based machine translation},
url={http://dx.doi.org/10.1007/s10590-011-9090-0},publisher={Springer Netherlands},
keywords={Free/open-source machine translation; Rule-based machine translation; 
Apertium; Shallow transfer; Finite-state transducers},
author={Forcada, Mikel~L. and Ginestí-Rosell, Mireia and Nordfalk, Jacob and O’Regan, Jim and Ortiz-Rojas, Sergio and Pérez-Ortiz, Juan~Antonio and Sánchez-Martínez, Felipe and Ramírez-Sánchez, Gema and Tyers, Francis~M.},
pages={127-144},
language={English}
}

Help and support
===============================================================================

If you need help using this language pair or data, you can contact:

* Mailing list: apertium-stuff@lists.sourceforge.net
* IRC: #apertium on irc.oftc.net

See also the file AUTHORS included in this distribution.

Owner

  • Name: Apertium
  • Login: apertium
  • Kind: organization
  • Email: apertium-contact@lists.sourceforge.net

Free/open-source platform for developing rule-based machine translation systems and language technology

Citation (CITATION.cff)

authors:
  - family-names: Trosterud
    given-names: Trond
    orcid: "https://orcid.org/0000-0002-2300-2995"
  - family-names: Unhammer
    given-names: Kevin Brubeck
    orcid: "https://orcid.org/0000-0002-2883-1899"
cff-version: 1.2.0
identifiers:
  - description: Technical report no 2013:03, Proceedings of a Workshop Held in Gothenburg 14-15 June, 2012
    type: url
    value: http://www.molto-project.eu/sites/default/files/FreeRBMT-2012.pdf#page=19
keywords:
  - Norwegian
  - Bokmål
  - Sámi
  - Saami
  - North Saami
  - MT
  - RBMT
  - dictionary
message: If you use this software, please cite it using these metadata.
repository-code: "https://github.com/apertium/apertium-sme-nob"
title: Apertium Northern Sámi–Norwegian Bokmål
version: 0.6.1
preferred-citation:
  authors:
    - family-names: Trosterud
      given-names: Trond
    - family-names: Unhammer
      given-names: Kevin Brubeck
  title: "Evaluating North Sámi to Norwegian assimilation RBMT"
  type: article
  year: 2012
  url: "http://www.molto-project.eu/sites/default/files/FreeRBMT-2012.pdf#page=19"
  abstract: "We describe the development and evaluation of a rule-based machine translation (MT) assimilation system from North Śami to Norwegian Bokmål, built on a combination of Free and Open Source Software (FOSS) resources: the Apertium platform and the Giellatekno HFST lexicon and Constraint Grammar disambiguator. We detail the integration of these and other resources in the system along with the construction of the lexical and structural transfer, and evaluate the translation quality using various methods, focusing on evaluating the users’ comprehension of the text. Finally, some future work is suggested."
license: GPL-3.0-or-later
url: https://github.com/apertium/apertium-sme-nob/

GitHub Events

Total
  • Push event: 7
Last Year
  • Push event: 7

Committers

Last synced: 11 months ago

All Time
  • Total Commits: 4,053
  • Total Committers: 15
  • Avg Commits per committer: 270.2
  • Development Distribution Score (DDS): 0.472
Past Year
  • Commits: 48
  • Committers: 3
  • Avg Commits per committer: 16.0
  • Development Distribution Score (DDS): 0.396
Top Committers
Name Email Commits
Kevin Brubeck Unhammer u****r@f****g 2,139
Lene Antonsen l****n@u****o 1,244
Trond Trosterud t****d@u****o 402
Francis M. Tyers f****s@p****m 151
Linda Wiechetek l****k@u****o 49
Berit Merete Nystad Eskonsipo b****o@g****m 27
Ritva Nystad r****d@u****o 17
Sjur Nørstebø Moshagen s****n@u****o 11
Daniel Swanson a****s@g****m 4
Børre Gaup a****s@g****m 3
Tino Didriksen m****l@t****m 2
Mikel L. Forcada m****f@d****s 1
Sushain Cherivirala s****n@s****e 1
Tanmai Khanna k****i@g****m 1
Trond Trosterud t****0@t****o 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 3
  • Total pull requests: 0
  • Average time to close issues: about 10 hours
  • Average time to close pull requests: N/A
  • Total issue authors: 3
  • Total pull request authors: 0
  • Average comments per issue: 2.33
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • unhammer (1)
  • sanvila (1)
  • jonorthwash (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels