apertium-sme-nob
Apertium translation pair for Northern Sámi and Norwegian Bokmål
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ✓ DOI references: Found 4 DOI reference(s) in README
- ○ Academic publication links
- ○ Committers with academic emails
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (9.7%) to scientific vocabulary
Keywords
apertium-trunk
Keywords from Contributors
apertium-nursery
Last synced: 7 months ago
Repository
Basic Info
- Host: GitHub
- Owner: apertium
- License: gpl-2.0
- Language: XML
- Default Branch: master
- Homepage: http://wiki.apertium.org/wiki/Apertium-sme-nob
- Size: 1000 MB
Statistics
- Stars: 4
- Watchers: 12
- Forks: 0
- Open Issues: 2
- Releases: 0
Topics
apertium-trunk
Created about 8 years ago · Last pushed 7 months ago
Metadata Files
Readme
Changelog
License
Citation
Authors
README
Northern Sámi and Norwegian Bokmål
apertium-sme-nob
===============================================================================
This is an Apertium language pair for translating between Northern
Sámi and Norwegian Bokmål. What you can use this language package for:
* Translating from Northern Sámi to Norwegian Bokmål
* Morphological analysis of Northern Sámi
* Part-of-speech tagging of Northern Sámi
For information on the latter two points, see the section "For more
information" below. For analysis and POS-tagging of Bokmål, see
https://wiki.apertium.org/wiki/apertium-nno-nob
Requirements
===============================================================================
You will need the following software installed:
* lttoolbox (>= 3.1.2)
* apertium (>= 3.1.1)
* vislcg3 (>= 0.9.7.8188)
* foma (last tested with SVN revision 49)
* hfst3 (last tested with SVN revision 2174, configured with
./configure --enable-lexc --enable-proc --with-foma)
If any of this is unfamiliar, we recommend starting at https://wiki.apertium.org
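As a quick sanity check before compiling, the sketch below reports which of the required tools are missing from PATH. The executable names (lt-comp, lt-proc, apertium, vislcg3, hfst-lexc, hfst-proc, foma) are assumptions about what the packages listed above install, and the function check_tools is a hypothetical helper, not part of this package. It does not check version numbers.

```shell
#!/bin/sh
# Hypothetical helper: print one line per command that is not on PATH.
# Returns 0 if every command was found, 1 otherwise.
check_tools() {
    missing=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool"
            missing=1
        fi
    done
    return "$missing"
}

# Assumed executable names for lttoolbox, apertium, vislcg3, hfst3 and foma.
if check_tools lt-comp lt-proc apertium vislcg3 hfst-lexc hfst-proc foma; then
    echo "all required tools found"
fi
```

Any tool reported as missing has to be installed (or added to PATH) before running ./configure.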
Compiling
===============================================================================
Once the requirements are installed, you should be able to run:
$ ./configure
$ make
# make install
You can use ./autogen.sh instead of ./configure if you're compiling
from SVN. If you're using a --prefix to ./configure or ./autogen.sh,
make sure it's the same one you used to install apertium itself.
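One way to find the prefix that apertium itself was installed under is to look at where its executable lives. The sketch below assumes the usual layout in which the binary sits at PREFIX/bin/apertium; prefix_of is a hypothetical helper, and the printed ./configure line is only a suggestion to review before running it.

```shell
#!/bin/sh
# Hypothetical helper: derive an installation prefix from the path of an
# executable, assuming the layout PREFIX/bin/NAME.
# Example: /usr/local/bin/apertium -> /usr/local
prefix_of() {
    dirname "$(dirname "$1")"
}

apertium_path=$(command -v apertium || true)
if [ -n "$apertium_path" ]; then
    # Suggest a configure invocation that reuses apertium's prefix.
    echo "./configure --prefix=$(prefix_of "$apertium_path")"
else
    echo "apertium not found on PATH" >&2
fi
```

If apertium was installed from distribution packages, the prefix is typically /usr and no --prefix option is needed.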
Testing
===============================================================================
If you are in the source directory after running make, the following
commands should work:
$ echo "Mus lea oahpahus gaskkal guovtti ja njealji" | apertium -d . sme-nob
Jeg har undervisning mellom to og fire
The following commands run tests which are on the Apertium wiki page:
$ sh regression-tests.sh
$ sh pending-tests.sh
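The sample translation shown earlier in this section can also be turned into a small scripted check. The sketch below is not one of the test scripts shipped with this package; expect_output is a hypothetical helper that pipes an input line through a command and compares the result with an expected string. It assumes it is run from the source directory after make.

```shell
#!/bin/sh
# Hypothetical helper: feed INPUT to a command on stdin and compare its
# output with EXPECTED. Usage: expect_output INPUT EXPECTED COMMAND [ARGS...]
expect_output() {
    input=$1
    expected=$2
    shift 2
    actual=$(printf '%s\n' "$input" | "$@")
    if [ "$actual" = "$expected" ]; then
        echo "PASS: $input"
    else
        echo "FAIL: $input"
        echo "  expected: $expected"
        echo "  got:      $actual"
        return 1
    fi
}

# Only attempt the translation if apertium is available.
if command -v apertium >/dev/null 2>&1; then
    expect_output "Mus lea oahpahus gaskkal guovtti ja njealji" \
                  "Jeg har undervisning mellom to og fire" \
                  apertium -d . sme-nob || echo "translation differs from the README example"
fi
```

A failure here does not necessarily mean the build is broken, since later changes to the dictionaries or rules may change the translation of the sample sentence.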
Files and data
===============================================================================
Bilingual files:
* sme-nob.prob - Tagger model for Sámi
* apertium-sme-nob.sme-nob.lex - Constraint Grammar WSD rules for Sámi
* apertium-sme-nob.sme-nob.dix - Bilingual dictionary
* apertium-sme-nob.sme-nob.t1x - Chunking rules for translating into Bokmål
* apertium-sme-nob.sme-nob.t2x - Interchunk1 rules for translating into Bokmål
* apertium-sme-nob.sme-nob.t3x - Interchunk2 rules for translating into Bokmål
* apertium-sme-nob.sme-nob.t4x - Postchunk rules for translating into Bokmål
* apertium-sme-nob.sme-nob.val - Valency rules for Sámi
* apertium-sme-nob.nob.dix - Monolingual dictionary for Bokmål
* modes.xml - Translation modes
Monolingual files:
* The nob generator is found in the apertium-nob repository of the same GitHub organization
* The sme analyser is found in a repository at UiT The Arctic University of Norway, see:
https://wiki.apertium.org/wiki/Northern_Sámi_and_Norwegian/Installation
http://giellatekno.uit.no/doc/infra/GettingStarted.html
For more information
===============================================================================
* https://wiki.apertium.org/wiki/Installation
* https://wiki.apertium.org/wiki/apertium-sme-nob
* https://wiki.apertium.org/wiki/Using_an_lttoolbox_dictionary
* https://wiki.apertium.org/wiki/HFST
* https://wiki.apertium.org/wiki/Constraint_Grammar
Citing
===============================================================================
Academic users of this package are requested to cite the following article:
@inproceedings{trosterud2012evaluating,
address = {Gothenburg, Sweden},
author = {Trosterud, Trond and Unhammer, Kevin Brubeck},
booktitle = {Proceedings of the Third International Workshop on Free/Open-Source Rule-Based Machine Translation (FreeRBMT 2012)},
editor = {España-Bonet, Cristina and Ranta, Aarne},
month = {June},
number = {2013:03},
pages = {13--26},
publisher = {Chalmers University of Technology},
title = {{Evaluating North Sámi to Norwegian assimilation RBMT}},
url = {http://www.molto-project.eu/sites/default/files/FreeRBMT-2012.pdf#19},
year = 2012
}
The nob resources used were adapted from the apertium-nno-nob package;
to cite that, please use:
@inproceedings{unhammer2009rfr,
address = {Alicante},
author = {Unhammer, Kevin Brubeck and Trosterud, Trond},
booktitle = {{Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation}},
editor = {Pérez-Ortiz, Juan Antonio and Sánchez-Martínez, Felipe and Tyers, Francis M.},
pages = {35--42},
publisher = {Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos},
title = {{Reuse of Free Resources in Machine Translation between Nynorsk and Bokm{\aa}l}},
url = {http://hdl.handle.net/10045/12025},
year = 2009
}
To cite Apertium, please use the following:
@article{apertium,
year={2011},
issn={0922-6567},
journal={Machine Translation},
volume={25},
number={2},
doi={10.1007/s10590-011-9090-0},
title={Apertium: a free/open-source platform for rule-based machine translation},
url={http://dx.doi.org/10.1007/s10590-011-9090-0},
publisher={Springer Netherlands},
keywords={Free/open-source machine translation; Rule-based machine translation;
Apertium; Shallow transfer; Finite-state transducers},
author={Forcada, Mikel~L. and Ginestí-Rosell, Mireia and Nordfalk, Jacob and O’Regan, Jim and Ortiz-Rojas, Sergio and Pérez-Ortiz, Juan~Antonio and Sánchez-Martínez, Felipe and Ramírez-Sánchez, Gema and Tyers, Francis~M.},
pages={127-144},
language={English}
}
Help and support
===============================================================================
If you need help using this language pair or data, you can contact:
* Mailing list: apertium-stuff@lists.sourceforge.net
* IRC: #apertium on irc.oftc.net
See also the file AUTHORS included in this distribution.
Owner
- Name: Apertium
- Login: apertium
- Kind: organization
- Email: apertium-contact@lists.sourceforge.net
- Website: https://wiki.apertium.org/
- Repositories: 630
- Profile: https://github.com/apertium
Free/open-source platform for developing rule-based machine translation systems and language technology
Citation (CITATION.cff)
authors:
- family-names: Trosterud
given-names: Trond
orcid: "https://orcid.org/0000-0002-2300-2995"
- family-names: Unhammer
given-names: Kevin Brubeck
orcid: "https://orcid.org/0000-0002-2883-1899"
cff-version: 1.2.0
identifiers:
- description: Technical report no 2013:03, Proceedings of a Workshop Held in Gothenburg 14-15 June, 2012
type: url
value: http://www.molto-project.eu/sites/default/files/FreeRBMT-2012.pdf#page=19
keywords:
- Norwegian
- Bokmål
- Sámi
- Saami
- North Saami
- MT
- RBMT
- dictionary
message: If you use this software, please cite it using these metadata.
repository-code: "https://github.com/apertium/apertium-sme-nob"
title: Apertium Northern Sámi–Norwegian Bokmål
version: 0.6.1
preferred-citation:
authors:
- family-names: Trosterud
given-names: Trond
- family-names: Unhammer
given-names: Kevin Brubeck
title: "Evaluating North Sámi to Norwegian assimilation RBMT"
type: article
year: 2012
url: "http://www.molto-project.eu/sites/default/files/FreeRBMT-2012.pdf#page=19"
abstract: "We describe the development and evaluation of a rule-based machine translation (MT) assimilation system from North Sámi to Norwegian Bokmål, built on a combination of Free and Open Source Software (FOSS) resources: the Apertium platform and the Giellatekno HFST lexicon and Constraint Grammar disambiguator. We detail the integration of these and other resources in the system along with the construction of the lexical and structural transfer, and evaluate the translation quality using various methods, focusing on evaluating the users’ comprehension of the text. Finally, some future work is suggested."
license: GPL-3.0-or-later
url: https://github.com/apertium/apertium-sme-nob/
GitHub Events
Total
- Push event: 7
Last Year
- Push event: 7
Committers
Last synced: 11 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| Kevin Brubeck Unhammer | u****r@f****g | 2,139 |
| Lene Antonsen | l****n@u****o | 1,244 |
| Trond Trosterud | t****d@u****o | 402 |
| Francis M. Tyers | f****s@p****m | 151 |
| Linda Wiechetek | l****k@u****o | 49 |
| Berit Merete Nystad Eskonsipo | b****o@g****m | 27 |
| Ritva Nystad | r****d@u****o | 17 |
| Sjur Nørstebø Moshagen | s****n@u****o | 11 |
| Daniel Swanson | a****s@g****m | 4 |
| Børre Gaup | a****s@g****m | 3 |
| Tino Didriksen | m****l@t****m | 2 |
| Mikel L. Forcada | m****f@d****s | 1 |
| Sushain Cherivirala | s****n@s****e | 1 |
| Tanmai Khanna | k****i@g****m | 1 |
| Trond Trosterud | t****0@t****o | 1 |
Committer Domains (Top 20 + Academic)
uit.no: 5
tf-hsl-m0016.bargi.uit.no: 1
skc.name: 1
dlsi.ua.es: 1
tinodidriksen.com: 1
prompsit.com: 1
fsfe.org: 1
Issues and Pull Requests
Last synced: 9 months ago
All Time
- Total issues: 3
- Total pull requests: 0
- Average time to close issues: about 10 hours
- Average time to close pull requests: N/A
- Total issue authors: 3
- Total pull request authors: 0
- Average comments per issue: 2.33
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- unhammer (1)
- sanvila (1)
- jonorthwash (1)