eno-flex

Repository of custom R codes to wrangle .lift and .flexttext output of FLEx into a new SFM database to be re-imported into a new FLEx dictionary/lexicon project. The spinoff of this project is available at https://github.com/engganolang/eno-learner-lift

https://github.com/engganolang/eno-flex

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 19 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.1%) to scientific vocabulary

Keywords

data-science data-wrangling fieldworks-language-explorer lexical-database lexicography lexicon r-programming rstats tidyverse xml xml2
Last synced: 7 months ago · JSON representation ·

Repository

Repository of custom R codes to wrangle .lift and .flexttext output of FLEx into a new SFM database to be re-imported into a new FLEx dictionary/lexicon project. The spinoff of this project is available at https://github.com/engganolang/eno-learner-lift

Basic Info
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Topics
data-science data-wrangling fieldworks-language-explorer lexical-database lexicography lexicon r-programming rstats tidyverse xml xml2
Created over 2 years ago · Last pushed about 1 year ago
Metadata Files
Readme License Citation

README.Rmd

---
output: github_document
title: R codes and dataset for processing the Contemporary Enggano FLEx database into the digital and printed Contemporary Enggano dictionary
author: '[Gede Primahadi Wijaya Rajeg](https://www.ling-phil.ox.ac.uk/people/gede-rajeg) ORCID iD icon
University of Oxford, UK & Universitas Udayana, Indonesia' --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) ``` [![The University of Oxford](file-oxweb-logo.gif){width="84"}](https://www.ox.ac.uk/) [![Faculty of Linguistics, Philology and Phonetics, the University of Oxford](file-lingphil.png){width="83"}](https://www.ling-phil.ox.ac.uk/) [![Arts and Humanities Research Council (AHRC)](file-ahrc.png){width="325"}](https://www.ukri.org/councils/ahrc/)
*This work is part of the AHRC-funded grants ([grant ID AH/W007290/1](https://gtr.ukri.org/projects?ref=AH%2FW007290%2F1) and [grant ID AH/S011064/1](https://gtr.ukri.org/projects?ref=AH%2FS011064%2F1)). Visit the [central webpage of the Enggano project](https://enggano.ling-phil.ox.ac.uk/)*.

R codes and dataset for processing the Contemporary Enggano FLEx database into the digital and printed Contemporary Enggano dictionary by Rajeg et al. is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International

[![DOI](https://img.shields.io/badge/DOI-10.25446/oxford.28229732-blue.svg?style=flat&labelColor=whitesmoke&logo=data%3Aimage%2Fpng%3Bbase64%2CiVBORw0KGgoAAAANSUhEUgAAAB8AAAAfCAYAAAAfrhY5AAAJsklEQVR42qWXd1DTaRrHf%2BiB2Hdt5zhrAUKz4IKEYu9IGiGFFJJQ0gkJCAKiWFDWBRdFhCQUF3UVdeVcRQEBxUI3yY9iEnQHb3bdW1fPubnyz%2F11M7lvEHfOQee2ZOYzPyDv%2B3yf9%2Fk95YX4fx%2BltfUt08GcFEuPR4U9hDDZ%2FVngIlhb%2FSiI6InkTgLzgDcgfvtnovhH4BzoVlrbwr55QnhCtBW4QHXnFrZbPBaQoBh4%2FSYH2EnpBEtqcDMVzB93wA%2F8AFwa23XFGcc8CkT3mxz%2BfXWtq9T9IQlLIXYEuHojudb%2BCM7Hgdq8ydi%2FAHiBXyY%2BLjwFlAEnS6Jnar%2FvnQVhvdzasad0eKvWZKe8hvDB2ofLZ%2FZEcWsh%2BhyIuyO5Bxs2iZIE4nRv7NWAb0EO8AC%2FWPxjYAWuOEX2MSXZVgPxzmRL3xKz3ScGpx6p6QnOx4mDIFqO0w6Q4fEhO5IzwxlSwyD2FYHzwAW%2BAZ4fEsf74gCumykwNHskLM7taQxLYjjIyy8MUtraGhTWdkfhkFJqtvuVl%2F9l2ZquDfEyrH8B0W06nnpH3JtIyRGpH1iJ6SfxDIHjRXHJmdQjLpfHeN54gnfFx4W9QRnovx%2FN20aXZeTD2J84hn3%2BqoF2Tqr14VqTPUCIcP%2B5%2Fly4qC%2BUL3sYxSvNj1NwsVYPsWdMUfomsdkYm3Tj0nbV0N1wRKwFe1MgKACDIBdMAhPE%2FwicwNWxll8Ag40w%2BFfhibJkGHmutjYeQ8gVlaN%2BjO51nDysa9TwNUFMqaGbKdRJZFfOJSp6mkRKsv0rRIpEVWjAvyFkxNOEpwvcAVPfEe%2Bl8ojeNTx3nXLBcWRrYGxSRjDEk0VlpxYrbe1ZmaQ5xuT0u3r%2B2qe5j0J5uytiZPGsRL2Jm32AldpxPUNJ3jmmsN4x62z1cXrbedXBQf2yvIFCeZrtyicZZG2U2nrrBJzYorI2EXLrvTfCSB43s41PKEvbZDEfQby6L4JTj%2FfIwam%2B4%2BwucBu%2BDgNK05Nle1rSt9HvR%2FKPC4U6LTfvUIaip1mjIa8fPzykii23h2eanT57zQ7fsyYH5QjywwlooAUcAdOh5QumgTHx6aAO7%2FL52eaQNEShrxfhL6albEDmfhGflrsT4tps8gTHNOJbeDeBlt0WJWDHSgxs6cW6lQqyg1FpD5ZVDfhn1HYFF1y4Eiaqa18pQf3zzYMBhcanlBjYfgWNayAf%2FASOgklu8bmgD7hADrk4cRlOL7NSOewEcbqSmaivT33QuFdHXj5sdvjlN5yMDrAECmdgDWG2L8P%2BAKLs9ZLZ7dJda%2BB4Xl84t7QvnKfvpXJv9obz2KgK8dXyqISyV0sXGZ0U47hOA%2FAiigbEMECJxC9aoKp86re5O5prxOlHkcksutSQJzxZRlPZmrOKhsQBF5zEZKybUC0vVjG8PqOnhOq46qyDTDnj5gZBriWCk4DvXrudQnXQmnXblebhAC2cCB6zIbM4PYgGl0elPSgIf3iFEA21aLdHYLHUQuVkpgi02SxFdrG862Y8ymYGMvXDzUmiX8DS5vKZyZlGmsSgQqfLub5RyLNS4zfDiZc9Edzh%2FtCE%2BX8j9k%2FqWB071rcZyMImne1SLkL4GRw4UPHMV3jjwEYpPG5uW5fAEot0aTSJnsGAwHJi2nvF1Y5OIqWziVCQd5NT7t6Q8guOSpgS%2Fa1dSRn8JGGaCD3BPXDyQRG4Bqhu8XrgAp0yy8DMSvvyVXDgJcJTcr1wQ2BvFKf65jqhvmxXUuDpGBlRvV36XvGjQzLi8KAKT2lYOnmxQPGorURSV0NhyTIuIyqOmKTMhQ%2BieEsgOgpc4KBbfDM4B3SIgFljvfHF6cef7qpyLBXAiQcXvg5l3Iunp%2FWv4dH6qFziO%2BL9PbrimQ9RY6MQphEfGUpOmma7KkGzuS8sPUFnCtIYcKCaI9EXo4HlQLgGrBjbiK5EqMj2AKWt9QWcIFMtnVvQVDQV9lXJJqdPVtUQpbh6gCI2Ov1nvZts7yYdsnvRgxiWFOtNJcOMVLn1vgptVi6qrNiFOfEjHCDB3J%2BHDLqUB77YgQGwX%2Fb1eYna3hGKdlqJKIyiE4nSbV8VFgxmxR4b5mVkkeUhMgs5YTi4ja2XZ009xJRHdkfwMi%2BfocaancuO7h%2FMlcLOa0V%2FSw6Dq47CumRQAKhgbOP8t%2BMTjuxjJGhXCY6XpmDDFqWlVYbQ1aDJ5Cptdw4oLbf3Ck%2BdWkVP0LpH7s9XLPXI%2FQX8ws%2Bj2In63IcRvOOo%2BTTjiN%2BlssfRsanW%2B3REVKoavBOAPTXABW4AL7e4NygHdpAKBscmlDh9Jysp4wxbnUNna3L3xBvyE1jyrGIkUHaqQMuxhHElV6oj1picvgL1QEuS5PyZTEaivqh5vUCKJqOuIgPFGESns8kyFk7%2FDxyima3cYxi%2FYOQCj%2F%2B9Ms2Ll%2Bhn4FmKnl7JkGXQGDKDAz9rUGL1TIlBpuJr9Be2JjK6qPzyDg495UxXYF7JY1qKimw9jWjF0iV6DRIqE%2B%2FeWG0J2ofmZTk0mLYVd4GLiFCOoKR0Cg727tWq981InYynvCuKW43aXgEjofVbxIqrm0VL76zlH3gQzWP3R3Bv9oXxclrlO7VVtgBRpSP4hMFWJ8BrUSBCJXC07l40X4jWuvtc42ofNCxtlX2JH6bdeojXgTh5TxOBKEyY5wvBE%2BACh8BtOPNPkApjoxi5h%2B%2FFMQQNpWvZaMH7MKFu5Ax8HoCQdmGkJrtnOiLHwD3uS5y8%2F2xTSDrE%2F4PT1yqtt6vGe8ldMBVMEPd6KwqiYECHDlfbvzphcWP%2BJiZuL5swoWQYlS%2Br7Yu5mNUiGD2retxBi9fl6RDGn4Ti9B1oyYy%2BMP5G87D%2FCpRlvdnuy0PY6RC8BzTA40NXqckQ9TaOUDywkYsudxJzPgyDoAWn%2BB6nEFbaVxxC6UXjJiuDkW9TWq7uRBOJocky9iMfUhGpv%2FdQuVVIuGjYqACbXf8aa%2BPeYNIHZsM7l4s5gAQuUAzRUoT51hnH3EWofXf2vkD5HJJ33vwE%2FaEWp36GHr6GpMaH4AAPuqM5eabH%2FhfG9zcCz4nN6cPinuAw6IHwtvyB%2FdO1toZciBaPh25U0ducR2PI3Zl7mokyLWKkSnEDOg1x5fCsJE9EKhH7HwFNhWMGMS7%2BqxyYsbHHRUDUH4I%2FAheQY7wujJNnFUH4KdCju83riuQeHU9WEqNzjsJFuF%2FdTDAZ%2FK7%2F1WaAU%2BAWymT59pVMT4g2AxcwNa0XEBDdBDpAPvgDIH73R25teeuAF5ime2Ul0OUIiG4GpSAEJeYW9wDTf43wfwHgHLKJoPznkwAAAABJRU5ErkJggg%3D%3D)](http://dx.doi.org/10.25446/oxford.28229732) [![DOI](https://zenodo.org/badge/681818609.svg)](https://doi.org/10.5281/zenodo.14656622) [![](https://img.shields.io/badge/DOI-10.17605/OSF.IO/4BHMU-lightblue.svg)](https://doi.org/10.17605/OSF.IO/4BHMU) ## Description How to cite this repository (in APA 7^th^): > Rajeg, G. P. W., Hemmings, C., Sangian, E. Z., Wijaya, D., Pramartha, C., & Arka, I. W. (2024). *R codes and dataset for processing the Contemporary Enggano FLEx database into the [digital](https://doi.org/10.25446/oxford.28188665) and [printed](https://doi.org/10.25446/oxford.28022312.v1) Contemporary Enggano dictionary* (Version 0.0.1) [Computer software]. University of Oxford. https://doi.org/10.25446/oxford.28229732 https://github.com/engganolang/eno-flex This repository records the R codes I use to process the LIFT output of FLEx Lexicon and integrate it with the .flextext output of the interlinear texts to build the SFM format to be imported back to FLEx. For my personal use, I put my notes [here](https://github.com/engganolang/eno-flex/blob/main/contemporary-enggano-interlinear-text/README.md). These processing are needed to automate the linking between the root and the complex forms (as sub-entries of the roots), including adding the example sentences and source references of these examples. Here is **how to cite** the open-access PDF of the [print dictionary](https://www.zaraabadipublisher.com/2024/12/kamus-bahasa-enggano_64.html) (in APA 7^th^) based on the R codes and dataset processed in this GitHub repository: > Rajeg, G. P. W., Hemmings, C., Sangian, E. Z., Wijaya, D., & Arka, I W. (*with* Milson Kaitora, Harun Kaharubi, M. Raflizen Kaitora (alm.), Aron Kaitora (alm.), Johansen Kaharubi, Ishar Timius Kaitora, Marlansius Kaharubi, Adam Kurniawan Kauno, & Resiawati Kaitora). (2025). *Kamus Bahasa Enggano* (1st ed.). Zara Abadi; University of Oxfords Sustainable Digital Scholarship. https://doi.org/10.25446/oxford.28022312 The digital, web-based and mobile-responsive Contemporary Enggano dictionary is available at https://enggano.cirhss.org/ and **below is how to cite** this digital Enggano dictionary (in APA 7^th^): > Rajeg, G. P. W., Hemmings, C., Pramartha, C. R. A., Sangian, E. Z., Wijaya, D., Ogilvie, S., Kraue, D., Arka, I. W., Dalrymple, M., Nothofer, B., Artanta Wibawa, P. W., Kusuma, P. A. D., Mahardika Adi Putra, I. P. G., & Gotra, A. A. N. M. A. (*with* Milson Kaitora, Harun Kaharubi, M. Raflizen Kaitora (alm.), Aron Kaitora (alm.), Johansen Kaharubi, Ishar Timius Kaitora, Marlansius Kaharubi, Adam Kurniawan Kauno, & Resiawati Kaitora). (2025). *Kamus Digital Bahasa Enggano*. University of Oxford & Centre for Interdisciplinary Research on the Humanities and Social Sciences (CIRHSS), Udayana University; University of Oxfords Sustainable Digital Scholarship. https://doi.org/10.25446/oxford.28188665 https://enggano.cirhss.org/ *NOTE*: The original, contemporary Enggano FLEx database needs to be cited independently from this repository: > Hemmings, Charlotte, Engga Zakaria Sangian, Erik Zobel, Gede Primahadi Wijaya Rajeg. 2024. Contemporary Enggano FLEx database. Unpublished corpus. https://enggano.ling-phil.ox.ac.uk/

Owner

  • Login: engganolang
  • Kind: user

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: >-
  R codes and dataset for processing the Contemporary
  Enggano FLEx database into the digital and printed
  Contemporary Enggano dictionary
message: >-
  If you use and adapt any materials from this repository,
  please cite this repository using the metadata from this
  .cff file as well as the Contemporary Enggano FLEx
  database
type: software
authors:
  - given-names: Gede Primahadi Wijaya
    family-names: Rajeg
    affiliation: University of Oxford; Udayana University
    orcid: 'https://orcid.org/0000-0002-2047-8621'
    email: primahadi_wijaya@unud.ac.id
  - given-names: Charlotte
    family-names: Hemmings
    affiliation: University of Oxford
    orcid: 'https://orcid.org/0000-0002-3076-5544'
    email: charlotte.hemmings@ling-phil.ox.ac.uk
  - given-names: Engga Zakaria
    family-names: Sangian
    affiliation: Universitas Dehasen Bengkulu
    orcid: 'https://orcid.org/0009-0000-8802-6819'
    email: ezs21072@gmail.com
  - given-names: Dendi
    family-names: Wijaya
    affiliation: National Research and Innovation Agency
    orcid: 'https://orcid.org/0000-0002-8767-9364'
    email: dendi0587@gmail.com
  - given-names: Cokorda
    family-names: Pramartha
    affiliation: Udayana University
    email: cokorda@unud.ac.id
    orcid: 'https://orcid.org/0000-0002-2835-3989'
  - given-names: I Wayan
    family-names: Arka
    affiliation: Australian National University
    orcid: 'https://orcid.org/0000-0002-2819-6186'
    email: wayan.arka@anu.edu.au
identifiers:
  - type: doi
    value: 10.17605/OSF.IO/4BHMU
    description: Open Science Framework (OSF)
repository-code: 'https://github.com/engganolang/eno-flex'
url: 'https://enggano.ling-phil.ox.ac.uk/'
repository: 'https://osf.io/4bhmu/'
abstract: >-
  This repository records the R codes that the author uses
  to process the LIFT output of FLEx Lexicon and integrate
  it with the .flextext output of the interlinear texts to
  build the SFM format to be imported back into FLEx.


  These processes are needed to automate the linking between
  the root and the complex forms (as subentries of the
  roots), including adding the example sentences (for both
  the root and subentries) and source references of these
  examples.
keywords:
  - Enggano
  - R programming
  - Fieldwork Language Explorer
  - FLEx
  - endangered language
  - Indonesian language
  - data science
  - tidyverse
  - xml
  - lexical database
  - lexicography
  - computational lexicography
  - dictionary
  - Enggano dictionary
license: CC-BY-NC-SA-4.0
date-released: '2024-12-17'

GitHub Events

Total
  • Release event: 1
  • Issues event: 1
  • Issue comment event: 1
  • Push event: 38
Last Year
  • Release event: 1
  • Issues event: 1
  • Issue comment event: 1
  • Push event: 38