Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.3%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: LanguageStructure
  • Default Branch: main
  • Size: 21.8 MB
Statistics
  • Stars: 1
  • Watchers: 3
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 2 years ago · Last pushed about 1 year ago
Metadata Files
Readme Citation

README.md

Corpus Bororo CorBo

Welcome to Corpus Bororo (CorBo) — a curated and linguistically annotated collection designed to support research on the Bororo language and culture. This corpus includes a wide range of text genres, such as traditional stories, myths, fieldwork materials, biblical translations, and ceremonial songs.

Whether you are a linguist, anthropologist, or simply passionate about Indigenous languages, CorBo offers valuable insights into the grammar, lexicon, and sociocultural context of the Bororo people. The corpus is being annotated according to the Universal Dependencies framework and is part of the ongoing development of the Bororo UD-Treebank.

🔗 Access the preliminary version of the corpus here


About the Language

Language | ISO Code | Glottocode ---------|----------|------------ Bororo | bor | boro1282

Bororo, also known as Boe or Bororo Proper, is an Indigenous language spoken in central Brazil, primarily in the state of Mato Grosso. Although it is currently classified as a language isolate, linguistic evidence suggests that it once formed part of a small language family with now-extinct relatives.

Despite a reduced number of active speakers, Bororo remains a vital component of cultural identity and ceremonial life among the Bororo people. Current revitalization efforts include school-based teaching programs, digital tools, and comprehensive documentation projects to support its intergenerational transmission.


Como citar

Ferraz Gerardi, F., Toribio Serrano, L., & Sollberger, D. (2025). Corpus Bororo (CorBo) (v0.3) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15567900

DOI


Em Português

Corpus Bororo CorBo

Bem-vindo ao Corpus Bororo (CorBo) — uma coleção curada e anotada linguisticamente, criada para apoiar a pesquisa sobre a língua e a cultura Bororo. O corpus abrange uma variedade de gêneros textuais, incluindo histórias tradicionais, mitos, material de campo, traduções bíblicas e cantos cerimoniais.

Seja você linguista, antropólogo ou entusiasta de línguas indígenas, o CorBo oferece perspectivas valiosas sobre a gramática, o léxico e o contexto sociocultural do povo Bororo. O corpus está sendo anotado conforme os princípios do Universal Dependencies e integra o desenvolvimento contínuo do Bororo UD-Treebank.

🔗 Acesse a versão preliminar do corpus aqui


Sobre a Língua

Língua | ISO | Glottocode -------|-----|------------ Bororo | bor | boro1282

O Bororo, também conhecido como Boe ou Bororo propriamente dito, é uma língua indígena falada no centro do Brasil, sobretudo no estado de Mato Grosso. Embora atualmente classificada como uma língua isolada, há evidências de que fazia parte de uma pequena família linguística hoje extinta.

Apesar do número reduzido de falantes, o Bororo continua sendo um componente central da identidade cultural e da vida cerimonial do povo Bororo. Iniciativas de revitalização incluem programas de ensino escolar, ferramentas digitais e projetos de documentação com o objetivo de garantir sua transmissão para as futuras gerações.

Owner

  • Name: FFGerardi
  • Login: LanguageStructure
  • Kind: user
  • Location: Germany
  • Company: University

(Computational) Linguist, language structure, syntax, coffee, Amazon (the forest)

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Ferraz Gerardi"
  given-names: "Fabrício"
  orcid: "https://orcid.org/0000-0002-1438-7336"
- family-names: "Sollberger"
  given-names: "Dolores"
  orcid: "https://orcid.org/0009-0000-9584-5767"
- family-names: "Toribio Serrano"
  given-names: "Lucas"
  orcid: "https://orcid.org/0009-0002-2605-4384"
title: "Corpus Bororo (CorBo)"
version: 0.1
doi: 
date-released: 2024-06-18
url: "https://github.com/LanguageStructure/Bororo-Corpus"

GitHub Events

Total
  • Release event: 1
  • Push event: 22
  • Create event: 1
Last Year
  • Release event: 1
  • Push event: 22
  • Create event: 1