bororo-corpus
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 4 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.3%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: LanguageStructure
- Default Branch: main
- Size: 21.8 MB
Statistics
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Corpus Bororo CorBo
Welcome to Corpus Bororo (CorBo) — a curated and linguistically annotated collection designed to support research on the Bororo language and culture. This corpus includes a wide range of text genres, such as traditional stories, myths, fieldwork materials, biblical translations, and ceremonial songs.
Whether you are a linguist, anthropologist, or simply passionate about Indigenous languages, CorBo offers valuable insights into the grammar, lexicon, and sociocultural context of the Bororo people. The corpus is being annotated according to the Universal Dependencies framework and is part of the ongoing development of the Bororo UD-Treebank.
🔗 Access the preliminary version of the corpus here
About the Language
Language | ISO Code | Glottocode ---------|----------|------------ Bororo | bor | boro1282
Bororo, also known as Boe or Bororo Proper, is an Indigenous language spoken in central Brazil, primarily in the state of Mato Grosso. Although it is currently classified as a language isolate, linguistic evidence suggests that it once formed part of a small language family with now-extinct relatives.
Despite a reduced number of active speakers, Bororo remains a vital component of cultural identity and ceremonial life among the Bororo people. Current revitalization efforts include school-based teaching programs, digital tools, and comprehensive documentation projects to support its intergenerational transmission.
Como citar
Ferraz Gerardi, F., Toribio Serrano, L., & Sollberger, D. (2025). Corpus Bororo (CorBo) (v0.3) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15567900
Em Português
Corpus Bororo CorBo
Bem-vindo ao Corpus Bororo (CorBo) — uma coleção curada e anotada linguisticamente, criada para apoiar a pesquisa sobre a língua e a cultura Bororo. O corpus abrange uma variedade de gêneros textuais, incluindo histórias tradicionais, mitos, material de campo, traduções bíblicas e cantos cerimoniais.
Seja você linguista, antropólogo ou entusiasta de línguas indígenas, o CorBo oferece perspectivas valiosas sobre a gramática, o léxico e o contexto sociocultural do povo Bororo. O corpus está sendo anotado conforme os princípios do Universal Dependencies e integra o desenvolvimento contínuo do Bororo UD-Treebank.
🔗 Acesse a versão preliminar do corpus aqui
Sobre a Língua
Língua | ISO | Glottocode -------|-----|------------ Bororo | bor | boro1282
O Bororo, também conhecido como Boe ou Bororo propriamente dito, é uma língua indígena falada no centro do Brasil, sobretudo no estado de Mato Grosso. Embora atualmente classificada como uma língua isolada, há evidências de que fazia parte de uma pequena família linguística hoje extinta.
Apesar do número reduzido de falantes, o Bororo continua sendo um componente central da identidade cultural e da vida cerimonial do povo Bororo. Iniciativas de revitalização incluem programas de ensino escolar, ferramentas digitais e projetos de documentação com o objetivo de garantir sua transmissão para as futuras gerações.
Owner
- Name: FFGerardi
- Login: LanguageStructure
- Kind: user
- Location: Germany
- Company: University
- Website: https://languagestructure.github.io
- Twitter: fabricioicirbaf
- Repositories: 10
- Profile: https://github.com/LanguageStructure
(Computational) Linguist, language structure, syntax, coffee, Amazon (the forest)
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Ferraz Gerardi" given-names: "Fabrício" orcid: "https://orcid.org/0000-0002-1438-7336" - family-names: "Sollberger" given-names: "Dolores" orcid: "https://orcid.org/0009-0000-9584-5767" - family-names: "Toribio Serrano" given-names: "Lucas" orcid: "https://orcid.org/0009-0002-2605-4384" title: "Corpus Bororo (CorBo)" version: 0.1 doi: date-released: 2024-06-18 url: "https://github.com/LanguageStructure/Bororo-Corpus"
GitHub Events
Total
- Release event: 1
- Push event: 22
- Create event: 1
Last Year
- Release event: 1
- Push event: 22
- Create event: 1