CLDF dataset derived from Chacon et al.'s "Diversity of Arawakan Languages" from 2019

lexibank/chaconbaniwa

How to cite

If you use these data, please cite:

  • the original source

    Chacon, T. C.; Gonçalves, A. G.; and da Silva, L. F. (2019): A diversidade linguística Aruák no Alto Rio Negro em gravações da década de 1950 [The diversity of Arawakan languages from the upper Rio Negro in recordings from the 1950s]. Forma y Función, 32.2, 41-67. DOI: 10.15446/fyf.v32n2.80814.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Available online at https://github.com/lexibank/chaconbaniwa/

Conceptlists in Concepticon:

Statistics

Validation coverage: Glottolog: 100%, Concepticon: 98%, Source: 97%, BIPA: 100%, CLTS SoundClass: 100%

  • Varieties: 14 (linked to 4 different Glottocodes)
  • Concepts: 243 (linked to 233 different Concepticon concept sets)
  • Lexemes: 2,354
  • Sources: 2
  • Synonymy: 1.00
  • Cognacy: 2,354 cognates in 610 cognate sets (305 singletons)
  • Cognate Diversity: 0.17
  • Invalid lexemes: 0
  • Tokens: 15,986
  • Segments: 91 (0 BIPA errors, 0 CLTS sound class errors, 91 CLTS modified)
  • Inventory size (avg): 44.36
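Some of the figures above can be cross-checked against the raw counts. The sketch below is illustrative only: it assumes that cognate diversity is defined as (cognate sets - concepts) / (lexemes - concepts), a definition this README does not state, and the names `cognate_diversity`, `singleton_share`, and `tokens_per_lexeme` are hypothetical helpers introduced here. Under that assumption the listed value of 0.17 is reproduced.

```python
# Counts copied from the Statistics list above.
concepts = 243
lexemes = 2354
cognate_sets = 610
singletons = 305
tokens = 15986


def cognate_diversity(n_sets: int, n_concepts: int, n_lexemes: int) -> float:
    """Assumed definition: (sets - concepts) / (lexemes - concepts).

    This formula is an assumption; the README does not define the measure.
    """
    return (n_sets - n_concepts) / (n_lexemes - n_concepts)


diversity = cognate_diversity(cognate_sets, concepts, lexemes)
singleton_share = singletons / cognate_sets  # share of one-member cognate sets
tokens_per_lexeme = tokens / lexemes         # mean segment tokens per lexeme

print(f"Cognate diversity: {diversity:.2f}")          # 0.17
print(f"Singleton share: {singleton_share:.2f}")      # 0.50
print(f"Tokens per lexeme: {tokens_per_lexeme:.2f}")  # 6.79
```

The singleton share and tokens-per-lexeme values are plain ratios of the listed counts and are not reported in the statistics themselves.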

Possible Improvements:

  • Entries missing sources: 71/2354 (3.02%)
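A count like the one above could be recomputed from the forms table with the standard library. This is a minimal sketch, assuming the forms are stored as a CSV file with a `Source` column (as in a typical CLDF Wordlist); the function name `missing_source_report` and the sample rows are hypothetical and are not taken from the dataset.

```python
import csv
import io
from typing import TextIO


def missing_source_report(forms: TextIO) -> tuple[int, int, float]:
    """Return (missing, total, percent) for rows whose Source cell is empty."""
    rows = list(csv.DictReader(forms))
    total = len(rows)
    # A row counts as missing when the Source column is absent or blank.
    missing = sum(1 for row in rows if not (row.get("Source") or "").strip())
    percent = 100 * missing / total if total else 0.0
    return missing, total, percent


# Tiny made-up sample standing in for a forms table (not real data).
sample = io.StringIO(
    "ID,Language_ID,Parameter_ID,Form,Source\n"
    "v1-c1-1,v1,c1,aaa,src1\n"
    "v1-c2-1,v1,c2,bbb,\n"
    "v2-c1-1,v2,c1,ccc,src2\n"
    "v2-c2-1,v2,c2,ddd,src1\n"
)
missing, total, percent = missing_source_report(sample)
print(f"Entries missing sources: {missing}/{total} ({percent:.2f}%)")
```

To run it on a local clone, pass an open file handle instead of the sample, for example `open("cldf/forms.csv", encoding="utf-8", newline="")`; that path is an assumption about the repository layout.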

Contributors

| Name | GitHub user | Description | Role |
| --- | --- | --- | --- |
| Johann-Mattis List | @LinguList | maintainer | Editor |
| Tiago Tresoldi | @Tresoldi | orthography | Other |
| Christoph Rzymski | @chrzyki | maintainer | Editor |
| Frederic Blum | @FredericBlum | maintainer | Editor |
| Thiago Costa Chacon | @thiagochacon | maintainer | Author, DataCollector |

CLDF Datasets

The following CLDF datasets are available in cldf: