Convert CLDF Wordlist with cognates to old LingPy's QLC format #5

xrotwang · 2018-05-16T10:32:10Z

The LingPy tutorial uses LingPy's old QLC format (see polynesian.tsv). We should have a recipe to convert a CLDF Wordlist into this format. Should be a csvkit one-liner.

LinguList · 2018-05-16T10:34:10Z

From lingpy, it is:

>>> from lingpy.convert.cldf import from_cldf
>>> from_cldf('path').output('tsv', filename='filename', prettify=False)

xrotwang · 2018-05-16T10:41:11Z

Yes, this would just be a "proof-of-concept" recipe, or for providing backward compatibility with earlier LingPy versions.

LinguList · 2018-05-17T08:00:34Z

BTW: it's also what @thiagochacon wanted, namely that we help convert data to "edictor" format.

Anaphory · 2018-11-15T14:20:57Z

If you want support for non-standard CLDF column headers, it is

>>> from lingpy import Wordlist
>>> Wordlist.from_cldf('path').output('tsv', filename='filename', prettify=False)

although that keeps the non-standard column headers and does not yet change them into the standard DOCULECT CONCEPT IPA headers that Edictor expects.

LinguList · 2018-11-15T15:38:15Z

you can easily find a workaround:

wl = wordlist.from_cldf('path.json')
wl.add_entries('doculect', 'language_name', lambda x: x)
wl.add_entries('concept', 'concept_name', lambda x: x)
wl.add_entries('tokens', 'segments', lambda x: x)
wl.output('tsv', filename='bla', prettify=False, subset=True, cols=['doculect', 'concept', 'tokens'])

This is okay enough for the time being, I'd say.

xrotwang added the recipe label May 16, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert CLDF Wordlist with cognates to old LingPy's QLC format #5

Convert CLDF Wordlist with cognates to old LingPy's QLC format #5

xrotwang commented May 16, 2018

LinguList commented May 16, 2018

xrotwang commented May 16, 2018

LinguList commented May 17, 2018

Anaphory commented Nov 15, 2018 •

edited

Loading

LinguList commented Nov 15, 2018

Convert CLDF Wordlist with cognates to old LingPy's QLC format #5

Convert CLDF Wordlist with cognates to old LingPy's QLC format #5

Comments

xrotwang commented May 16, 2018

LinguList commented May 16, 2018

xrotwang commented May 16, 2018

LinguList commented May 17, 2018

Anaphory commented Nov 15, 2018 • edited Loading

LinguList commented Nov 15, 2018

Anaphory commented Nov 15, 2018 •

edited

Loading