keeleleek/dfdl-pextract-schema

DFDL parser/unparser for pextract and XML

This is a first attempt at a Data Format Description Language (DFDL) parser and unparser for pextract paradigm files. The DFDL schema makes it possible to read and modify extracted paradigm files using XML technology. This enables lossless archiving of the original pextract files and simpler transformation to the Lexical Markup Framework.

More importantly, it makes it possible to automatically generate a morphology module in the Grammatical Framework using the pextract2gf tool.

The XML schema is not final yet. Any comments are welcome.

Creating XML from pextract files

The DFDL schema has been developed and tested using the open-source tool Daffodil.

Parsing the example pextract file:

$ ../bin/daffodil parse --schema ./pextract.dfdl.xsd ./examples/vot_noun.p

Creating pextract file from XML

Unparsing (that is, serializing) the parsed example XML infoset back to a textual pextract file:

$ ../bin/daffodil unparse --schema ./pextract.dfdl.xsd ./examples/vot_noun.tdml
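
Since the schema is meant to support lossless archiving, one way to check it is a round trip: parse a pextract file, unparse the result, and compare the output with the original. The following is a minimal sketch, not part of this repository. It assumes the Daffodil CLI's `-o` option for writing output to a file, and the function name `roundtrip` and the temporary file names are made up for illustration.

```shell
#!/bin/sh
# Round-trip sketch: parse -> unparse -> byte-wise compare.
# The defaults are the paths used in this README; override them via the
# environment if your layout differs.
DAFFODIL="${DAFFODIL:-../bin/daffodil}"
SCHEMA="${SCHEMA:-./pextract.dfdl.xsd}"
INPUT="${INPUT:-./examples/vot_noun.p}"

# roundtrip is a hypothetical helper name, not something shipped with Daffodil.
roundtrip() {
    tmpdir=$(mktemp -d) || return 1
    # Parse the pextract file into an XML infoset, then unparse that infoset
    # back to text, then compare the result with the original input.
    "$DAFFODIL" parse --schema "$SCHEMA" -o "$tmpdir/infoset.xml" "$INPUT" &&
    "$DAFFODIL" unparse --schema "$SCHEMA" -o "$tmpdir/roundtrip.p" "$tmpdir/infoset.xml" &&
    cmp -s "$INPUT" "$tmpdir/roundtrip.p"
    status=$?
    rm -rf "$tmpdir"
    return $status
}
```

Calling `roundtrip && echo "round trip OK"` after sourcing the snippet reports success only if both Daffodil steps succeed and the unparsed file is byte-identical to the input. Whether a byte-identical result is expected for every pextract file depends on the schema, so treat a mismatch as something to inspect rather than as a definite bug.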

About

DFDL schema for paradigm extraction with pextract
