Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation is unclear on uploading a document #116

Closed
borissmidt opened this issue Jun 15, 2017 · 3 comments
Closed

Documentation is unclear on uploading a document #116

borissmidt opened this issue Jun 15, 2017 · 3 comments

Comments

@borissmidt
Copy link

Dear,

I cannot seem to find any documentation on how to create a folia formatted document from a simple text file. Thus i cannot upload a starting file to use this tool.

Kind regards, Boris Smidt

@proycon
Copy link
Owner

proycon commented Jun 15, 2017

Good point, I should provide more pointers in the FLAT documentation for people who are yet unfamiliar with FoLiA. Two tools can be used to get a preliminary FoLiA document from a text:

This relates to issue #74 also, which proposes to integrate these tools as converter options in FLAT so you don't have to run it manually, but that has not been a priority yet thus-far so isn't implemented yet.

If you deal with a custom format, you can always write your own conversion script in Python using the FoLiA API (part of https://github.com/proycon/pynlpl , documentation: http://pynlpl.readthedocs.io/en/latest/folia.html)

@proycon
Copy link
Owner

proycon commented Jun 15, 2017

I should also add, if you happen to work on Dutch data, you might want to consider Frog ()https://languagemachines.github.io/frog), it does all kinds of automated linguistic annotation (PoS, lemma, NER, etc) and can produce output in FoLiA XML (which you can load into FLAT again).

@borissmidt
Copy link
Author

Now it is clear.

Thanks for mentioning Frogger, But i need a different kind of NER annotation. I might be able to use it once i have made my own tagged data. Its a chickens and eggs problem.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants