Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Which dictionary files are you using? Are they the most up-to-date? #1

Open
trosel opened this issue Mar 4, 2017 · 10 comments
Open

Comments

@trosel
Copy link

trosel commented Mar 4, 2017

No description provided.

@kyegupov
Copy link
Owner

There's a link in the code. It's Dyer's dictionary from 1924 which has been updated by Brian Drake: http://www.ido.li/dicionarii/IdoAngladicionarii/

I actually am not sure where to find most "up to date" official dictionaries, I will ask around. Wiktionary is pretty comprehensive, but I'm not sure how reliable it is.

@trosel
Copy link
Author

trosel commented Mar 16, 2017

@kyegupov I asked around about this, but it may be worthwhile to host a dictionary file on github in its own repo. For posterity's sake, but also in case people want to work on updating it together.

@kyegupov
Copy link
Owner

kyegupov commented Jul 29, 2017 via email

@trosel
Copy link
Author

trosel commented Jul 30, 2017

I wouldn't say that it is "human readable". It seems like a mix of YAML and XML here https://github.com/kyegupov/ido_web_dictionary/blob/master/backend/src/main/resources/dyer_by_letter/i/ai.yaml

@kyegupov
Copy link
Owner

kyegupov commented Jul 30, 2017 via email

@trosel
Copy link
Author

trosel commented Jul 31, 2017

If you choose one or the other, you can run it through pandoc to convert it to virtually anything.

Thinking forward towards the future, perhaps JSON would be easiest to work with in apps and to search and display.

Thoughts?

@kyegupov
Copy link
Owner

kyegupov commented Jul 31, 2017 via email

@trosel
Copy link
Author

trosel commented Jul 31, 2017

Do you have a plan for what you want to use this for in the future (other than your current website?)

@kyegupov
Copy link
Owner

kyegupov commented Aug 8, 2017 via email

@hall
Copy link

hall commented Jan 28, 2018

I've imported a recent data dump from the Ido Wiktionary into an SQLite database; however, Wikimedia's formatting makes it nearly impossible to create an adequate parser so much of the data has been dropped or corrupted. I'm writing an API here: https://github.com/linguo-io/api and a basic front-end here: https://github.com/linguo-io/vortaro.

They are both very much unstable but I think this is a good path to go down as far as to what format we should store the dictionary files in so they remain maintainable and extensible without too much duplicated effort. Let me know what you think.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants