Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to changes the images data into xml files #10

Open
hnn123 opened this issue Aug 20, 2018 · 3 comments
Open

How to changes the images data into xml files #10

hnn123 opened this issue Aug 20, 2018 · 3 comments

Comments

@hnn123
Copy link

hnn123 commented Aug 20, 2018

Do you know how to changes the images data into xml files. I have the handwritting images data and labels, but I don't know how to change them into xml files.

@Grzego
Copy link
Owner

Grzego commented Aug 21, 2018

So you have images of handwriting and labels with text? I assume you want to generate those files to train a model that can then be used to generate handwriting. If that's the case, then converting images to handwriting data would already required a model able to generate handwriting (so this is almost impossible to do automatically, unless your images are similar enough to the those in IAM dataset).

Is your dataset of images and text labels available somewhere? I could look it up and say something more about this problem.

@hnn123
Copy link
Author

hnn123 commented Aug 22, 2018

@Grzego I have images of handwriting and labels with text. Like the following picture
image

image

These data are collected offline and almost not similar to the those in IAM dataset. After I look the detail of the data format of IAM dataset, I realized that it is almost impossible to convert my data into xml files. What I want to do is training a model that can be used to generate handwriting using my own data. Do you know other method which can do that .

@Grzego
Copy link
Owner

Grzego commented Aug 23, 2018

@hnn123 IIRC the IAM dataset has images, handwriting and text. So it could be possible to train model that uses image and text to predict handwriting. If this model trained well you could then transcribe your images. And if this transcribed data is good enough you could train final handwriting model only on those data.

Those are quite big ifs, so it probably will be hard to achieve.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants