Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Image Processing? #47

Open
Vaccano opened this issue Sep 23, 2016 · 3 comments
Open

Image Processing? #47

Vaccano opened this issue Sep 23, 2016 · 3 comments
Labels

Comments

@Vaccano
Copy link

Vaccano commented Sep 23, 2016

I am totally new to machine learning. I am trying to figure out where to dive in.

My job is to be able to categorize images. Specifically patent labels. I will need to categorize common indicators on the label. (Though not my scenario, a decent example may be patent race: African-American, Caucasian, etc.)

But the image will also have barcodes and other numbers on them that are not the same from image to image (and should be ignored by the system).

To add one more level of complexity, there are many different kinds of patient labels. All of them will have the "race" info on them, but in different fonts and in different places. (And maybe even abbreviated differently.)

Is NuML able to do this kind of thing? If so I will dig in and learn it.

@bdschrisk
Copy link
Collaborator

Hey, at the moment numl isn't really suited for image processing. You would ideally use a convolutional neural network for recognising specific parts in patient label images. However, if you were to convert the image to text first using OCR (check out the Microsoft portable OCR library) and then apply a Naïve Bayes or Logistic Regression for recognising race, etc., then numl would work.

@sethjuarez
Copy link
Owner

I would love to add an Image feature (even if just to pass off the numbers to a better algorithm). My guess is we would resize all the images on the first pass then use rgb number as the actual value (then normalize). It would be fun to implement... right?? 😄

@bdschrisk
Copy link
Collaborator

Seth, I think we should look into this soon.

We need to revisit the Matrix library to include depth for each of the colour channels. We then need to create sampling algorithms i.e. Convolutional and Pooling nodes / layers in the Neural Network objects.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants