You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am totally new to machine learning. I am trying to figure out where to dive in.
My job is to be able to categorize images. Specifically patent labels. I will need to categorize common indicators on the label. (Though not my scenario, a decent example may be patent race: African-American, Caucasian, etc.)
But the image will also have barcodes and other numbers on them that are not the same from image to image (and should be ignored by the system).
To add one more level of complexity, there are many different kinds of patient labels. All of them will have the "race" info on them, but in different fonts and in different places. (And maybe even abbreviated differently.)
Is NuML able to do this kind of thing? If so I will dig in and learn it.
The text was updated successfully, but these errors were encountered:
Hey, at the moment numl isn't really suited for image processing. You would ideally use a convolutional neural network for recognising specific parts in patient label images. However, if you were to convert the image to text first using OCR (check out the Microsoft portable OCR library) and then apply a Naïve Bayes or Logistic Regression for recognising race, etc., then numl would work.
I would love to add an Image feature (even if just to pass off the numbers to a better algorithm). My guess is we would resize all the images on the first pass then use rgb number as the actual value (then normalize). It would be fun to implement... right?? 😄
We need to revisit the Matrix library to include depth for each of the colour channels. We then need to create sampling algorithms i.e. Convolutional and Pooling nodes / layers in the Neural Network objects.
I am totally new to machine learning. I am trying to figure out where to dive in.
My job is to be able to categorize images. Specifically patent labels. I will need to categorize common indicators on the label. (Though not my scenario, a decent example may be patent race: African-American, Caucasian, etc.)
But the image will also have barcodes and other numbers on them that are not the same from image to image (and should be ignored by the system).
To add one more level of complexity, there are many different kinds of patient labels. All of them will have the "race" info on them, but in different fonts and in different places. (And maybe even abbreviated differently.)
Is NuML able to do this kind of thing? If so I will dig in and learn it.
The text was updated successfully, but these errors were encountered: