Hipi on Spark #31
Comments
@sdikby have you successfully tried HIPI's hibImport.sh with millions of images?
@yangboz sorry for the delay.
@yangboz I know of two other tools for image processing, but I haven't tried them yet (I just began my master's thesis :) )
@sdikby thanks for your suggestions, I will try them. My ideas come from:
@yangboz it would also be great to know how the 3 tools/frameworks store images on HDFS (to deal with the block size problem, for example) and the big differences between them (read/write performance from/into HDFS).
@sdikby before those 3 tools/frameworks, there were existing solutions that I have studied, on Ceph and even Cassandra image blob storage. A conclusion will be coming soon.
@sdikby compare Mipr: https://github.com/sozykin/mipr (full documentation, and the code example passed)
@yangboz oh, good job! And what about performance? Did you compare the two in terms of images written/read per second?
@sdikby there is a paper (please drop me a line if you need it) comparing Hadoop/Spark performance, including indexing and retrieval.
@yangboz could you please provide me with this paper?
Dear HIPI developers,
Do you plan on integrating Apache Spark instead of the old MapReduce? If so, when?
Otherwise, could you give me some hints on how to do it?
My use case is that I need to classify millions of images, and MapReduce will not be as efficient as I need it to be.
@sweeneychris @liuliu @voigtlandier @zverham @hafnium