I'm just curious. I am new here, so please consider my question with a few nobles.
Let's say I'm making an Android application with pattern recognition, where all processes, even computationally intensive, must happen on the processor of a mobile device.
I am at the stage where I already processed the images, some functions were extracted from the image. A set of images comes from only one building, where it must recognize specific objects of interest (different windows, images, artifacts, outside the building). Thus, this is a closed domain, and I can provide enough images of objects from different angles. I plan to train the neural network and provide it to the application instead of the image matching algorithm.
My idea is to extract the key points, compute the descriptors (using FREAK for the ORB key points for the descriptors), and from these descriptors I would like to get a single file or array that will end up with something like this
Desc1 Desc2 Desc3 Desc4 DescN......... Class _________________________________________________________________________________ Picture 1 0.121 0.923 0.553 0.22 0.28 "object1" Picture 2 0.22 0.53 0.54 0.55 0.32 .........."object1" (different scale, angle) Picture 3 .... ... ... ... .. .........."object2" Picture N Picture N+1
therefore, I can attribute it to a neural network for training, however I am stuck as I have no idea how the binary function / descriptor is presented in Matrice (Class Mat - openCV). and how would I continue to normalize these binary descriptors, so I can feed it in Neural Net (Multi-Layer Perceptron) for training. (even pseudo code helped a lot)
source share