There is one confusion in your question: the “fingerprint” associated with you is clearly not intended to search for similar images (quote):
TinEye usually does not find similar images (that is, another image with the same subject); it finds exact matches, including those that have been cropped, edited, or modified.
Now, I said, I’m just assuming that you know what you are asking for and that you really want to find all similar images, not just edited replicas.
If you want to try to understand this in detail, I would suggest looking for documents from Sivic, Zisserman and Nister, Stewenius . The idea that these two articles (as well as quite a few others have recently) used is to try to apply text search methods to image databases and search the image database in the same way that Google will search for a document (web -page).
The first document I am involved with is a good starting point for this approach, since it mainly addresses the big question: what are the “words” in the images ?. Text search methods focus on words and base their similarity methods on computations, including word count. Thus, the successful presentation of images as collections of visual words is the first step to applying text search methods to image databases.
the second document then broadens the idea of using text technology, introducing a more appropriate search structure. Thanks to this, they provide faster image processing and large image databases. They also offer how to create an image descriptor based on a basic search structure.
The functions used as visual words in both documents should satisfy your invariance restrictions, and the second should definitely be able to work with the required database size (maybe even working with the 1st job will work).
Finally, I recommend looking for new articles from the same authors (I'm sure Nister did something new, just so that the tied paper approach has been enough for me so far), looking at some of my links and generally looking for content-based content indexing and search (CBIR) documents are a very popular subject right now, so there should be a lot.
penelope May 7 '12 at 12:40 2012-05-07 12:40
source share