I have a collection of media files, mostly music, most of which were imported from CD many years ago. This collection has been transferred between different media players, different file systems, different computers, etc., many times. In this process, some tracks were accidentally duplicated. I also constantly try to check this metadata and get everything correctly tagged, because when most of them were originally imported, I didn’t have fantastic multimedia playback software and didn’t even realize that ID3 tags indicate that everything was just "Track % d "on the classic album" Album ".
This creates a situation where I have some files with modern metadata, but “duplicate” the same media file that I would like to delete, whose metadata has not been properly updated. Since metadata is present in the file, the contents of these files are now different, and tools like liten2 do not work.
My question is: is there a library that I can use that will conveniently extract a uniquely identifying fingerprint (possibly a SHA-1 hash, but this is not a strict requirement) of the media content of the file only, ignoring the metadata? If so, how to use it?
source
share