I am new to approximate string matching.
I am learning to use the Bitap algorithm , but so far its limited template length has bothered me. I work with Flash and I have 32-bit unsigned integers and a double-precision floating-point IEEE-754 type that can allocate up to 53 bits for integers. However, I would prefer to have a fuzzy matching algorithm that can handle longer patterns than 50 characters.
The Wikipedia page in the Bitap algorithm mentions libbitap, which supposedly demonstrates an unlimited implementation of the length of the algorithm, but it's hard for me to get the idea from its sources.
Do you have any suggestions on how to generalize Bitap to patterns of unlimited length or on another algorithm that can fuzzy match needle strings near a suggested location in a haystack?
source
share