Boyer-Moore-Horspool algorithm for all matches (find byte array inside Byte array)

Here is my implementation of the BMH algorithm (it works like a charm):

    public static Int64 IndexOf(this Byte[] value, Byte[] pattern)
    {
        if (value == null)
            throw new ArgumentNullException("value");

        if (pattern == null)
            throw new ArgumentNullException("pattern");

        Int64 valueLength = value.LongLength;
        Int64 patternLength = pattern.LongLength;

        if ((valueLength == 0) || (patternLength == 0) || (patternLength > valueLength))
            return -1;

        Int64[] badCharacters = new Int64[256];

        for (Int64 i = 0; i < 256; ++i)
            badCharacters[i] = patternLength;

        Int64 lastPatternByte = patternLength - 1;

        for (Int64 i = 0; i < lastPatternByte; ++i)
            badCharacters[pattern[i]] = lastPatternByte - i;

        // Beginning
        Int64 index = 0;

        while (index <= (valueLength - patternLength))
        {
            for (Int64 i = lastPatternByte; value[(index + i)] == pattern[i]; --i)
            {
                if (i == 0)
                    return index;
            }

            index += badCharacters[value[(index + lastPatternByte)]];
        }

        return -1;
    }

I tried changing it to return all matches, not just the first index, but I get an IndexOutOfRangeException everywhere D:

Obviously, I am missing something important or I did not understand how it works. What am I doing wrong?

    public static List<Int64> IndexesOf(this Byte[] value, Byte[] pattern)
    {
        if (value == null)
            throw new ArgumentNullException("value");

        if (pattern == null)
            throw new ArgumentNullException("pattern");

        Int64 valueLength = value.LongLength;
        Int64 patternLength = pattern.LongLength;

        if ((valueLength == 0) || (patternLength == 0) || (patternLength > valueLength))
            return (new List<Int64>());

        Int64[] badCharacters = new Int64[256];

        for (Int64 i = 0; i < 256; ++i)
            badCharacters[i] = patternLength;

        Int64 lastPatternByte = patternLength - 1;

        for (Int64 i = 0; i < lastPatternByte; ++i)
            badCharacters[pattern[i]] = lastPatternByte - i;

        // Beginning
        Int64 index = 0;
        List<Int64> indexes = new List<Int64>();

        while (index <= (valueLength - patternLength))
        {
            for (Int64 i = lastPatternByte; value[(index + i)] == pattern[i]; --i)
            {
                if (i == 0)
                    indexes.Add(index);
            }

            index += badCharacters[value[(index + lastPatternByte)]];
        }

        return indexes;
    }
1 answer

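Change

    if (i == 0) indexes.Add(index);

to

    if (i == 0) { indexes.Add(index); break; }

Without the break, the inner for loop keeps running after a full match: --i makes i equal to -1, and the loop condition then evaluates pattern[-1], which throws the IndexOutOfRangeException. The original IndexOf never hit this because it returned as soon as i reached 0.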
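For reference, here is a minimal sketch of how the whole method might look with that change applied. The class name ByteArraySearch is only a placeholder (the question does not show the enclosing class), and the early return reuses the result list instead of allocating a second one; everything else follows the code from the question.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical container class; the question does not name the real one.
public static class ByteArraySearch
{
    public static List<long> IndexesOf(this byte[] value, byte[] pattern)
    {
        if (value == null) throw new ArgumentNullException(nameof(value));
        if (pattern == null) throw new ArgumentNullException(nameof(pattern));

        long valueLength = value.LongLength;
        long patternLength = pattern.LongLength;
        var indexes = new List<long>();

        if (valueLength == 0 || patternLength == 0 || patternLength > valueLength)
            return indexes;

        // Bad-character table: default shift is the full pattern length.
        long[] badCharacters = new long[256];
        for (int i = 0; i < 256; ++i)
            badCharacters[i] = patternLength;

        // Every pattern byte except the last gets its distance to the end.
        long lastPatternByte = patternLength - 1;
        for (long i = 0; i < lastPatternByte; ++i)
            badCharacters[pattern[i]] = lastPatternByte - i;

        long index = 0;
        while (index <= valueLength - patternLength)
        {
            // Compare right to left; stop at the first mismatch.
            for (long i = lastPatternByte; value[index + i] == pattern[i]; --i)
            {
                if (i == 0)
                {
                    indexes.Add(index);
                    break; // without this, i becomes -1 and pattern[i] throws
                }
            }

            // Shift by the table entry for the byte under the pattern's last position.
            index += badCharacters[value[index + lastPatternByte]];
        }

        return indexes;
    }
}
```

As a quick check, calling new byte[] { 1, 2, 3, 1, 2, 3, 1, 2 }.IndexesOf(new byte[] { 1, 2 }) should return 0, 3 and 6. Because the shift after a match still comes from the bad-character table, overlapping matches are reported as well.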

Source: https://habr.com/ru/post/943776/

