Implementing Lucene without an analyzer for the content language used?

Does it make sense?

It is too expensive for my client to develop an analyzer for the Croatian language, I did not find any existing ones ... so my question is ... I tell them to abandon Lucene's idea for Croatian content?

Thank!

+3
source share
2 answers

Robert Muir, Chris Male and others built the Lunzen Morphological Analyzer based on Hunspell . The code is here . Croatian is one of the supported languages ​​on the list. There may be licensing issues, since hunspell is the GPL, I think, but it's worth checking out.

+1
source
+2

Source: https://habr.com/ru/post/1783758/


All Articles