This question has been asked in various ways, but I wonder if people who have experience with automatic search suggestions can offer advice on the most useful and effective approaches. Here's the scenario:
I am just starting on a website for a book that is a glossary of terms (approximately 1,000 entries, averaging 300 words each). Many of the terms are rather obscure, and it is likely that many visitors would not know how to spell them. The publisher wants to make full-text search available for every entry, so I hope to implement a search engine with spelling correction. The main site will probably be built in a PHP framework (or perhaps Django) with a MySQL database.
Can anyone with experience in this field give advice on the following:
- With this kind of setup, should I use something like Lucene or Sphinx for the search engine?
- As far as I can tell, neither of them has a built-in suggestion function. So it seems I will need to integrate one or more of the following. What are the advantages and disadvantages of each?
- Suggestion requests through the Google Search API
- Phonetic comparison algorithm, e.g. metaphone() in PHP
- Spell checker such as Aspell
- A simpler spelling-correction script like Peter Norvig's
- Levenshtein function
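
To make the Levenshtein option concrete, here is a minimal Python sketch (Python because Django is one of the candidate frameworks). It assumes the roughly 1,000 headwords fit in memory and are compared one by one against the query. The names `GLOSSARY_TERMS` and `suggest`, the sample terms, and the `max_distance` threshold are all hypothetical placeholders, not part of any library.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character inserts, deletes and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def suggest(query, terms, max_distance=2, limit=5):
    """Return up to `limit` terms within `max_distance` edits of `query`, closest first."""
    q = query.lower()
    scored = [(levenshtein(q, term.lower()), term) for term in terms]
    close = sorted((d, t) for d, t in scored if d <= max_distance)
    return [term for _, term in close[:limit]]


# Hypothetical sample of glossary headwords (assumption, for illustration only).
GLOSSARY_TERMS = ["syzygy", "chiaroscuro", "onomatopoeia", "synecdoche"]

print(suggest("sizygy", GLOSSARY_TERMS))  # ['syzygy']
```

With only about 1,000 headwords a linear scan like this should be cheap, but it only corrects the headwords themselves, not arbitrary words in the entry text, and it does nothing for phonetic misspellings that are many edits away from the correct form.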