Elasticsearch and Rails: using ngram to find part of a word

I am trying to use the Elasticsearch gem in my project. As far as I understand, there is no longer any need for the Tire gem, or am I mistaken?

In my project, I have a search (obviously) that currently applies to a single model. I want to avoid wildcards, as they do not scale well, but I cannot get the ngram analyzers to work correctly. If I search for whole words, the search works, but not for parts of a word.

class Pictures < ActiveRecord::Base

  include Elasticsearch::Model
  include Elasticsearch::Model::Callbacks

  settings  :analysis => {
          :analyzer => {
            :my_index_analyzer => {
                :tokenizer => "keyword",
                :filter => ["lowercase", "substring"]
            },
            :my_search_analyzer => {
              :tokenizer => "keyword",
              :filter => ["lowercase", "substring"]
            }
          },
          :filter => {
            :substring => {
              :type => "nGram",
              :min_gram => 2,
              :max_gram => 50
            }
          }
    } do  
    mapping do
      indexes :title,
              :type => "string",
              :index_analyzer => "my_index_analyzer",
              :search_analyzer => "my_search_analyzer"
    end
  end
end
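
For reference, the search itself is invoked roughly like this (a simplified sketch of a typical elasticsearch-model call; the exact query in my app may differ slightly):

  # Inside the Pictures model: a plain match query on the title field (illustrative only)
  def self.search(query)
    __elasticsearch__.search(
      :query => {
        :match => {
          :title => query
        }
      }
    )
  end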

Maybe someone can point me in the right direction.

I index my models with a very similar setup; see this gist for an example. I create and refresh the index itself from a rake task kept under the db folder:

https://gist.github.com/geordee/9313f4867d61ce340a08

I also limit the attributes that get indexed by overriding as_indexed_json:

def as_indexed_json(options={})
  self.as_json(only: [:id, :name, :description, :price])
end
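
Since settings and mappings only take effect when the index is created, the index has to be rebuilt after changing them. A minimal sketch of what such a rake task might look like (the task name and the Pictures model are my assumptions, not taken from the gist):

# Rake task sketch; name, location and model are assumptions (illustrative only)
namespace :elasticsearch do
  desc "Recreate and repopulate the Pictures index"
  task :reindex => :environment do
    Pictures.__elasticsearch__.create_index!(:force => true)  # drop and recreate with current settings/mappings
    Pictures.import                                           # bulk-index all existing records
  end
end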

For matching parts of words I build the index with an edgeNGram filter (instead of nGram, as in your settings). These are the settings of my index:

{
   "en_suggestions": {
      "settings": {
         "index": {
            "analysis": {
               "filter": {
                  "tpNGramFilter": {
                     "min_gram": "4",
                     "type": "edgeNGram",
                     "max_gram": "50"
                  }
               },
               "analyzer": {
                  "tpNGramAnalyzer": {
                     "type": "custom",
                     "filter": [
                        "tpNGramFilter"
                     ],
                     "tokenizer": "lowercase"
                  }
               }
            }
         }
      }
   }
}

and this is the mapping:

{
   "en_suggestions": {
      "mappings": {
         "suggest": {
            "properties": {
               "proposal": {
                  "type": "string",
                  "analyzer": "tpNGramAnalyzer"
               }
            }
         }
      }
   }
}
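
Translated into the elasticsearch-model DSL used in the question, the same settings and mapping might look roughly like this (a sketch only: the Pictures model and the title field are taken from the question, the filter and analyzer names from the JSON above):

class Pictures < ActiveRecord::Base

  include Elasticsearch::Model
  include Elasticsearch::Model::Callbacks

  settings :analysis => {
      :filter => {
        :tpNGramFilter => {
          :type => "edgeNGram",
          :min_gram => 4,
          :max_gram => 50
        }
      },
      :analyzer => {
        :tpNGramAnalyzer => {
          :type => "custom",
          :tokenizer => "lowercase",
          :filter => ["tpNGramFilter"]
        }
      }
    } do
    mapping do
      # edge n-grams are generated at index time, so a plain match query can find partial words
      indexes :title, :type => "string", :analyzer => "tpNGramAnalyzer"
    end
  end
end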
