Is a schema change required to override all Solr documents or only documents containing modified schema fields?

I have millions of documents in my Solr index. Only a thousand of these documents have field A, the scheme of which I want to change. Schema changes include changing multiValued from true to false, stored from false to true, and type from text to line, things that require re-indexing. Re-indexing thousands of documents will take me several minutes, where - when re-indexing, it will take several days.

The reindexing page on the Solr wiki ( http://wiki.apache.org/solr/HowToReindex ) says: “You may need to delete all documents before you begin the indexing process,” but does not say when you are not doing this.

Is it possible to delete only thousands of documents containing field A and re-index these thousands, or do I need to delete the entire index (all documents) before reindexing them all?

I tested the "delete multiple" scenario in a small sample index; and updates and requests work as expected in the changed area. However, I do not know if I was just lucky, and some problems are hidden due to the fact that they did not delete everything.

+4
source share
1
  • ( , schema.xml), . .

, "", . Vector Vector Analysis ( ).

, "", solr.

  • , . .

, , . . , .

+7

Source: https://habr.com/ru/post/1536508/


All Articles