By default, _id is neither indexed nor stored, so it causes no storage performance problem.
Since you will be indexing millions of documents, the only serious performance issue you are likely to encounter is during bulk indexing. Make sure your _ids follow a sequential pattern. From the documentation:
- If you do not have a natural identifier for each document, use Elasticsearch's auto-ID functionality. It is optimized to avoid version lookups, because the auto-generated ID is unique.
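- If you are using your own ID, try to pick an ID that is friendly to Lucene. Examples include zero-padded sequential IDs, UUID-1, and nanotime; these IDs have consistent, sequential patterns that compress well. In contrast, IDs such as UUID-4 are essentially random, which offer poor compression and slow down Lucene.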
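To make the two options concrete, here is a minimal Python sketch of what the action lines of a bulk request might look like with and without an explicit _id. The helper names (`zero_padded_id`, `bulk_body`) and the index name `my-index` are hypothetical, chosen only for illustration; the code only builds the newline-delimited JSON body for the `_bulk` endpoint and does not contact a cluster.

```python
import json
import uuid


def zero_padded_id(n: int, width: int = 12) -> str:
    """Hypothetical helper: fixed-width sequential ID.

    Zero-padding keeps lexical order identical to numeric order,
    which gives the IDs a consistent, sequential pattern.
    """
    return f"{n:0{width}d}"


def bulk_body(docs, index, ids=None):
    """Hypothetical helper: build an NDJSON body for the _bulk endpoint.

    When ids is None, no _id is sent in the action line, so
    Elasticsearch generates one itself.
    """
    lines = []
    for i, doc in enumerate(docs):
        action = {"index": {"_index": index}}
        if ids is not None:
            action["index"]["_id"] = ids[i]
        lines.append(json.dumps(action))
        lines.append(json.dumps(doc))
    # The bulk body must end with a newline.
    return "\n".join(lines) + "\n"


docs = [{"title": "first"}, {"title": "second"}]

# Option 1: let Elasticsearch assign the IDs.
auto_body = bulk_body(docs, "my-index")

# Option 2: supply sequential, zero-padded IDs of your own.
seq_ids = [zero_padded_id(n) for n in range(len(docs))]
seq_body = bulk_body(docs, "my-index", ids=seq_ids)

# For comparison: UUID-1 is time-based, UUID-4 is random.
time_based = str(uuid.uuid1())
random_based = str(uuid.uuid4())
```

If you do need your own IDs, swapping `seq_ids` for a list of `uuid.uuid1()` strings keeps a time-based pattern, whereas `uuid.uuid4()` values are the random kind the quoted documentation advises against.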
Also, Lucene committer Michael McCandless has written about _id performance.