I decided to send my comment in response.
I would suggest that you do not store your text data in Solr / Lucene. Index data only to search for and store a unique identifier or URL to identify a document. The contents of the document must be extracted from a separate storage system.
Solr / Lucene are search optimized. This is not your data warehouse or database, and they should not be used in this way. When you store more data in Solr than necessary, you negatively impact your entire search engine. You inflate the size of indexes, increase the replication time between masters and slaves, copy data that requires only one copy, and also the waste cache in document caches, which should be used to speed up the search.
So, I would suggest 2 things.
First, optimally, remove the entire text store from your search index. Retrieve the preview text and all the text from the secondary system, which is optimized for storing documents such as a file server.
Secondly, suboptimal, save only the preview text in your search index. Store the entire document in a different location, such as a file server.
rfeak source share