Sample full-text search database

I am looking to do some benchmarking on full-text search indexes in PostgreSQL, SQLServer and Lucene.

Any ideas on where to find a good large database of examples for querying?

Thank you very much in advance.

+3
source share
1 answer

I think the best source would be a dump of the wikipedia database, since they contain a really large amount of text. They are available here: http://dumps.wikimedia.org/

You can also try the usenet archive, but it’s harder to choose the target language, and the quality of the language used is also lower.

+2
source

Source: https://habr.com/ru/post/1763718/


All Articles