Help with .NET C # Search Architecture

I am trying to create a search engine for all literature (books, articles, etc.), music and videos related to a particular spiritual group. When a keyword is entered, I want to display a link to all PDF articles where the keyword is displayed, as well as all music and video files marked with the keyword in the question. The user should be able to filter it with information such as author / artist, location, date / time, etc. When a user clicks on one of the links to the results (for example, the names of books), they go to another page where fragments from this book, wherever the keyword is located, are displayed.

I was thinking about using the Lucene library (or Searcharoo) to implement my search in PDF format, but I also need a database to mark all other information so that the results can be filtered by author / artist, etc. Therefore, I thought of having tables for text, music and video, and a field containing the file path for each. When a keyword is entered, I need to search the database for music and video files, and also search in PDF files, and when using the filter, searching for music and videos is simple, but the restriction of text search based on filters gets confused.

Is my approach right? Are there any better ways to do this? Since the content of the search is limited only to the spiritual group, there are no infinite number of elements to search. I would say about 100-500 books and 1000-5000 songs.

+3
source share
4 answers

If you definitely want to navigate the database path, you should use SQL Server with Full Text Search . You can also use this with Express versions. You can store and search the contents of PDF files very easily (as long as you install the free Adobe PDF iFilter).

+1
source

Lucene - , , . , HTML/XML, PDF, MS Word ..

, ( , , , , , ..), .

, , , , MySQL SQL Server, Lucene. Lucene , , , , . , , .

.. , , " ", Lucene, ( , ), ( ) .

, #.NET, Lucene.Net, #. , , , Java .

+6

, . Solr , , . .

+3

You can try using MS Search Server Express Edition, one of the main advantages is that it is free.

http://www.microsoft.com/enterprisesearch/en/us/search-server-express.aspx#none

+1
source

Source: https://habr.com/ru/post/1715118/


All Articles