Offer a database for storing metadata for 200 million images (1 million books) (NoSQL? SQL?)

Friends,

We will conduct a knowledge preservation project to scan over 1 million books. We need some suggestions for implementing a database for storing and retrieving metadata, as well as for using it to track the scanning status of each object (book)

  • Can you guys suggest that we switch to SQL or NoSQL (metadata can vary from project to project, they say that this project can have 15 fields)

  • We are thinking of something based on Lucene / Solr or some scalable RDF database

  • Any open source solution where we have the ability to define custom metadata fields and store information using the search function?

+3
source share
2 answers

Disclaimer: never attempted this type of project

I saw very good performance on an MSSQL filestream server. It uses the NTFS file APIs to store binary data and stores a pointer in the rows of your table.

If you don't have a metadata structure, you can use XML, but if you have a repeating structure, drag it into the relationship data and then you can use indexing, etc. to help you get performance.

Filter type

0
source

, , , , , CMS ( ). CMS .

CMS, , , , . Alfresco ( open-source), , , , , Enterprise. .

, , . , , , , .

0

Source: https://habr.com/ru/post/1751546/


All Articles