I have a dataset that looks like this:
id name c1 c2 c3 c4 ... c50
-----------------------------------------------
1 string1 0.1 0.32 0.54 -1.2 ... 2.3
2 string2 0.12 0.12 -0.34 2.45 ... 1.3
...
(millions of records)
So, I have an identifier column, a row column, and then 50 floating point columns.
Only one type of query will be executed in this data, which in a traditional SQL SELECT statement will look like this:
SELECT name FROM table WHERE ((a1-c1)+(a2-c2)+(a3-c3)+...+(a50-c50)) > 1;where a1,a2,a3,etcare the values that are generated before sending the request (do not fit into the data table).
My question is: Does anyone have any recommendations as to which type of database will handle this type of query the fastest. I used SQL server(which is very slow), so I am looking for other opinions.
Will there be a way to optimize the SQL server for this type of query? I was also interested in learning about column storage databases such as MonetDB. Or perhaps a document repository database, for example MongoDB. Anyone have any suggestions?
Thanks a lot, Brett
source
share