Excuse me if this has already been asked (I know very little about Data Warehouse / BI and have not yet mastered the keywords).
I have a table that grows by more than 100,000 rows per day, each row has a timestamp and multiple information about the element (size, weight, color, etc.). Some data may be useful about a month after this period, we are only interested in clusters. I have special software that allows me to visualize individual lines in more detail and mainly use PowerPivot for my reporting needs.
I could find an SQL query that populated a new table daily: In which I would have a row for each hour / point / batch, and I would summarize the information (sum / average / stddev / etc.)
During the day, my script will be running and I can use powerpivot against this new table. All this, staying where I feel comfortable: plain old SQL.
From a few of the information I gathered to read about DataWarehouse and BI, what I'm going to do sounds just like creating dimensions and facts. So my question is: is it worth investigating further in this direction (BI), or since my problem is relatively simple, I would rather stay in a relational database.
NB Reports that are produced are usually linked to another database for more informative information. A task that is very well performed by Powerpivot.
source share