I am going to compile data from different data sets into one data set for analysis. I will be engaged in data exploration, trying different things to find out what patterns can be hidden in the data, so at the moment I do not have a specific method. Now I am wondering if I should compile my data in long or wide format.
Which format should be used and why?
I understand that data can be reformatted from long to wide or vice versa, but the simple existence of this functionality implies that sometimes there is a need to change the form, and this need in turn implies that a particular format may be better suited to a specific task. So, when do I need which format and why?
I am not asking about performance. This has been considered in other matters.
user1322720
source
share