I have several time series entries that partially overlap and do not necessarily have the same start and end dates. Each row represents a different time series. I made them the same length to maintain the actual data collection time.
For example, at t (1,2,3,4,5,6):
Station 1: nan, nan, 2, 4, 5, 10 Station 2: nan, 1, 4, nan, 10, 8 Station 3: 1, 9, 4, 7, nan, nan
I am trying to run cluster analysis in Python to group stations with similar behavior where action time is important, so I can't just get rid of nans. (What do I know).
Any ideas?
source share