I have a set of columns (col1, col2, col3) in dataframe df1 I have another set of columns (col4, col5, col6) in dataframe df2 Suppose these two data frames have the same number of rows.
How to create a correlation table that makes pair correlation between df1 and df2?
the table will look like
col1 col2 col3 col4 .. .. .. col5 .. .. .. col6 .. .. ..
I am using df1.corrwith(df2) , it does not seem to generate a table as needed.
I asked a similar question here: How to correlate between two data files with different column names but now I am dealing with categorical columns.
If this is not directly comparable, is there a standard way to make it comparable (e.g. using get_dummies)? and is this a faster way to automatically process all fields (suppose all of them are categorical) and calculate their correlation?
source share