Trying to select data from all columns starting with a row from pandas dataframe

Question

Trying to select data from all columns starting with a row from pandas dataframe

I am trying to select all columns starting with a specific row and then populate all null values with a new value. What I'm doing now is turning all the column headers into a list instead.

lifestyle_var = [col for col in list(df) if col.startswith('lifestyle')]

df[lifestyle_var].fillna(1, inplace=True)

+4

python null pandas dataframe

Ron Sep 17 '15 at 16:22

source share

2 answers

Try

df.update(df[lifestyle_var].fillna(1))

Cm. .

Example:

import pandas as pd
import numpy as np
data = pd.DataFrame([ [ 1, 2, np.nan ], [ np.nan, np.nan, 6] ], columns=   ['a1', 'b', 'a2'])
vars = [ col for col in list(data) if col.startswith('a')]
data.update(data[vars].fillna(value=1))

0

vmg Sep 17 '15 at 16:50

source share

Vlad Mironov · Accepted Answer · 2015-09-17T16:53:53+0000

I had the same problem: https://github.com/pydata/pandas/issues/10342

You can use this command: df.loc[:,lifestyle_var] = df.loc[:,lifestyle_var].fillna(1)

This problem occurs because you are trying to populate a copy of the data frame, not the original data.

Trying to select data from all columns starting with a row from pandas dataframe

More articles: