I am trying to calculate the percentile of each number within a data frame and add it to a new column called "percentile".
This is my attempt:
import pandas as pd
from scipy import stats
data = {'symbol':'FB','date':['2012-05-18','2012-05-21','2012-05-22','2012-05-23'],'close':[38.23,34.03,31.00,32.00]}
df = pd.DataFrame(data)
close = df['close']
for i in df:
df['percentile'] = stats.percentileofscore(close,df['close'])
The column is not populated and results in "NaN". This should be pretty easy, but I'm not sure where I am going wrong.
Thanks in advance for your help.
source
share