@cel
I think in my case one could also use the following pattern.
import numpy import pandas import graphlab df abc 1 1 1 1 2 3 2 1 3 .... df['log c'] = df.groupby('a')['c'].apply(lambda x: numpy.log(x))
for an SFrame object ( sf instead of df ) it may look a little different
logvals = numpy.log(sf['c']) log_sf = graphlab.SFrame(logvals) sf = sf.join(log_sf, how = 'outer')
The code snippet may be a bit long with numpy , but it works ...
The main problem is, of course, runtime. I really hoped I could use a specific function to minimize my time ....
source share