Weighted Gaussian kernel density estimate in `python`

Question

Weighted Gaussian kernel density estimate in `python`

It is currently not possible to use scipy.stats.gaussian_kde to estimate the density of a random variable based on weighted samples . What methods are available for estimating the density of continuous random variables based on weighted samples?

+6

python scipy statistics kernel-density

Till hoffmann Dec 23 '14 at 16:06

source share

2 answers

Check out the PyQT-Fit packages and statistics for Python. It seems that they have a kernel density estimate with weighted observations.

0

dikdirk Jun 03 '15 at 16:23

source share

Till hoffmann · Accepted Answer · 2014-12-23T16:06:37+0000

Neither sklearn.neighbors.KernelDensity nor statsmodels.nonparametric seems to support weighted samples. I modified scipy.stats.gaussian_kde to provide heterogeneous sample weights and thought that the results might be useful to others. An example is shown below.

example

You can find the ipython laptop ipython : http://nbviewer.ipython.org/gist/tillahoffmann/f844bce2ec264c1c8cb5

Implementation Details

Weighted average arithmetic mean

weighted arithmetic mean

the covariance matrix of unbiased data is then determined unbiased covariance matrix

Bandwidth can be selected by scott or silverman rules, as in scipy . However, the number of samples used to calculate the bandwidth, Kish approximation for the effective sample size .

Weighted Gaussian kernel density estimate in `python`

Implementation Details

More articles: