I believe that some classifiers in sklearn
have a method partial_fit
. This method allows you to transfer thumbnails of data to the classifier, so that for each mini-channel, the gradient descent step is performed. You just loaded the mini-disk from the disk, transfer it to partial_fit
, release the mini-compartment from the memory and repeat.
If you are particularly interested in doing this for logistic regression, then you will want to use SGDClassifier
which you can set to use logistic regression when loss = 'log'
.
You simply pass the functions and shortcuts for your mini-camera to the partial_fit
same way you would use fit
:
clf.partial_fit(X_minibatch, y_minibatch)
Update
dask-ml
library, , dask
partial_fit
. - .