What is the difference between a Sklearn pipeline and a DataFrameMapper?

Sklearn Pipeline: http://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.html

DataFrameMapper: https://github.com/paulgb/sklearn-pandas

What is the difference between the two?

It seems to me that the sklearn pipeline has more features, but the DataFrameMapper is cleaner for me.

+4
source share
1 answer

Basically, the package DataFrameMapper(and the entire sklearn-pandas package) aims to combine the benefits of pandas DataFramewith the power of the sklearn machine learning package.

An sklearn.Pipeline , ( numpy scipy). .

( ) - , , .

A DataFrameMapper, , DataFrame, pandas, ( ). DataFrames numpy scipy, . DataFrames , .

DataFrameMapper , .

:

  • A DataFrameMapper - , , sklearn.Pipeline , .
  • A DataFrameMapper , pandas DataFrame, sklearn.Pipeline / numpy/scipy.
+5

Source: https://habr.com/ru/post/1659468/


All Articles