What grade to use in scikit-learn?

Question

What grade to use in scikit-learn?

This is my first machine learning brush, so I'm trying to figure out how it all works. I have a data set where I collected all the statistics of each player to play with my baseball team in high school. I also have a list of all the players who have ever done this in MLB from my high school. What I would like to do is to split the data into a training set and a test set, and then submit it to some algorithm in the scikit-learn package and predict the probability of creating an MLB.

So, I looked through a number of sources and found this cheat sheet, which suggests starting with linear SVC.

, , , , , ( , , yada, yada), X_train; , 1 ( MLB) 0 ( MLB), Y_train. Fit (X, Y), pred (X_test), , Y_test.

, ?

EDIT, : 20 , , , , .. ; - , .

10 , ; , , , 1% MLB.

+4

algorithm python-2.7 scikit-learn machine-learning

Zach 26 . '16 20:07

2

ML, scikit learn ( ) ( ).

, , :

http://scikit-learn.org/stable/modules/generated/sklearn.tree.DecisionTreeClassifier.html

- , , .

, , :

http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html

:

http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html

, SVM.

, :

https://en.wikipedia.org/wiki/Cross-validation_(statistics)

- . coursera , :

https://www.coursera.org/learn/machine-learning

+1

Rob 26 . '16 21:18

kraskevich · Accepted Answer · 2016-10-26T21:18:54+0000

, , :

. , , . /. X% , scikit-learn : http://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html. , : . /, , , , 70% 70% .
. : http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html, API.

:

clf = LogisticRegression()
clf.fit(X_train, y_train)

:
```
y_pred = clf.predict(X_test)
```
. - : , , 0, . f1: http://scikit-learn.org/stable/modules/generated/sklearn.metrics.f1_score.html.
, predict_proba .

. ! , , , , , , .

What grade to use in scikit-learn?

More articles: