How to Access Scikit Learn Nested Cross-Validation Scores

Question

I am using Python and would like to do nested cross-validation with scikit-learn. I found a very good example:

import numpy as np
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score

# X_iris, y_iris, svr and p_grid are assumed to be defined as in the example

NUM_TRIALS = 30
non_nested_scores = np.zeros(NUM_TRIALS)
nested_scores = np.zeros(NUM_TRIALS)

for i in range(NUM_TRIALS):
    # Choose cross-validation techniques for the inner and outer loops,
    # independently of the dataset.
    # E.g. "GroupKFold", "LeaveOneOut", "LeaveOneGroupOut", etc.
    inner_cv = KFold(n_splits=4, shuffle=True, random_state=i)
    outer_cv = KFold(n_splits=4, shuffle=True, random_state=i)

    # Non-nested parameter search and scoring
    clf = GridSearchCV(estimator=svr, param_grid=p_grid, cv=inner_cv)
    clf.fit(X_iris, y_iris)
    non_nested_scores[i] = clf.best_score_

    # Nested CV with parameter optimization
    nested_score = cross_val_score(clf, X=X_iris, y=y_iris, cv=outer_cv)
    nested_scores[i] = nested_score.mean()

How can I get access to the best set of parameters, as well as to all sets of parameters (with their corresponding evaluation) from the nested cross validation?

python scikit-learn machine-learning

machinery Jan 26 '17 at 16:03

1 answer

Vivek Kumar · Accepted Answer · 2017-01-30T07:26:36+0000

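You cannot access the individual parameter sets or the best parameters through cross_val_score. Internally, cross_val_score clones the supplied estimator and then calls fit and score on the clones with the given X, y for each split.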
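A minimal sketch of that cloning behaviour, assuming an SVC on the iris data as in the question. The parameter grid here is hypothetical and chosen only for illustration. After cross_val_score returns, the clf object you passed in has not been fitted, so there is nothing to read best_params_ from:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score
from sklearn.svm import SVC

X_iris, y_iris = load_iris(return_X_y=True)

# Hypothetical grid, not taken from the question
p_grid = {"C": [1, 10], "gamma": [0.01, 0.1]}

inner_cv = KFold(n_splits=4, shuffle=True, random_state=0)
outer_cv = KFold(n_splits=4, shuffle=True, random_state=0)

clf = GridSearchCV(estimator=SVC(kernel="rbf"), param_grid=p_grid, cv=inner_cv)

# cross_val_score fits clones of clf (one per outer fold) and returns only scores
nested_score = cross_val_score(clf, X=X_iris, y=y_iris, cv=outer_cv)

# The original clf was never fitted, so it carries no best_params_ attribute
clf_is_fitted = hasattr(clf, "best_params_")
print(nested_score.shape, clf_is_fitted)
```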

If you want to access the parameters at each split, you can use:

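# Put the code below inside your NUM_TRIALS for loop.
# nested_scores_train and nested_scores_test must be created before that loop,
# e.g. as np.zeros(NUM_TRIALS) each.
temp_nested_scores_train = np.zeros(4)  # one slot per outer fold (n_splits=4)
temp_nested_scores_test = np.zeros(4)
for cv_iter, (train, test) in enumerate(outer_cv.split(X_iris)):
    clf.fit(X_iris[train], y_iris[train])
    # Best inner-CV score found by the grid search on this outer training fold
    temp_nested_scores_train[cv_iter] = clf.best_score_
    # Score of the refitted best estimator on the held-out outer fold
    temp_nested_scores_test[cv_iter] = clf.score(X_iris[test], y_iris[test])
    # You can access the grid search params here:
    # clf.best_params_ (best set) and clf.cv_results_ (all sets with their scores)
nested_scores_train[i] = temp_nested_scores_train.mean()
nested_scores_test[i] = temp_nested_scores_test.mean()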
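As an alternative to the manual loop (this is not part of the original answer), newer scikit-learn versions (0.20 and later) let cross_validate hand back the fitted clones via return_estimator=True. The sketch below assumes an SVC on iris and a hypothetical parameter grid. The fold_summaries list is an illustrative name, not a library feature:

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, KFold, cross_validate
from sklearn.svm import SVC

X_iris, y_iris = load_iris(return_X_y=True)
p_grid = {"C": [1, 10, 100], "gamma": [0.01, 0.1]}  # hypothetical grid

inner_cv = KFold(n_splits=4, shuffle=True, random_state=0)
outer_cv = KFold(n_splits=4, shuffle=True, random_state=0)
clf = GridSearchCV(estimator=SVC(kernel="rbf"), param_grid=p_grid, cv=inner_cv)

# return_estimator=True keeps the fitted GridSearchCV clone from each outer fold
results = cross_validate(
    clf, X=X_iris, y=y_iris, cv=outer_cv, return_estimator=True
)

fold_summaries = []
for fold, (search, test_score) in enumerate(
    zip(results["estimator"], results["test_score"])
):
    fold_summaries.append({
        "fold": fold,
        "best_params": search.best_params_,      # best set for this outer fold
        "inner_best_score": search.best_score_,  # its mean inner-CV score
        "outer_test_score": test_score,          # score on the held-out outer fold
        # every candidate parameter set with its mean inner-CV score
        "all_candidates": list(
            zip(search.cv_results_["params"],
                search.cv_results_["mean_test_score"])
        ),
    })

for s in fold_summaries:
    print(s["fold"], s["best_params"], round(s["outer_test_score"], 3))
print("nested score:", np.mean(results["test_score"]))
```

Note that each outer fold may select a different best parameter set, so nested cross-validation yields one set per fold rather than a single overall winner.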
