Python random state in data set splitting

I am a little new to Python. Can anyone tell me why we set the random state to zero when splitting the data into training and test sets?

X_train, X_test, y_train, y_test = \
    train_test_split(X, y, test_size=0.30, random_state=0)

I have also seen situations where the random state is set to one:

X_train, X_test, y_train, y_test = \
    train_test_split(X, y, test_size=0.30, random_state=1)

What effect does this random state have during cross-validation?

+7
3 answers

It does not matter whether random_state is 0 or 1 or any other integer. What matters is that it is set to the same value if you want to validate your processing over multiple runs of the code. Incidentally, random_state=42 appears in many official scikit-learn examples.

random_state, as the name suggests, initializes the internal random number generator, which decides how the data are split into train and test indices. The documentation states that:

If random_state is None or np.random, a randomly initialized RandomState object is used.

If random_state is an integer, it is used to seed a new RandomState object.

If random_state is a RandomState object, it is passed through as-is.

The point is to be able to check and validate the results when running the code multiple times. Setting random_state to a fixed value guarantees that the same sequence of random numbers is generated on every run. Unless there is some other source of randomness in the process, the results will always be the same, which makes the output easy to verify.
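The reproducibility described above is easy to check. Below is a minimal sketch (assuming scikit-learn and NumPy are installed; the arrays are toy data standing in for a real feature matrix and labels):

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data standing in for a real feature matrix and labels.
X = np.arange(20).reshape(10, 2)
y = np.arange(10)

# Two calls with the same integer seed produce identical splits.
split_a = train_test_split(X, y, test_size=0.30, random_state=0)
split_b = train_test_split(X, y, test_size=0.30, random_state=0)

same = all(np.array_equal(a, b) for a, b in zip(split_a, split_b))
print(same)  # True: a fixed seed makes the split reproducible
```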

+15

random_state splits the data randomly, but with a twist: the ordering of the data is fixed for a particular value of random_state. Note that it is not a boolean; any integer from 0 upwards can be passed as random_state, and each value pins down a permanent ordering. For example, the split you get with random_state=0 always stays the same. If you then run random_state=5 and afterwards come back to random_state=0, you get the same order as before, and the same holds for every integer. However, random_state=None splits differently each time.

Hope this answer helps.
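The "permanent order per seed" point above can be sketched as follows (an illustration with toy arrays, not the poster's exact code): running a different seed in between does not change what seed 0 produces.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data standing in for real features and labels.
X = np.arange(40).reshape(20, 2)
y = np.arange(20)

_, first_test, _, _ = train_test_split(X, y, test_size=0.30, random_state=0)
_, other_test, _, _ = train_test_split(X, y, test_size=0.30, random_state=5)  # a different seed in between
_, again_test, _, _ = train_test_split(X, y, test_size=0.30, random_state=0)  # back to seed 0

print(np.array_equal(first_test, again_test))  # True: seed 0 always yields the same split
```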

+1

If you do not set random_state in the code, then a new random value is used every time you execute it, and the train and test datasets will contain different values each time.

However, if you use a specific value for random_state (random_state=1 or any other fixed value), the result is the same every time, i.e. the same values end up in the train and test datasets.
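A short sketch of the contrast described above, assuming scikit-learn and NumPy with toy arrays in place of real data:

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data; 100 rows so two independent shuffles essentially never coincide.
X = np.arange(200).reshape(100, 2)
y = np.arange(100)

# Fixed seed: the test set is the same on every call.
_, test_1, _, _ = train_test_split(X, y, test_size=0.30, random_state=1)
_, test_2, _, _ = train_test_split(X, y, test_size=0.30, random_state=1)
print(np.array_equal(test_1, test_2))  # True

# random_state=None: each call draws a fresh split.
_, test_3, _, _ = train_test_split(X, y, test_size=0.30, random_state=None)
_, test_4, _, _ = train_test_split(X, y, test_size=0.30, random_state=None)
print(np.array_equal(test_3, test_4))  # almost certainly False
```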

0

Source: https://habr.com/ru/post/1669526/

