Observation Value - OpenAI Gym

I want to know the observation specification CartPole-v0in OpenAI Gym ( https://gym.openai.com/ ).

For example, in the following code output observation. One observation is similar to [-0.061586 -0.75893141 0.05793238 1.15547541]I want to know what the numbers mean. And I want any way to find out the specification of others Environments, such as MountainCar-v0, MsPacman-v0etc.

I tried to read https://github.com/openai/gym , but I don't know that. Could you tell me how to find out the specifications?

import gym
env = gym.make('CartPole-v0')
for i_episode in range(20):
    observation = env.reset()
    for t in range(100):
        env.render()
        print(observation)
        action = env.action_space.sample()
        observation, reward, done, info = env.step(action)
        if done:
            print("Episode finished after {} timesteps".format(t+1))
            break

(from https://gym.openai.com/docs )

The next way out

[-0.061586   -0.75893141  0.05793238  1.15547541]
[-0.07676463 -0.95475889  0.08104189  1.46574644]
[-0.0958598  -1.15077434  0.11035682  1.78260485]
[-0.11887529 -0.95705275  0.14600892  1.5261692 ]
[-0.13801635 -0.7639636   0.1765323   1.28239155]
[-0.15329562 -0.57147373  0.20218013  1.04977545]
Episode finished after 14 timesteps
[-0.02786724  0.00361763 -0.03938967 -0.01611184]
[-0.02779488 -0.19091794 -0.03971191  0.26388759]
[-0.03161324  0.00474768 -0.03443415 -0.04105167]
+4
2

, - OpenAI Gym, , , , CartPole-v0 :

[Barto83] . . , . . . . , " , ", IEEE Transactions on Systems, Man and Cybernetics, 1983.

, :

, observation - .

, MountainCar-v0

[Moore90] , , , , 1990 .

..

+3

, OpenAI Gym, . OpenAI wiki, . 4- , :

Num Observation Min Max 0 Cart Position -2.4 2.4 1 Cart Velocity -Inf Inf 2 Pole Angle ~ -41.8° ~ 41.8° 3 Pole Velocity At Tip -Inf Inf

+3

Source: https://habr.com/ru/post/1653740/


All Articles