Tutorial_OpenAI_Gym - sankhaMukherjee/RLalgos GitHub Wiki

This is the .md version of this Notebook

cd ../src
/home/sankha/Documents/programs/ML/RLalgos/src

OpenAI Gym

The OpenAI gym environment should be already installed in this virtual environment. A few cases of its sue are as follows:

import gym
from gym import envs

1. Lets check the available environments

A set of all names can be obtained with the .regustry.all() method. Howeever, it is important to note that not all the names can be used. The ones available for free will allow the environment to be reset(). We can use this function to check whether the particular name can be used.

list(envs.registry.all())[:10]
[EnvSpec(Copy-v0),
 EnvSpec(RepeatCopy-v0),
 EnvSpec(ReversedAddition-v0),
 EnvSpec(ReversedAddition3-v0),
 EnvSpec(DuplicatedInput-v0),
 EnvSpec(Reverse-v0),
 EnvSpec(CartPole-v0),
 EnvSpec(CartPole-v1),
 EnvSpec(MountainCar-v0),
 EnvSpec(MountainCarContinuous-v0)]
allNames        = [str(k)[8:-1]  for k in envs.registry.all()]
currName        = allNames[0]
allNamesReduced = [currName] 

currName = currName.split('-')[0]
for name in allNames[1:]:
    if name.startswith(currName):
        continue
    allNamesReduced.append(name)
    currName = name.split('-')[0]

print(f'Number of available names: {len(allNames)}\n')
print(f'All the names:\n\t{", ".join(allNames)}')
Number of available names: 833

All the names:
	Copy-v0, RepeatCopy-v0, ReversedAddition-v0, ReversedAddition3-v0, DuplicatedInput-v0, Reverse-v0, CartPole-v0, CartPole-v1, MountainCar-v0, MountainCarContinuous-v0, Pendulum-v0, Acrobot-v1, LunarLander-v2, LunarLanderContinuous-v2, BipedalWalker-v2, BipedalWalkerHardcore-v2, CarRacing-v0, Blackjack-v0, KellyCoinflip-v0, KellyCoinflipGeneralized-v0, FrozenLake-v0, FrozenLake8x8-v0, CliffWalking-v0, NChain-v0, Roulette-v0, Taxi-v2, GuessingGame-v0, HotterColder-v0, Reacher-v2, Pusher-v2, Thrower-v2, Striker-v2, InvertedPendulum-v2, InvertedDoublePendulum-v2, HalfCheetah-v2, HalfCheetah-v3, Hopper-v2, Hopper-v3, Swimmer-v2, Swimmer-v3, Walker2d-v2, Walker2d-v3, Ant-v2, Ant-v3, Humanoid-v2, Humanoid-v3, HumanoidStandup-v2, FetchSlide-v1, FetchPickAndPlace-v1, FetchReach-v1, FetchPush-v1, HandReach-v0, HandManipulateBlockRotateZ-v0, HandManipulateBlockRotateParallel-v0, HandManipulateBlockRotateXYZ-v0, HandManipulateBlockFull-v0, HandManipulateBlock-v0, HandManipulateBlockTouchSensors-v0, HandManipulateEggRotate-v0, HandManipulateEggFull-v0, HandManipulateEgg-v0, HandManipulateEggTouchSensors-v0, HandManipulatePenRotate-v0, HandManipulatePenFull-v0, HandManipulatePen-v0, HandManipulatePenTouchSensors-v0, FetchSlideDense-v1, FetchPickAndPlaceDense-v1, FetchReachDense-v1, FetchPushDense-v1, HandReachDense-v0, HandManipulateBlockRotateZDense-v0, HandManipulateBlockRotateParallelDense-v0, HandManipulateBlockRotateXYZDense-v0, HandManipulateBlockFullDense-v0, HandManipulateBlockDense-v0, HandManipulateBlockTouchSensorsDense-v0, HandManipulateEggRotateDense-v0, HandManipulateEggFullDense-v0, HandManipulateEggDense-v0, HandManipulateEggTouchSensorsDense-v0, HandManipulatePenRotateDense-v0, HandManipulatePenFullDense-v0, HandManipulatePenDense-v0, HandManipulatePenTouchSensorsDense-v0, Adventure-v0, Adventure-v4, AdventureDeterministic-v0, AdventureDeterministic-v4, AdventureNoFrameskip-v0, AdventureNoFrameskip-v4, Adventure-ram-v0, Adventure-ram-v4, Adventure-ramDeterministic-v0, Adventure-ramDeterministic-v4, Adventure-ramNoFrameskip-v0, Adventure-ramNoFrameskip-v4, AirRaid-v0, AirRaid-v4, AirRaidDeterministic-v0, AirRaidDeterministic-v4, AirRaidNoFrameskip-v0, AirRaidNoFrameskip-v4, AirRaid-ram-v0, AirRaid-ram-v4, AirRaid-ramDeterministic-v0, AirRaid-ramDeterministic-v4, AirRaid-ramNoFrameskip-v0, AirRaid-ramNoFrameskip-v4, Alien-v0, Alien-v4, AlienDeterministic-v0, AlienDeterministic-v4, AlienNoFrameskip-v0, AlienNoFrameskip-v4, Alien-ram-v0, Alien-ram-v4, Alien-ramDeterministic-v0, Alien-ramDeterministic-v4, Alien-ramNoFrameskip-v0, Alien-ramNoFrameskip-v4, Amidar-v0, Amidar-v4, AmidarDeterministic-v0, AmidarDeterministic-v4, AmidarNoFrameskip-v0, AmidarNoFrameskip-v4, Amidar-ram-v0, Amidar-ram-v4, Amidar-ramDeterministic-v0, Amidar-ramDeterministic-v4, Amidar-ramNoFrameskip-v0, Amidar-ramNoFrameskip-v4, Assault-v0, Assault-v4, AssaultDeterministic-v0, AssaultDeterministic-v4, AssaultNoFrameskip-v0, AssaultNoFrameskip-v4, Assault-ram-v0, Assault-ram-v4, Assault-ramDeterministic-v0, Assault-ramDeterministic-v4, Assault-ramNoFrameskip-v0, Assault-ramNoFrameskip-v4, Asterix-v0, Asterix-v4, AsterixDeterministic-v0, AsterixDeterministic-v4, AsterixNoFrameskip-v0, AsterixNoFrameskip-v4, Asterix-ram-v0, Asterix-ram-v4, Asterix-ramDeterministic-v0, Asterix-ramDeterministic-v4, Asterix-ramNoFrameskip-v0, Asterix-ramNoFrameskip-v4, Asteroids-v0, Asteroids-v4, AsteroidsDeterministic-v0, AsteroidsDeterministic-v4, AsteroidsNoFrameskip-v0, AsteroidsNoFrameskip-v4, Asteroids-ram-v0, Asteroids-ram-v4, Asteroids-ramDeterministic-v0, Asteroids-ramDeterministic-v4, Asteroids-ramNoFrameskip-v0, Asteroids-ramNoFrameskip-v4, Atlantis-v0, Atlantis-v4, AtlantisDeterministic-v0, AtlantisDeterministic-v4, AtlantisNoFrameskip-v0, AtlantisNoFrameskip-v4, Atlantis-ram-v0, Atlantis-ram-v4, Atlantis-ramDeterministic-v0, Atlantis-ramDeterministic-v4, Atlantis-ramNoFrameskip-v0, Atlantis-ramNoFrameskip-v4, BankHeist-v0, BankHeist-v4, BankHeistDeterministic-v0, BankHeistDeterministic-v4, BankHeistNoFrameskip-v0, BankHeistNoFrameskip-v4, BankHeist-ram-v0, BankHeist-ram-v4, BankHeist-ramDeterministic-v0, BankHeist-ramDeterministic-v4, BankHeist-ramNoFrameskip-v0, BankHeist-ramNoFrameskip-v4, BattleZone-v0, BattleZone-v4, BattleZoneDeterministic-v0, BattleZoneDeterministic-v4, BattleZoneNoFrameskip-v0, BattleZoneNoFrameskip-v4, BattleZone-ram-v0, BattleZone-ram-v4, BattleZone-ramDeterministic-v0, BattleZone-ramDeterministic-v4, BattleZone-ramNoFrameskip-v0, BattleZone-ramNoFrameskip-v4, BeamRider-v0, BeamRider-v4, BeamRiderDeterministic-v0, BeamRiderDeterministic-v4, BeamRiderNoFrameskip-v0, BeamRiderNoFrameskip-v4, BeamRider-ram-v0, BeamRider-ram-v4, BeamRider-ramDeterministic-v0, BeamRider-ramDeterministic-v4, BeamRider-ramNoFrameskip-v0, BeamRider-ramNoFrameskip-v4, Berzerk-v0, Berzerk-v4, BerzerkDeterministic-v0, BerzerkDeterministic-v4, BerzerkNoFrameskip-v0, BerzerkNoFrameskip-v4, Berzerk-ram-v0, Berzerk-ram-v4, Berzerk-ramDeterministic-v0, Berzerk-ramDeterministic-v4, Berzerk-ramNoFrameskip-v0, Berzerk-ramNoFrameskip-v4, Bowling-v0, Bowling-v4, BowlingDeterministic-v0, BowlingDeterministic-v4, BowlingNoFrameskip-v0, BowlingNoFrameskip-v4, Bowling-ram-v0, Bowling-ram-v4, Bowling-ramDeterministic-v0, Bowling-ramDeterministic-v4, Bowling-ramNoFrameskip-v0, Bowling-ramNoFrameskip-v4, Boxing-v0, Boxing-v4, BoxingDeterministic-v0, BoxingDeterministic-v4, BoxingNoFrameskip-v0, BoxingNoFrameskip-v4, Boxing-ram-v0, Boxing-ram-v4, Boxing-ramDeterministic-v0, Boxing-ramDeterministic-v4, Boxing-ramNoFrameskip-v0, Boxing-ramNoFrameskip-v4, Breakout-v0, Breakout-v4, BreakoutDeterministic-v0, BreakoutDeterministic-v4, BreakoutNoFrameskip-v0, BreakoutNoFrameskip-v4, Breakout-ram-v0, Breakout-ram-v4, Breakout-ramDeterministic-v0, Breakout-ramDeterministic-v4, Breakout-ramNoFrameskip-v0, Breakout-ramNoFrameskip-v4, Carnival-v0, Carnival-v4, CarnivalDeterministic-v0, CarnivalDeterministic-v4, CarnivalNoFrameskip-v0, CarnivalNoFrameskip-v4, Carnival-ram-v0, Carnival-ram-v4, Carnival-ramDeterministic-v0, Carnival-ramDeterministic-v4, Carnival-ramNoFrameskip-v0, Carnival-ramNoFrameskip-v4, Centipede-v0, Centipede-v4, CentipedeDeterministic-v0, CentipedeDeterministic-v4, CentipedeNoFrameskip-v0, CentipedeNoFrameskip-v4, Centipede-ram-v0, Centipede-ram-v4, Centipede-ramDeterministic-v0, Centipede-ramDeterministic-v4, Centipede-ramNoFrameskip-v0, Centipede-ramNoFrameskip-v4, ChopperCommand-v0, ChopperCommand-v4, ChopperCommandDeterministic-v0, ChopperCommandDeterministic-v4, ChopperCommandNoFrameskip-v0, ChopperCommandNoFrameskip-v4, ChopperCommand-ram-v0, ChopperCommand-ram-v4, ChopperCommand-ramDeterministic-v0, ChopperCommand-ramDeterministic-v4, ChopperCommand-ramNoFrameskip-v0, ChopperCommand-ramNoFrameskip-v4, CrazyClimber-v0, CrazyClimber-v4, CrazyClimberDeterministic-v0, CrazyClimberDeterministic-v4, CrazyClimberNoFrameskip-v0, CrazyClimberNoFrameskip-v4, CrazyClimber-ram-v0, CrazyClimber-ram-v4, CrazyClimber-ramDeterministic-v0, CrazyClimber-ramDeterministic-v4, CrazyClimber-ramNoFrameskip-v0, CrazyClimber-ramNoFrameskip-v4, Defender-v0, Defender-v4, DefenderDeterministic-v0, DefenderDeterministic-v4, DefenderNoFrameskip-v0, DefenderNoFrameskip-v4, Defender-ram-v0, Defender-ram-v4, Defender-ramDeterministic-v0, Defender-ramDeterministic-v4, Defender-ramNoFrameskip-v0, Defender-ramNoFrameskip-v4, DemonAttack-v0, DemonAttack-v4, DemonAttackDeterministic-v0, DemonAttackDeterministic-v4, DemonAttackNoFrameskip-v0, DemonAttackNoFrameskip-v4, DemonAttack-ram-v0, DemonAttack-ram-v4, DemonAttack-ramDeterministic-v0, DemonAttack-ramDeterministic-v4, DemonAttack-ramNoFrameskip-v0, DemonAttack-ramNoFrameskip-v4, DoubleDunk-v0, DoubleDunk-v4, DoubleDunkDeterministic-v0, DoubleDunkDeterministic-v4, DoubleDunkNoFrameskip-v0, DoubleDunkNoFrameskip-v4, DoubleDunk-ram-v0, DoubleDunk-ram-v4, DoubleDunk-ramDeterministic-v0, DoubleDunk-ramDeterministic-v4, DoubleDunk-ramNoFrameskip-v0, DoubleDunk-ramNoFrameskip-v4, ElevatorAction-v0, ElevatorAction-v4, ElevatorActionDeterministic-v0, ElevatorActionDeterministic-v4, ElevatorActionNoFrameskip-v0, ElevatorActionNoFrameskip-v4, ElevatorAction-ram-v0, ElevatorAction-ram-v4, ElevatorAction-ramDeterministic-v0, ElevatorAction-ramDeterministic-v4, ElevatorAction-ramNoFrameskip-v0, ElevatorAction-ramNoFrameskip-v4, Enduro-v0, Enduro-v4, EnduroDeterministic-v0, EnduroDeterministic-v4, EnduroNoFrameskip-v0, EnduroNoFrameskip-v4, Enduro-ram-v0, Enduro-ram-v4, Enduro-ramDeterministic-v0, Enduro-ramDeterministic-v4, Enduro-ramNoFrameskip-v0, Enduro-ramNoFrameskip-v4, FishingDerby-v0, FishingDerby-v4, FishingDerbyDeterministic-v0, FishingDerbyDeterministic-v4, FishingDerbyNoFrameskip-v0, FishingDerbyNoFrameskip-v4, FishingDerby-ram-v0, FishingDerby-ram-v4, FishingDerby-ramDeterministic-v0, FishingDerby-ramDeterministic-v4, FishingDerby-ramNoFrameskip-v0, FishingDerby-ramNoFrameskip-v4, Freeway-v0, Freeway-v4, FreewayDeterministic-v0, FreewayDeterministic-v4, FreewayNoFrameskip-v0, FreewayNoFrameskip-v4, Freeway-ram-v0, Freeway-ram-v4, Freeway-ramDeterministic-v0, Freeway-ramDeterministic-v4, Freeway-ramNoFrameskip-v0, Freeway-ramNoFrameskip-v4, Frostbite-v0, Frostbite-v4, FrostbiteDeterministic-v0, FrostbiteDeterministic-v4, FrostbiteNoFrameskip-v0, FrostbiteNoFrameskip-v4, Frostbite-ram-v0, Frostbite-ram-v4, Frostbite-ramDeterministic-v0, Frostbite-ramDeterministic-v4, Frostbite-ramNoFrameskip-v0, Frostbite-ramNoFrameskip-v4, Gopher-v0, Gopher-v4, GopherDeterministic-v0, GopherDeterministic-v4, GopherNoFrameskip-v0, GopherNoFrameskip-v4, Gopher-ram-v0, Gopher-ram-v4, Gopher-ramDeterministic-v0, Gopher-ramDeterministic-v4, Gopher-ramNoFrameskip-v0, Gopher-ramNoFrameskip-v4, Gravitar-v0, Gravitar-v4, GravitarDeterministic-v0, GravitarDeterministic-v4, GravitarNoFrameskip-v0, GravitarNoFrameskip-v4, Gravitar-ram-v0, Gravitar-ram-v4, Gravitar-ramDeterministic-v0, Gravitar-ramDeterministic-v4, Gravitar-ramNoFrameskip-v0, Gravitar-ramNoFrameskip-v4, Hero-v0, Hero-v4, HeroDeterministic-v0, HeroDeterministic-v4, HeroNoFrameskip-v0, HeroNoFrameskip-v4, Hero-ram-v0, Hero-ram-v4, Hero-ramDeterministic-v0, Hero-ramDeterministic-v4, Hero-ramNoFrameskip-v0, Hero-ramNoFrameskip-v4, IceHockey-v0, IceHockey-v4, IceHockeyDeterministic-v0, IceHockeyDeterministic-v4, IceHockeyNoFrameskip-v0, IceHockeyNoFrameskip-v4, IceHockey-ram-v0, IceHockey-ram-v4, IceHockey-ramDeterministic-v0, IceHockey-ramDeterministic-v4, IceHockey-ramNoFrameskip-v0, IceHockey-ramNoFrameskip-v4, Jamesbond-v0, Jamesbond-v4, JamesbondDeterministic-v0, JamesbondDeterministic-v4, JamesbondNoFrameskip-v0, JamesbondNoFrameskip-v4, Jamesbond-ram-v0, Jamesbond-ram-v4, Jamesbond-ramDeterministic-v0, Jamesbond-ramDeterministic-v4, Jamesbond-ramNoFrameskip-v0, Jamesbond-ramNoFrameskip-v4, JourneyEscape-v0, JourneyEscape-v4, JourneyEscapeDeterministic-v0, JourneyEscapeDeterministic-v4, JourneyEscapeNoFrameskip-v0, JourneyEscapeNoFrameskip-v4, JourneyEscape-ram-v0, JourneyEscape-ram-v4, JourneyEscape-ramDeterministic-v0, JourneyEscape-ramDeterministic-v4, JourneyEscape-ramNoFrameskip-v0, JourneyEscape-ramNoFrameskip-v4, Kangaroo-v0, Kangaroo-v4, KangarooDeterministic-v0, KangarooDeterministic-v4, KangarooNoFrameskip-v0, KangarooNoFrameskip-v4, Kangaroo-ram-v0, Kangaroo-ram-v4, Kangaroo-ramDeterministic-v0, Kangaroo-ramDeterministic-v4, Kangaroo-ramNoFrameskip-v0, Kangaroo-ramNoFrameskip-v4, Krull-v0, Krull-v4, KrullDeterministic-v0, KrullDeterministic-v4, KrullNoFrameskip-v0, KrullNoFrameskip-v4, Krull-ram-v0, Krull-ram-v4, Krull-ramDeterministic-v0, Krull-ramDeterministic-v4, Krull-ramNoFrameskip-v0, Krull-ramNoFrameskip-v4, KungFuMaster-v0, KungFuMaster-v4, KungFuMasterDeterministic-v0, KungFuMasterDeterministic-v4, KungFuMasterNoFrameskip-v0, KungFuMasterNoFrameskip-v4, KungFuMaster-ram-v0, KungFuMaster-ram-v4, KungFuMaster-ramDeterministic-v0, KungFuMaster-ramDeterministic-v4, KungFuMaster-ramNoFrameskip-v0, KungFuMaster-ramNoFrameskip-v4, MontezumaRevenge-v0, MontezumaRevenge-v4, MontezumaRevengeDeterministic-v0, MontezumaRevengeDeterministic-v4, MontezumaRevengeNoFrameskip-v0, MontezumaRevengeNoFrameskip-v4, MontezumaRevenge-ram-v0, MontezumaRevenge-ram-v4, MontezumaRevenge-ramDeterministic-v0, MontezumaRevenge-ramDeterministic-v4, MontezumaRevenge-ramNoFrameskip-v0, MontezumaRevenge-ramNoFrameskip-v4, MsPacman-v0, MsPacman-v4, MsPacmanDeterministic-v0, MsPacmanDeterministic-v4, MsPacmanNoFrameskip-v0, MsPacmanNoFrameskip-v4, MsPacman-ram-v0, MsPacman-ram-v4, MsPacman-ramDeterministic-v0, MsPacman-ramDeterministic-v4, MsPacman-ramNoFrameskip-v0, MsPacman-ramNoFrameskip-v4, NameThisGame-v0, NameThisGame-v4, NameThisGameDeterministic-v0, NameThisGameDeterministic-v4, NameThisGameNoFrameskip-v0, NameThisGameNoFrameskip-v4, NameThisGame-ram-v0, NameThisGame-ram-v4, NameThisGame-ramDeterministic-v0, NameThisGame-ramDeterministic-v4, NameThisGame-ramNoFrameskip-v0, NameThisGame-ramNoFrameskip-v4, Phoenix-v0, Phoenix-v4, PhoenixDeterministic-v0, PhoenixDeterministic-v4, PhoenixNoFrameskip-v0, PhoenixNoFrameskip-v4, Phoenix-ram-v0, Phoenix-ram-v4, Phoenix-ramDeterministic-v0, Phoenix-ramDeterministic-v4, Phoenix-ramNoFrameskip-v0, Phoenix-ramNoFrameskip-v4, Pitfall-v0, Pitfall-v4, PitfallDeterministic-v0, PitfallDeterministic-v4, PitfallNoFrameskip-v0, PitfallNoFrameskip-v4, Pitfall-ram-v0, Pitfall-ram-v4, Pitfall-ramDeterministic-v0, Pitfall-ramDeterministic-v4, Pitfall-ramNoFrameskip-v0, Pitfall-ramNoFrameskip-v4, Pong-v0, Pong-v4, PongDeterministic-v0, PongDeterministic-v4, PongNoFrameskip-v0, PongNoFrameskip-v4, Pong-ram-v0, Pong-ram-v4, Pong-ramDeterministic-v0, Pong-ramDeterministic-v4, Pong-ramNoFrameskip-v0, Pong-ramNoFrameskip-v4, Pooyan-v0, Pooyan-v4, PooyanDeterministic-v0, PooyanDeterministic-v4, PooyanNoFrameskip-v0, PooyanNoFrameskip-v4, Pooyan-ram-v0, Pooyan-ram-v4, Pooyan-ramDeterministic-v0, Pooyan-ramDeterministic-v4, Pooyan-ramNoFrameskip-v0, Pooyan-ramNoFrameskip-v4, PrivateEye-v0, PrivateEye-v4, PrivateEyeDeterministic-v0, PrivateEyeDeterministic-v4, PrivateEyeNoFrameskip-v0, PrivateEyeNoFrameskip-v4, PrivateEye-ram-v0, PrivateEye-ram-v4, PrivateEye-ramDeterministic-v0, PrivateEye-ramDeterministic-v4, PrivateEye-ramNoFrameskip-v0, PrivateEye-ramNoFrameskip-v4, Qbert-v0, Qbert-v4, QbertDeterministic-v0, QbertDeterministic-v4, QbertNoFrameskip-v0, QbertNoFrameskip-v4, Qbert-ram-v0, Qbert-ram-v4, Qbert-ramDeterministic-v0, Qbert-ramDeterministic-v4, Qbert-ramNoFrameskip-v0, Qbert-ramNoFrameskip-v4, Riverraid-v0, Riverraid-v4, RiverraidDeterministic-v0, RiverraidDeterministic-v4, RiverraidNoFrameskip-v0, RiverraidNoFrameskip-v4, Riverraid-ram-v0, Riverraid-ram-v4, Riverraid-ramDeterministic-v0, Riverraid-ramDeterministic-v4, Riverraid-ramNoFrameskip-v0, Riverraid-ramNoFrameskip-v4, RoadRunner-v0, RoadRunner-v4, RoadRunnerDeterministic-v0, RoadRunnerDeterministic-v4, RoadRunnerNoFrameskip-v0, RoadRunnerNoFrameskip-v4, RoadRunner-ram-v0, RoadRunner-ram-v4, RoadRunner-ramDeterministic-v0, RoadRunner-ramDeterministic-v4, RoadRunner-ramNoFrameskip-v0, RoadRunner-ramNoFrameskip-v4, Robotank-v0, Robotank-v4, RobotankDeterministic-v0, RobotankDeterministic-v4, RobotankNoFrameskip-v0, RobotankNoFrameskip-v4, Robotank-ram-v0, Robotank-ram-v4, Robotank-ramDeterministic-v0, Robotank-ramDeterministic-v4, Robotank-ramNoFrameskip-v0, Robotank-ramNoFrameskip-v4, Seaquest-v0, Seaquest-v4, SeaquestDeterministic-v0, SeaquestDeterministic-v4, SeaquestNoFrameskip-v0, SeaquestNoFrameskip-v4, Seaquest-ram-v0, Seaquest-ram-v4, Seaquest-ramDeterministic-v0, Seaquest-ramDeterministic-v4, Seaquest-ramNoFrameskip-v0, Seaquest-ramNoFrameskip-v4, Skiing-v0, Skiing-v4, SkiingDeterministic-v0, SkiingDeterministic-v4, SkiingNoFrameskip-v0, SkiingNoFrameskip-v4, Skiing-ram-v0, Skiing-ram-v4, Skiing-ramDeterministic-v0, Skiing-ramDeterministic-v4, Skiing-ramNoFrameskip-v0, Skiing-ramNoFrameskip-v4, Solaris-v0, Solaris-v4, SolarisDeterministic-v0, SolarisDeterministic-v4, SolarisNoFrameskip-v0, SolarisNoFrameskip-v4, Solaris-ram-v0, Solaris-ram-v4, Solaris-ramDeterministic-v0, Solaris-ramDeterministic-v4, Solaris-ramNoFrameskip-v0, Solaris-ramNoFrameskip-v4, SpaceInvaders-v0, SpaceInvaders-v4, SpaceInvadersDeterministic-v0, SpaceInvadersDeterministic-v4, SpaceInvadersNoFrameskip-v0, SpaceInvadersNoFrameskip-v4, SpaceInvaders-ram-v0, SpaceInvaders-ram-v4, SpaceInvaders-ramDeterministic-v0, SpaceInvaders-ramDeterministic-v4, SpaceInvaders-ramNoFrameskip-v0, SpaceInvaders-ramNoFrameskip-v4, StarGunner-v0, StarGunner-v4, StarGunnerDeterministic-v0, StarGunnerDeterministic-v4, StarGunnerNoFrameskip-v0, StarGunnerNoFrameskip-v4, StarGunner-ram-v0, StarGunner-ram-v4, StarGunner-ramDeterministic-v0, StarGunner-ramDeterministic-v4, StarGunner-ramNoFrameskip-v0, StarGunner-ramNoFrameskip-v4, Tennis-v0, Tennis-v4, TennisDeterministic-v0, TennisDeterministic-v4, TennisNoFrameskip-v0, TennisNoFrameskip-v4, Tennis-ram-v0, Tennis-ram-v4, Tennis-ramDeterministic-v0, Tennis-ramDeterministic-v4, Tennis-ramNoFrameskip-v0, Tennis-ramNoFrameskip-v4, TimePilot-v0, TimePilot-v4, TimePilotDeterministic-v0, TimePilotDeterministic-v4, TimePilotNoFrameskip-v0, TimePilotNoFrameskip-v4, TimePilot-ram-v0, TimePilot-ram-v4, TimePilot-ramDeterministic-v0, TimePilot-ramDeterministic-v4, TimePilot-ramNoFrameskip-v0, TimePilot-ramNoFrameskip-v4, Tutankham-v0, Tutankham-v4, TutankhamDeterministic-v0, TutankhamDeterministic-v4, TutankhamNoFrameskip-v0, TutankhamNoFrameskip-v4, Tutankham-ram-v0, Tutankham-ram-v4, Tutankham-ramDeterministic-v0, Tutankham-ramDeterministic-v4, Tutankham-ramNoFrameskip-v0, Tutankham-ramNoFrameskip-v4, UpNDown-v0, UpNDown-v4, UpNDownDeterministic-v0, UpNDownDeterministic-v4, UpNDownNoFrameskip-v0, UpNDownNoFrameskip-v4, UpNDown-ram-v0, UpNDown-ram-v4, UpNDown-ramDeterministic-v0, UpNDown-ramDeterministic-v4, UpNDown-ramNoFrameskip-v0, UpNDown-ramNoFrameskip-v4, Venture-v0, Venture-v4, VentureDeterministic-v0, VentureDeterministic-v4, VentureNoFrameskip-v0, VentureNoFrameskip-v4, Venture-ram-v0, Venture-ram-v4, Venture-ramDeterministic-v0, Venture-ramDeterministic-v4, Venture-ramNoFrameskip-v0, Venture-ramNoFrameskip-v4, VideoPinball-v0, VideoPinball-v4, VideoPinballDeterministic-v0, VideoPinballDeterministic-v4, VideoPinballNoFrameskip-v0, VideoPinballNoFrameskip-v4, VideoPinball-ram-v0, VideoPinball-ram-v4, VideoPinball-ramDeterministic-v0, VideoPinball-ramDeterministic-v4, VideoPinball-ramNoFrameskip-v0, VideoPinball-ramNoFrameskip-v4, WizardOfWor-v0, WizardOfWor-v4, WizardOfWorDeterministic-v0, WizardOfWorDeterministic-v4, WizardOfWorNoFrameskip-v0, WizardOfWorNoFrameskip-v4, WizardOfWor-ram-v0, WizardOfWor-ram-v4, WizardOfWor-ramDeterministic-v0, WizardOfWor-ramDeterministic-v4, WizardOfWor-ramNoFrameskip-v0, WizardOfWor-ramNoFrameskip-v4, YarsRevenge-v0, YarsRevenge-v4, YarsRevengeDeterministic-v0, YarsRevengeDeterministic-v4, YarsRevengeNoFrameskip-v0, YarsRevengeNoFrameskip-v4, YarsRevenge-ram-v0, YarsRevenge-ram-v4, YarsRevenge-ramDeterministic-v0, YarsRevenge-ramDeterministic-v4, YarsRevenge-ramNoFrameskip-v0, YarsRevenge-ramNoFrameskip-v4, Zaxxon-v0, Zaxxon-v4, ZaxxonDeterministic-v0, ZaxxonDeterministic-v4, ZaxxonNoFrameskip-v0, ZaxxonNoFrameskip-v4, Zaxxon-ram-v0, Zaxxon-ram-v4, Zaxxon-ramDeterministic-v0, Zaxxon-ramDeterministic-v4, Zaxxon-ramNoFrameskip-v0, Zaxxon-ramNoFrameskip-v4, CubeCrash-v0, CubeCrashSparse-v0, CubeCrashScreenBecomesBlack-v0, MemorizeDigits-v0

2. Check whether an environment can be used:

for name in allNamesReduced:
    can = 'can not'
    try:
        env = gym.make(name)
        env.reset()
        env.close()
        can = 'can'
    except:
        can = 'can not'
    print(f'{name.rjust(40)} {can} be used')
                                 Copy-v0 can be used
                           RepeatCopy-v0 can be used
                     ReversedAddition-v0 can be used
                      DuplicatedInput-v0 can be used
                              Reverse-v0 can be used
                             CartPole-v0 can be used
                          MountainCar-v0 can be used
                             Pendulum-v0 can be used
                              Acrobot-v1 can be used
                          LunarLander-v2 can not be used
                        BipedalWalker-v2 can not be used
                            CarRacing-v0 can not be used
                            Blackjack-v0 can be used
                        KellyCoinflip-v0 can be used
                           FrozenLake-v0 can be used
                         CliffWalking-v0 can be used
                               NChain-v0 can be used
                             Roulette-v0 can be used
                                 Taxi-v2 can be used
                         GuessingGame-v0 can be used
                         HotterColder-v0 can be used
                              Reacher-v2 can not be used
                               Pusher-v2 can not be used
                              Thrower-v2 can not be used
                              Striker-v2 can not be used
                     InvertedPendulum-v2 can not be used
               InvertedDoublePendulum-v2 can not be used
                          HalfCheetah-v2 can not be used
                               Hopper-v2 can not be used
                              Swimmer-v2 can not be used
                             Walker2d-v2 can not be used
                                  Ant-v2 can not be used
                             Humanoid-v2 can not be used
                           FetchSlide-v1 can not be used
                    FetchPickAndPlace-v1 can not be used
                           FetchReach-v1 can not be used
                            FetchPush-v1 can not be used
                            HandReach-v0 can not be used
           HandManipulateBlockRotateZ-v0 can not be used
    HandManipulateBlockRotateParallel-v0 can not be used
         HandManipulateBlockRotateXYZ-v0 can not be used
              HandManipulateBlockFull-v0 can not be used
                  HandManipulateBlock-v0 can not be used
              HandManipulateEggRotate-v0 can not be used
                HandManipulateEggFull-v0 can not be used
                    HandManipulateEgg-v0 can not be used
              HandManipulatePenRotate-v0 can not be used
                HandManipulatePenFull-v0 can not be used
                    HandManipulatePen-v0 can not be used
                      FetchSlideDense-v1 can not be used
               FetchPickAndPlaceDense-v1 can not be used
                      FetchReachDense-v1 can not be used
                       FetchPushDense-v1 can not be used
                       HandReachDense-v0 can not be used
      HandManipulateBlockRotateZDense-v0 can not be used
HandManipulateBlockRotateParallelDense-v0 can not be used
    HandManipulateBlockRotateXYZDense-v0 can not be used
         HandManipulateBlockFullDense-v0 can not be used
             HandManipulateBlockDense-v0 can not be used
 HandManipulateBlockTouchSensorsDense-v0 can not be used
         HandManipulateEggRotateDense-v0 can not be used
           HandManipulateEggFullDense-v0 can not be used
               HandManipulateEggDense-v0 can not be used
   HandManipulateEggTouchSensorsDense-v0 can not be used
         HandManipulatePenRotateDense-v0 can not be used
           HandManipulatePenFullDense-v0 can not be used
               HandManipulatePenDense-v0 can not be used
   HandManipulatePenTouchSensorsDense-v0 can not be used
                            Adventure-v0 can be used
                              AirRaid-v0 can be used
                                Alien-v0 can be used
                               Amidar-v0 can be used
                              Assault-v0 can be used
                              Asterix-v0 can be used
                            Asteroids-v0 can be used
                             Atlantis-v0 can be used
                            BankHeist-v0 can be used
                           BattleZone-v0 can be used
                            BeamRider-v0 can be used
                              Berzerk-v0 can be used
                              Bowling-v0 can be used
                               Boxing-v0 can be used
                             Breakout-v0 can be used
                             Carnival-v0 can be used
                            Centipede-v0 can be used
                       ChopperCommand-v0 can be used
                         CrazyClimber-v0 can be used
                             Defender-v0 can be used
                          DemonAttack-v0 can be used
                           DoubleDunk-v0 can be used
                       ElevatorAction-v0 can be used
                               Enduro-v0 can be used
                         FishingDerby-v0 can be used
                              Freeway-v0 can be used
                            Frostbite-v0 can be used
                               Gopher-v0 can be used
                             Gravitar-v0 can be used
                                 Hero-v0 can be used
                            IceHockey-v0 can be used
                            Jamesbond-v0 can be used
                        JourneyEscape-v0 can be used
                             Kangaroo-v0 can be used
                                Krull-v0 can be used
                         KungFuMaster-v0 can be used
                     MontezumaRevenge-v0 can be used
                             MsPacman-v0 can be used
                         NameThisGame-v0 can be used
                              Phoenix-v0 can be used
                              Pitfall-v0 can be used
                                 Pong-v0 can be used
                               Pooyan-v0 can be used
                           PrivateEye-v0 can be used
                                Qbert-v0 can be used
                            Riverraid-v0 can be used
                           RoadRunner-v0 can be used
                             Robotank-v0 can be used
                             Seaquest-v0 can be used
                               Skiing-v0 can be used
                              Solaris-v0 can be used
                        SpaceInvaders-v0 can be used
                           StarGunner-v0 can be used
                               Tennis-v0 can be used
                            TimePilot-v0 can be used
                            Tutankham-v0 can be used
                              UpNDown-v0 can be used
                              Venture-v0 can be used
                         VideoPinball-v0 can be used
                          WizardOfWor-v0 can be used
                          YarsRevenge-v0 can be used
                               Zaxxon-v0 can be used
                            CubeCrash-v0 can be used
                       MemorizeDigits-v0 can be used

3. Render one of the environments

from time import sleep
name = 'Breakout-v0'
env = gym.make(name)
env.reset()
env.render()
sleep(1)
env.close()

4. Lets play one of the games with random inputs

One of the games played with random input results in a set of steps. These are shown below. The state space is an image for normal games, and some form of concatinated input for games that have -ram ni their name. These games typically done need CNN's to run.

names = 'Breakout-v0, Breakout-v4, BreakoutDeterministic-v0, BreakoutDeterministic-v4, BreakoutNoFrameskip-v0, BreakoutNoFrameskip-v4, Breakout-ram-v0, Breakout-ram-v4, Breakout-ramDeterministic-v0, Breakout-ramDeterministic-v4, Breakout-ramNoFrameskip-v0, Breakout-ramNoFrameskip-v4'
names = names.split(', ')

for name in names:
    env = gym.make(name)
    env.reset()

    print(f'Example Action Space: ')
    print(f'+-------------------------------------- ')
    print(f'| [{name}] : {env.action_space}')
    print(f'+-------------------- ')


    for _ in range(10):
        env.render()
        observation, reward, done, info = env.step(env.action_space.sample())
        print(f'observation: {observation.shape}, reward: {reward}')
        if done:
            print('='*30)
            break

    env.close()
Example Action Space: 
+-------------------------------------- 
| [Breakout-v0] : Discrete(4)
+-------------------- 
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [Breakout-v4] : Discrete(4)
+-------------------- 
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [BreakoutDeterministic-v0] : Discrete(4)
+-------------------- 
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [BreakoutDeterministic-v4] : Discrete(4)
+-------------------- 
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [BreakoutNoFrameskip-v0] : Discrete(4)
+-------------------- 
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [BreakoutNoFrameskip-v4] : Discrete(4)
+-------------------- 
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
observation: (210, 160, 3), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [Breakout-ram-v0] : Discrete(4)
+-------------------- 
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [Breakout-ram-v4] : Discrete(4)
+-------------------- 
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [Breakout-ramDeterministic-v0] : Discrete(4)
+-------------------- 
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [Breakout-ramDeterministic-v4] : Discrete(4)
+-------------------- 
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [Breakout-ramNoFrameskip-v0] : Discrete(4)
+-------------------- 
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
Example Action Space: 
+-------------------------------------- 
| [Breakout-ramNoFrameskip-v4] : Discrete(4)
+-------------------- 
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0
observation: (128,), reward: 0.0

References:

  1. OpenAI Gym: https://gym.openai.com/docs/