fleet-commander/train.py at 32477def6a1bf89e9e8cb02813bdf82f257a73ac - fleet-commander - csd4ni3l's Git

csd4ni3l/fleet-commander

mirror of https://github.com/csd4ni3l/fleet-commander.git synced 2026-01-01 04:23:47 +01:00


csd4ni3l 32477def6a add RL training which doesn't work that well yet, and start to make UI for model training

2025-11-15 15:56:56 +01:00

20 lines

380 B

Python


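from stable_baselines3 import PPO

from utils.ml import SpaceInvadersEnv

# Project-specific environment from the repository's utils.ml module.
env = SpaceInvadersEnv()

# PPO agent with a fully connected (MLP) policy network.
model = PPO(
    "MlpPolicy",
    env,
    n_steps=2048,        # environment steps collected per rollout
    batch_size=64,       # minibatch size for each gradient step
    n_epochs=10,         # optimization passes over each rollout
    learning_rate=3e-4,
    verbose=1,           # print training progress
    device="cpu",
    gamma=0.99,          # discount factor
    ent_coef=0.02,       # entropy bonus weight (encourages exploration)
    clip_range=0.2,      # PPO clipping parameter
    gae_lambda=0.95,     # GAE bias/variance trade-off
)

# Train for 1,000,000 environment steps, then save the agent.
model.learn(total_timesteps=1_000_000)
model.save("invader_agent")  # written as invader_agent.zip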
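To get a feel for what these hyperparameters imply, the following is a rough sketch of the training budget arithmetic. It assumes a single, non-vectorized environment (`n_envs=1`), which is what happens when a plain environment instance is passed to `PPO`; the helper name `ppo_update_budget` is hypothetical and not part of the repository or of Stable-Baselines3.

```python
import math


def ppo_update_budget(total_timesteps, n_steps, batch_size, n_epochs, n_envs=1):
    """Estimate rollouts and gradient steps for a PPO run (illustrative helper)."""
    # Each rollout collects n_steps transitions from every environment.
    rollout_size = n_steps * n_envs
    # Training collects whole rollouts until the step budget is reached,
    # so the final rollout may overshoot total_timesteps.
    rollouts = math.ceil(total_timesteps / rollout_size)
    # Each epoch splits the rollout buffer into minibatches.
    minibatches_per_epoch = math.ceil(rollout_size / batch_size)
    updates_per_rollout = minibatches_per_epoch * n_epochs
    return {
        "rollouts": rollouts,
        "timesteps_collected": rollouts * rollout_size,
        "minibatches_per_epoch": minibatches_per_epoch,
        "gradient_steps": rollouts * updates_per_rollout,
    }


# Values taken from the script above.
budget = ppo_update_budget(
    total_timesteps=1_000_000, n_steps=2048, batch_size=64, n_epochs=10
)
print(budget)
```

Under these assumptions the run collects 489 rollouts (1,001,472 steps, slightly above the requested 1,000,000), with 32 minibatches per epoch and 156,480 gradient steps in total.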