A3C for Kung Fu in OpenAI Gym

This repository contains code for training an A3C agent to play Kung Fu MasterDeterministic-v0 environment in OpenAI Gym.

Key Features:

Implementation of the A3C (Asynchronous Advantage Actor-Critic) algorithm for multi-agent training.
Preprocessing pipeline for Kung Fu observations using the PreprocessAtari wrapper.
Environment batching for parallel interaction with multiple environments.
Evaluation of the trained agent on single episodes.
Video recording and visualization of the agent's gameplay.
Train the agent for 3000 episodes and periodically show the average agent reward during training.

Environment: https://gymnasium.farama.org/environments/atari/kung_fu_master/

video.mp4

Requirements:

Python 3 PyTorch NumPy OpenAI Gym tqdm

Additional Notes:

The script currently trains 10 agents in 10 parallel environments. You can modify these numbers in the number_environments and EnvBatch class. The reward scaling (batch_rewards *= 0.01) is optional and might need adjustment depending on your environment and training dynamics.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
ABK_A3C_for_Kung_Fu.ipynb		ABK_A3C_for_Kung_Fu.ipynb
README.md		README.md
abk_a3c_for_kung_fu.py		abk_a3c_for_kung_fu.py
video.mp4		video.mp4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A3C for Kung Fu in OpenAI Gym

Key Features:

Requirements:

Additional Notes:

About

Releases

Packages

Languages

afif95/A3C-for-KungFuMaster-Environment

Folders and files

Latest commit

History

Repository files navigation

A3C for Kung Fu in OpenAI Gym

Key Features:

Requirements:

Additional Notes:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages