Merge pull request #3 from Microgorath/python-3-11

Microgorath · web-flow · commit e56111844a38 · 2024-07-29T20:13:09.000-04:00
Improve config support, tune default config hyperparameters
diff --git a/.devcontainer/devcontainer.json b/.devcontainer/devcontainer.json
@@ -2,17 +2,17 @@
 // README at: https://github.com/devcontainers/templates/tree/main/src/docker-existing-dockerfile
 {
 	"name": "poke-rl",
-	"image": "tensorflow/tensorflow:2.13.0-gpu-jupyter",
+	"image": "tensorflow/tensorflow:2.15.0-gpu-jupyter",
 
-	"runArgs": ["--gpus=all"
+	"runArgs": ["--gpus=all",
+        "--shm-size=50gb"
     ],
 
 	// Features to add to the dev container. More info: https://containers.dev/features.
 	// "features": {},
 
 	// Use 'forwardPorts' to make a list of ports inside the container available locally.
-	// TODO: Add pokemon showdown local server installation and setup. Showdown uses port 8000 by default.
-	// Set Jupyter notebook to use port 8888, if necessary. 
+	// Showdown uses port 8000 by default.
 	"forwardPorts": [8000],
 
 	// Uncomment the next line to run commands after the container is created.
diff --git a/README.md b/README.md
@@ -2,25 +2,25 @@
 
 ### Requirements Overview
 This notebook uses RLLib, an open-source scalable reinforcement learning library in the Ray framework.  
-RLLib currently only supports Python 3.8.  
-RLLib supports both PyTorch and Tensorflow, so either may be used. Preferably, the library used will be CUDA-enabled to utilize the GPU, but it is optional. Nvidia GPU support only.  
+RLLib currently supports Python 3.9 - 3.12.  
+RLLib supports both PyTorch and Tensorflow, so either may be used. This setup assumes a GPU will be used, but one is not necessary for most algorithms. For DQN, training with a GPU was found to be slightly slower than training on CPU only. GPU use is most likely only worthwhile for large models whose inference or backprop takes longer.  
 
 ### Tensorflow GPU Support
-A dev container is provided that will set up a Linux Tensorflow 2.13.0-gpu-jupyter Docker container with everything set up for Tensorflow GPU support, which also starts its own local pokemon showdown server when started. The showdown server is port forwarded to be visible on the host, at http://localhost:8000.  
+A dev container is provided that sets up a Linux Tensorflow 2.15.0-gpu-jupyter Docker container configured for Tensorflow GPU support. The container also launches its own local pokemon showdown server on startup. The showdown server port is forwarded so that it is reachable from the host at http://localhost:8000.  
 Requires Docker Desktop, with Nvidia Container Toolkit set up.  
 
 If on Windows, also requires WSL2. Follow [this guide](https://gdevakumar.medium.com/setup-windows-10-11-machines-for-deep-learning-with-docker-and-gpu-using-wsl-9349f0224971) to set up Docker Desktop with WSL2 and Nvidia Container Toolkit. The CUDA toolkit version installed on the local WSL2 does not matter, as the Docker image installs its own CUDA Toolkit and cuDNN automatically.  
-Currently, Tensorflow 2.13 is the most recent version supported by RLLib, which requires specifically CUDA Toolkit 11.8 and cuDNN 8.6. As of Tensorflow 2.11, using GPU on Windows is not supported, thus why WSL2 is required.  
+As of Tensorflow 2.11, GPU use on native Windows is not supported, which is why WSL2 is required.  
 
-The resulting container is 7.7 GB.  
+The resulting container is 7.3 GB.  
 
 ### PyTorch GPU Support
-PyTorch works just fine, without needing WSL2 or Docker. Training time is about the same as Tensorflow, but has very limited Tensorboard support. Set up PyTorch with GPU support however you would normally on a new Conda environment. Here I will be using CUDA 12.1, but any version that works with Python 3.8 should be fine.  
+PyTorch works just fine, without needing WSL2 or Docker. Training time is about the same as with Tensorflow, but PyTorch has very limited Tensorboard support. Set up PyTorch with GPU support however you normally would in a new Conda environment.
 ```
-conda create -n poke-rl-torch python=3.8  
+conda create -n poke-rl-torch python=3.11  
 conda activate poke-rl-torch  
-pip3 install torch --index-url https://download.pytorch.org/whl/cu121  
-pip install -r requirements.txt  
+pip3 install --user torch --index-url https://download.pytorch.org/whl/cu121  
+pip3 install --user -r requirements.txt  
 ```
 Once installed, in basic_rl.ipynb be sure to change ```"tf2"``` to ```"torch"``` in the line: 
 ```python 
diff --git a/notebooks/basic_rl.ipynb b/notebooks/basic_rl.ipynb