unity_python/Assets/StreamingAssets/python/pytorch_models/*.pth
- Execute Python from Unity
- TCP/IP between Unity and Python (a rough Python-side sketch follows this list)
- Deep Reinforcement Learning (DRL) without Unity ML-Agents
- Python version: 3.9.12 (64-bit)
- (Optional) Do not add Python 3.9 to PATH
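These notes do not show the socket code itself, so the following is only a minimal sketch of what the Python side of the Unity-to-Python TCP link could look like; the host, port, and newline-delimited CSV message format are assumptions, not the project's actual protocol.

# Hypothetical Python-side TCP server for Unity to connect to (sketch only).
import socket

HOST, PORT = "127.0.0.1", 50007  # assumed values; must match the Unity client

with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as server:
    server.bind((HOST, PORT))
    server.listen(1)
    conn, _ = server.accept()
    with conn:
        buffer = b""
        while True:
            data = conn.recv(1024)
            if not data:
                break
            buffer += data
            while b"\n" in buffer:
                line, buffer = buffer.split(b"\n", 1)
                state = [float(v) for v in line.decode().split(",")]
                action = 4  # placeholder: NOTHING; the trained agent would decide here
                conn.sendall(f"{action}\n".encode())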
C:\Users\[username]\AppData\Local\Programs\Python\Python39\python.exe
File > Open Folder
C:\Users\[username]\AppData\Local\Programs\Python\Python39\python.exe -m venv .venv
Assets/StreamingAssets/python/.venv/
The .venv must live under the StreamingAssets folder so that it is included in the build path and the editor and the built player share the same Python environment.
In my experience it runs fine without activating the venv, but for those who need activation:
set VIRTUAL_ENV=[Absolutepath]\.venv
which can be written relative to the current directory as:
set VIRTUAL_ENV=%cd%\..\..\.venv
Install numpy, just as a module import test.
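A quick way to confirm that the venv interpreter and its site-packages are actually the ones in use (the file name check_env.py is only for illustration):

# check_env.py - run with .\.venv\Scripts\python.exe check_env.py
import sys
import numpy as np

print(sys.executable)   # should point into ...\StreamingAssets\python\.venv\Scripts
print(np.__version__)   # confirms numpy comes from the venv's site-packages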
CUDA Toolkit 12.1 Downloads | NVIDIA Developer
The CUDA version must match the CUDA version of the PyTorch build (as specified on pytorch.org). In this project, I installed CUDA 12.1.
[venv dir]\.venv\Scripts\python.exe -m pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
or cd to the parent of .venv and run:
.\.venv\Scripts\python.exe -m pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
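After the install, a quick check that the cu121 build actually sees the GPU:

import torch

print(torch.__version__)          # should end with +cu121
print(torch.version.cuda)         # should report 12.1
print(torch.cuda.is_available())  # True if the driver and toolkit are set up correctly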
input:
ball position (x, y, z), ball speed (x, y, z), target position (x, y, z), current plate Euler angles (rx, rz), plate rotation speed (rad/s?)
3 + 3 + 3 + 2 + 1 = 12
PythonProcess.StandardInput.WriteLine($"{Vector3ToString(BallPosition)},{Vector3ToString(BallSpeed)},{PlateRX},{PlateRZ},{Vector3ToString(TargetPosition)},{PlateAngularSpeed}");

output:
5 actions (RX+, RX-, RZ+, RZ-, NOTHING)
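On the Python side, the line written above can be read from stdin and turned into a 12-dimensional state tensor. This is only a sketch: the field order follows the C# WriteLine call, and replying with a single action index per line is an assumption, not necessarily the project's exact format.

import sys
import torch

ACTIONS = ["RX+", "RX-", "RZ+", "RZ-", "NOTHING"]

def parse_state(line: str) -> torch.Tensor:
    # 3 (ball pos) + 3 (ball speed) + 2 (plate rx, rz) + 3 (target pos) + 1 (angular speed) = 12
    values = [float(v) for v in line.strip().split(",")]
    assert len(values) == 12
    return torch.tensor(values, dtype=torch.float32).unsqueeze(0)  # shape (1, 12)

for line in sys.stdin:
    state = parse_state(line)
    action_index = ACTIONS.index("NOTHING")  # placeholder; the real agent queries the DQN here
    print(action_index, flush=True)          # Unity reads this from PythonProcess.StandardOutput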
DQN:
import torch.nn as nn

class DQN(nn.Module):  ## v5.1
    def __init__(self, state_size, action_size):
        super(DQN, self).__init__()
        self.fc1 = nn.Linear(state_size, 64)
        self.fc2 = nn.Linear(64, 64)
        self.fc3 = nn.Linear(64, 64)
        self.relu = nn.ReLU()
        self.fc4 = nn.Linear(64, action_size)

    def forward(self, x):
        x = self.relu(self.fc1(x))
        x = self.relu(self.fc2(x))
        x = self.relu(self.fc3(x))
        return self.fc4(x)

- trained 1000 episodes
- 50~70%
So far, the linear distance reward system has performed the best.
Reward += (TargetThreshold - dist) / 10f;

- Add guide walls (negative reward on collision)
- multi agent
condition:
self.optimizer = optim.Adam(self.net.parameters(), lr=0.001)
self.batch_size = 64
self.gamma = 0.99
self.local_memory = ReplayMemory(10000)
self.eps_start = 0.99
self.eps_end = 0.05
self.eps_decay = 50
self.steps_done = 0
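The eps_start / eps_end / eps_decay / steps_done values above suggest the usual exponentially decaying epsilon-greedy schedule. The project's actual select_action code is not shown in these notes, so the following is only a sketch of the standard pattern, written as a free function (net is the DQN instance, action_size=5 matches the five actions):

import math
import random
import torch

def select_action(net, state, steps_done, action_size=5,
                  eps_start=0.99, eps_end=0.05, eps_decay=50):
    # Decay epsilon exponentially from eps_start toward eps_end as steps_done grows.
    eps_threshold = eps_end + (eps_start - eps_end) * math.exp(-steps_done / eps_decay)
    if random.random() > eps_threshold:
        with torch.no_grad():
            return int(net(state).argmax(dim=1).item())  # exploit: action with the best Q-value
    return random.randrange(action_size)                 # explore: random action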
DQN:
class DQN(nn.Module):  ## v6
    def __init__(self, state_size, action_size):
        super(DQN, self).__init__()
        self.fc1 = nn.Linear(state_size, 64)
        self.fc2 = nn.Linear(64, 64)
        self.fc3 = nn.Linear(64, 64)
        self.relu = nn.ReLU()
        self.fc4 = nn.Linear(64, action_size)

    def forward(self, x):
        x = self.relu(self.fc1(x))
        x = self.relu(self.fc2(x))
        x = self.relu(self.fc3(x))
        return self.fc4(x)

So far, the linear distance reward system has performed the best.
Reward += (TargetThreshold - dist) / 10f;
PlateRX = this.PlateRX / 10f,
PlateRZ = this.PlateRZ / 10f,
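For reference, the linear distance reward and the /10 scaling of the plate angles can be sketched in Python as below; TARGET_THRESHOLD is a stand-in value, the real constant lives in the Unity project.

import numpy as np

TARGET_THRESHOLD = 1.0  # assumed value, not the project's actual threshold

def step_reward(ball_pos, target_pos, threshold=TARGET_THRESHOLD):
    # Python equivalent of: Reward += (TargetThreshold - dist) / 10f;
    dist = float(np.linalg.norm(np.asarray(ball_pos) - np.asarray(target_pos)))
    return (threshold - dist) / 10.0

def scale_plate_angles(plate_rx, plate_rz):
    # matches PlateRX = this.PlateRX / 10f, PlateRZ = this.PlateRZ / 10f
    return plate_rx / 10.0, plate_rz / 10.0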


