T2TL

Exploiting Transformer Sparse Reward in Reinforcement Learning for Interpretable Temporal Logic Motion Planning

If any questions, feel free to contact: zcharlie0257@gmail.com.

Installation instructions

We recommend using Python 3.7 or Python 3.8 to run this code.

pip install -r requirements.txt or conda env create -f conda_env.yml (Note that .yml is only effective for Python 3.7)
Install Spot-2.9:
- For Python 3.7, follow the installation instructions at the link. Spot should be installed in /usr/local/lib/python3.7/site-packages/spot. This step usually takes around 20 mins.
- For Python 3.8+, conda install -c conda-forge spot (Note that this command needs Python is 3.8+)
To train the agent in ZoneEnv, you will need Mujoco installed, as well as an active license.
- pip install mujoco-py==2.0.2.9
- pip install -e src/envs/safety/safety-gym/
- pip install numpy==1.21.5, if Python 3.7 or pip install numpy==1.23, if Python 3.8
Install dgl:
- If you only reproduce the results from Transformer-encoded method, just pip install dgl-cu113 -f https://data.dgl.ai/wheels/repo.html
- If you also want to reproduce the results from GNN-encoded method, we found the dgl team has not maintained dgl_cu111==0.7a210520. You can install it as follows:
  - download the dgl file from google drive
  - unzip dgl.zip and unzip dgl_cu111-0.7a210520.dist-info.zip
  - mv -f dgl/ /home/{your_workspace_name}/anaconda3/envs/T2TL/lib/python3.7/site-packages/
  - mv -f dgl_cu111-0.7a210520.dist-info/ /home/{your_workspace_name}/anaconda3/envs/T2TL/lib/python3.7/site-packages/

Examples

cd dis_src/
To train the agent with GNN method, run:
- python T1TL.py --gnn RGCN_8x32_ROOT_SHARED
To train the agent with pretrained GNN method, run:
- python train_PreGNNAgent.py
To train the agent with T1TL method, run:
- python T1TL.py
To train the agent with pretrained T1TL method, run:
- python T1TL_pretrain.py
To train the agent with T2TL method, run:
- python T2TL.py
To train the agent with pretrained T2TL method, run:
- python T2TL_pretrain..py

WaterWorld

ZoneEnv

Dimension Comparison

Interpretability via Attention

Citation

@ARTICLE{10167731,
  author={Zhang, Hao and Wang, Hao and Kan, Zhen},
  journal={IEEE Robotics and Automation Letters}, 
  title={Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning}, 
  year={2023},
  volume={8},
  number={8},
  pages={4831-4838},
  doi={10.1109/LRA.2023.3290511}}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

T2TL

Installation instructions

Examples

WaterWorld

ZoneEnv

Dimension Comparison

Interpretability via Attention

Citation

Files

README.md

Latest commit

History

README.md

File metadata and controls

T2TL

Installation instructions

Examples

WaterWorld

ZoneEnv

Dimension Comparison

Interpretability via Attention

Citation