Tangled Program Graphs with Indexed Memory in Control Tasks with Short Time Dependencies

# Tangled Program Graphs with Indexed Memory in Control Tasks with Short Time Dependencies _by Tanya Djavaherpour (McMaster University) - 2025.01.07_ ###### tags: `VAADER` `Seminar` ![TPGbob](https://hackmd.io/_uploads/HJdq7xkV1g.jpg) ## Abstract This study addresses the challenges of shared temporal memory for evolutionary reinforcement learning agents in partially observable control tasks with short time dependencies. Tangled Program Graphs (TPG) is a genetic programming framework which has been widely studied in memory intensive tasks from video games, time series forecasting, and predictive control domains. In this study, we aim to improve external indexed memory usage in TPG by minimizing the impact of destructive agents during cultural transmission. We test various memory resetting strategies—per agent, per episode, and a no-memory control group—and evaluate their effectiveness in mitigating destructive effects while maintaining performance. Results from Acrobot, Pendulum, and CartPole tasks show that resetting memory more often can significantly boost TPG performance while preserving computational efficiency. These findings highlight the importance of memory management in Reinforcement Learning (RL) and suggest opportunities for further optimization for more complex visual RL environments, including adaptive memory resetting and evolved probabilistic memory operations.