Deep Reinforcement Learning Agent with Randomly Generated Mazes

Implemented a deep Q network using PyTorch to train an agent that is capable of tackling any random maze with a set of unknown obstacles and unknown goal locations.

Illusterative video:


The flashing yellow line represents the optimal path found so far, the agent has no clue about the environment before. After some trials, the agent was able to reach the goal and continues to choose its path.