Robot Arm Plays an Arcade Game

Work in Progress

Project Overview

This repository contains the spring‑quarter achievements exploring how a robot arm can learn to play an arcade game (Pong) under delayed and sparse reward conditions, using reinforcement learning.

The core idea is to investigate algorithms that handle delayed feedback and sparse rewards in a physical setup. A custom interface delays state/action observations. The agent is trained end‑to‑end despite observation/action latency. The robot used is the Stretch3 from Hello Robotic.

Outcomes & Deliverables

A set of Jupyter notebooks demonstrating CNN‑based RL, computer‑vision control, and a VAE for state representation.
Latency‑logging and analysis scripts.
A robot control script to validate real‑world performance under induced delays.
A written spring report (spring-report/report.md) summarizing results and insights.

Prerequisites & Installation

Python ≥ 3.8

Clone the repo:

git clone git@github.com:BenBenyamin/ArcadeRobot.git
cd ArcadeRobot

Install dependencies:

pip install torch torchvision gymnasium stable-baselines3 opencv-python matplotlib pandas jupyterlab ale_py

How to Run

Jupyter Notebooks
- agent/CNN-approach/pong.ipynb & pong-wrapped.ipynb — CNN‑based RL experiments
- agent/cv-approach/cv-approach.ipynb — OpenCV control baseline
- agent/VAE/train_on_dataset.ipynb — VAE training & visualization
Latency Logging & Plotting
```
python utils/plot-latency.py
```
Robot Validation Script (Run on the Strech Robot)
```
python robot/check_lat_com.py
```
See logs/latency_log_*.csv for an output example.

Project Structure

├── agent                   # All agent development code and experiments
│   ├── CNN-approach        # PPO/CNN notebooks for Pong under delay
│   │   ├── pong.ipynb      # Raw environment CNN-based RL notebook
│   │   ├── pong-wrapped.ipynb  # Wrapped environment CNN-based RL notebook
│   │   └── wrapper         # Custom Gym wrappers for delay and observation transforms
│   ├── cv-approach         # OpenCV-based control proof-of-concept
│   │   ├── cv-approach.ipynb  # Notebook applying CV to detect paddle and ball
│   │   └── cv.py           # Script encapsulating CV frame processing logic
│   └── VAE                 # Variational Autoencoder for state representation
│       ├── loss.py         # VAE loss functions
│       ├── train_on_dataset.ipynb  # VAE *offline* training pipeline notebook
│       ├── vae.py          # VAE model implementation
│       └── vae-train.ipynb # VAE *online* training training pipeline notebook
├── logs                    # Latency measurement CSV logs
│   └── latency_log_1748472382.csv  # Example latency log file
├── robot                   # Robot arm communication and validation scripts
│   └── check_lat_com.py    # Tests round-trip communication latency
├── spring-report           # Spring report and associated figures
│   ├── figures             # Generated plots and demo video (see spring-report/repord.md for more context)
│   └── report.md           # Written summary of methodology, results, insights for the spring quarter
└── utils                   # Utility scripts for analysis and plotting
    └── plot-latency.py     # Parses CSV logs and generates latency plots

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
agent		agent
extras		extras
fall		fall
logs		logs
report		report
robot		robot
ros		ros
utils		utils
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Robot Arm Plays an Arcade Game

Project Overview

Outcomes & Deliverables

Prerequisites & Installation

How to Run

Project Structure

About

Uh oh!

Releases

Packages

Languages

BenBenyamin/ArcadeRobot

Folders and files

Latest commit

History

Repository files navigation

Robot Arm Plays an Arcade Game

Project Overview

Outcomes & Deliverables

Prerequisites & Installation

How to Run

Project Structure

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages