# interpretative-interfaces

A Python backend for exploring GPT-2's internal representations through mechanistic interpretability. Built with TransformerLens and Flask, this project provides API endpoints that let a visual frontend inspect how a language model processes text, token by token, layer by layer.
## Features

This backend powers an interactive visualization tool that lets users:
- Tokenize text and see how GPT-2 breaks it into subword tokens
- Trace token embeddings through all 12 layers of GPT-2, reduced to 2D via PCA
- Inspect attention patterns — which tokens attend to which, at any layer and head
- Apply the logit lens — see what the model would predict at each intermediate layer, revealing how meaning builds up through the network
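To make the attention-pattern feature concrete: per head, an attention pattern is a softmax over scaled query-key dot products, with a causal mask so each token only attends to earlier positions (as in GPT-2). A minimal NumPy sketch with random stand-in data (the 64-dimensional head size matches GPT-2 small; everything else here is toy data, not the backend's actual code):

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(0)
seq_len, d_head = 6, 64                     # GPT-2 small uses 64-dim heads
Q = rng.standard_normal((seq_len, d_head))  # stand-in queries
K = rng.standard_normal((seq_len, d_head))  # stand-in keys

scores = Q @ K.T / np.sqrt(d_head)          # (seq_len, seq_len) scaled dot products
# Causal mask: position i may only attend to positions <= i.
mask = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
scores[mask] = -np.inf
pattern = softmax(scores, axis=-1)          # each row sums to 1
```

Each row of `pattern` is one token's attention distribution over the tokens before it, which is exactly the kind of matrix the visualization renders.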
## Project Structure

```
interpretative-interfaces/
├── app.py                # Flask server with all API endpoints
├── model_utils.py        # Core functions: tokenize, trace, attention, predict
├── requirements.txt      # Pinned Python dependencies
├── notebooks/
│   ├── tutorial-walkthrough.ipynb      # Annotated TransformerLens tutorial
│   ├── tokenization.ipynb              # Tokenization experiments
│   ├── embedding_extraction.ipynb      # Layer-by-layer embedding extraction
│   └── dimensionality_reduction.ipynb  # PCA/UMAP reduction + trajectory plots
├── examples/             # Saved JSON responses for frontend mock data
│   ├── example1_tokenize.json
│   ├── example1_trace.json
│   ├── example1_attention.json
│   ├── example1_predict.json
│   └── ...
└── API.md                # Full endpoint documentation
```
## Requirements

- Python 3.10+
- ~2 GB disk space for the GPT-2 model (downloaded on first run)
## Setup

```bash
git clone https://github.com/YOUR_USERNAME/interpretative-interfaces.git
cd interpretative-interfaces
python -m venv venv
source venv/bin/activate   # on Windows: venv\Scripts\activate
pip install -r requirements.txt
python app.py
```

The server starts on http://localhost:5001. The first run will download GPT-2 small (~500 MB).
## API Endpoints

### `POST /tokenize`

Break text into GPT-2 tokens.

```bash
curl -X POST http://localhost:5001/tokenize \
  -H "Content-Type: application/json" \
  -d '{"text": "The cat sat on the mat"}'
```

### `POST /trace`

Get the 2D trajectory of selected tokens through all 12 layers (PCA-reduced).
```bash
curl -X POST http://localhost:5001/trace \
  -H "Content-Type: application/json" \
  -d '{"text": "The cat sat on the mat", "token_indices": [1, 4]}'
```

### `POST /attention`

Get the attention matrix for a specific layer and head.
```bash
curl -X POST http://localhost:5001/attention \
  -H "Content-Type: application/json" \
  -d '{"text": "The cat sat on the mat", "layer": 5, "head": 3}'
```

### `POST /predict`

Apply the logit lens: see the top-5 predicted tokens at each layer for a given position.
```bash
curl -X POST http://localhost:5001/predict \
  -H "Content-Type: application/json" \
  -d '{"text": "The cat sat on the mat", "token_index": 5}'
```

See `API.md` for full request/response schemas.
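The logit-lens idea behind `/predict` can be sketched in a few lines: take the residual-stream vector at an intermediate layer, apply the model's final layer norm, multiply by the unembedding matrix, and read off the top-k tokens. The toy NumPy version below uses made-up sizes and random weights purely for shape illustration; the real backend would use GPT-2's actual `ln_final` parameters and `W_U` via TransformerLens:

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Simplified stand-in for GPT-2's ln_final (learned scale/bias omitted).
    return (x - x.mean()) / np.sqrt(x.var() + eps)

def logit_lens_topk(resid, W_U, k=5):
    """Project an intermediate residual-stream vector onto the vocabulary."""
    logits = layer_norm(resid) @ W_U       # (d_model,) @ (d_model, vocab) -> (vocab,)
    top = np.argsort(logits)[::-1][:k]     # indices of the k largest logits
    return top, logits[top]

rng = np.random.default_rng(0)
d_model, vocab = 16, 10                    # toy sizes; GPT-2 small is 768 x 50257
W_U = rng.standard_normal((d_model, vocab))
resid = rng.standard_normal(d_model)       # stand-in for the residual stream at some layer
tokens, scores = logit_lens_topk(resid, W_U)
```

Repeating this at every layer for one position yields the layer-by-layer prediction sequence the endpoint returns.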
## Tech Stack

- TransformerLens — hooks into GPT-2 internals (activations, attention patterns, residual stream)
- Flask — lightweight API server
- scikit-learn — PCA for dimensionality reduction
- NumPy — tensor/array manipulation
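The scikit-learn PCA step used by `/trace` might look roughly like this: collect one `d_model`-dimensional vector per layer for a token (13 points for GPT-2 small: the embedding plus 12 block outputs) and reduce them to 2D for plotting. A sketch with random stand-in activations rather than real model output:

```python
import numpy as np
from sklearn.decomposition import PCA

n_layers, d_model = 13, 768   # embedding + 12 blocks; 768 is GPT-2 small's width
rng = np.random.default_rng(0)
trajectory = rng.standard_normal((n_layers, d_model))  # stand-in per-layer activations

coords = PCA(n_components=2).fit_transform(trajectory)  # (13, 2) points to plot
```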
## Further Reading

- How-to Transformer Mechanistic Interpretability in 50 Lines
- Interpreting GPT: The Logit Lens
- TransformerLens Getting Started
## License

MIT