PII Detection System

A modern web application for detecting Personally Identifiable Information (PII) in CSV files using AWS Bedrock's Titan model.

Architecture

This application uses a clean separation between backend and frontend:

Backend: Python/FastAPI server (run with uvicorn) with AWS Bedrock integration
Frontend: React/Next.js with shadcn/ui components
AI Model: Amazon Titan Text Express v1 via AWS Bedrock for PII classification
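To illustrate how a column might be handed to the model, here is a minimal sketch that builds a request body in the Titan text-generation format and reads the generated text back out of a response. The prompt wording, the function names `build_titan_request` and `extract_output_text`, and the generation settings are assumptions for illustration; the project's actual logic lives in `pii_detector.py` and may differ.

```python
from __future__ import annotations

import json

# Bedrock model ID for Amazon Titan Text Express v1.
MODEL_ID = "amazon.titan-text-express-v1"


def build_titan_request(column_name: str, samples: list[str], max_tokens: int = 256) -> str:
    """Build a JSON request body in the Titan text-generation format.

    The prompt wording is a made-up example, not the project's prompt.
    """
    prompt = (
        "Decide whether the following CSV column contains PII.\n"
        f"Column name: {column_name}\n"
        f"Sample values: {', '.join(samples)}\n"
        "Answer with YES or NO and a one-sentence reason."
    )
    body = {
        "inputText": prompt,
        "textGenerationConfig": {
            "maxTokenCount": max_tokens,
            "temperature": 0.0,  # assumed: deterministic output for classification
            "topP": 1.0,
        },
    }
    return json.dumps(body)


def extract_output_text(response_body: str | bytes) -> str:
    """Pull the generated text out of a Titan text response body."""
    payload = json.loads(response_body)
    return payload["results"][0]["outputText"].strip()


# With boto3 installed and credentials configured, the body could be sent as:
#   client = boto3.client("bedrock-runtime")
#   response = client.invoke_model(modelId=MODEL_ID, body=build_titan_request(...))
#   text = extract_output_text(response["body"].read())
```

Keeping the request building and response parsing separate from the network call makes both easy to unit test without AWS access.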

Project Structure

PII_detection/
├── backend/
│   ├── app/
│   │   ├── __init__.py
│   │   ├── main.py              # FastAPI application
│   │   ├── models.py            # Pydantic data models
│   │   └── pii_detector.py      # Core PII detection logic
│   ├── data/                    # Test CSV files
│   ├── requirements.txt         # Python dependencies
│   ├── tests/                   # Backend tests
│   └── venv/                    # Virtual environment
├── frontend/
│   ├── src/
│   │   ├── app/                 # Next.js app directory
│   │   ├── components/          # React components
│   │   └── lib/                 # Utility functions
│   ├── package.json             # Node.js dependencies
│   └── tailwind.config.ts       # Tailwind CSS config
└── CLAUDE.md                    # Development instructions

Setup & Installation

Backend Setup

Navigate to backend directory:
```
cd backend
```

Create and activate virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Configure AWS credentials (choose one method):

Option A: AWS CLI (Recommended)

aws configure

Option B: Environment Variables

export AWS_ACCESS_KEY_ID=your_access_key
export AWS_SECRET_ACCESS_KEY=your_secret_key
export AWS_REGION=us-east-1

Option C: AWS Profile

export AWS_PROFILE=your-profile-name
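As a quick sanity check before starting the server, a small script along these lines can report which of the environment-based options appear to be set. The function name `detect_credential_options` is hypothetical. An empty result does not mean credentials are missing, since Option A writes to files under `~/.aws/` rather than to the environment.

```python
from __future__ import annotations

import os
from typing import Mapping, Optional


def detect_credential_options(env: Optional[Mapping[str, str]] = None) -> list[str]:
    """Return the environment-based credential options that look configured.

    Only inspects environment variables; it does not read ~/.aws/ files
    and does not verify that the credentials are valid.
    """
    env = os.environ if env is None else env
    found = []
    if env.get("AWS_ACCESS_KEY_ID") and env.get("AWS_SECRET_ACCESS_KEY"):
        found.append("environment variables")
    if env.get("AWS_PROFILE"):
        found.append("profile")
    return found


if __name__ == "__main__":
    options = detect_credential_options()
    print("Detected:", ", ".join(options) if options else "none in environment")
```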

Start the backend server:

uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

Frontend Setup

Navigate to frontend directory:
```
cd frontend
```
Install dependencies:
```
npm install
```
Start the development server:
```
npm run dev
```

Usage

Start both servers:
- Backend: http://localhost:8000
- Frontend: http://localhost:3000
Access the web application at http://localhost:3000
Upload a CSV file using the drag-and-drop interface
View results showing PII classification for each column with:
- PII detection status
- Confidence scores
- Sample values
- AI reasoning
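The per-column result described above could be modelled roughly as follows. This is a sketch only: the class name `ColumnResult`, its field names, the 0.0 to 1.0 confidence range, and the `needs_review` helper are all assumptions, and the project's real models are the Pydantic classes in `models.py`.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ColumnResult:
    """Hypothetical shape of one column's analysis result."""

    column_name: str
    is_pii: bool                      # PII detection status
    confidence: float                 # assumed range: 0.0 to 1.0
    sample_values: list[str] = field(default_factory=list)
    reasoning: str = ""               # the model's explanation


def needs_review(results: list[ColumnResult], threshold: float = 0.8) -> list[ColumnResult]:
    """Return results whose confidence falls below the threshold."""
    return [r for r in results if r.confidence < threshold]
```

A helper like `needs_review` is one way to surface uncertain classifications for a human to check before acting on them.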

API Endpoints

Backend API (Port 8000)

GET / - Health check
GET /health - Detailed health status
POST /analyze-csv - Upload and analyze CSV file
POST /analyze-column - Analyze individual column

Example API Usage

# Upload CSV for analysis
curl -X POST "http://localhost:8000/analyze-csv" \
  -H "Content-Type: multipart/form-data" \
  -F "file=@your-file.csv"
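The same upload can be sketched in Python using only the standard library. This assumes the backend is listening on port 8000 (the port given for the Backend API) and that the form field is named `file`, as in the curl command. The helper names `build_multipart` and `build_upload_request` are hypothetical, and the response format is not documented here, so the sketch stops at building the request.

```python
from __future__ import annotations

import urllib.request
import uuid


def build_multipart(field_name: str, filename: str, content: bytes) -> tuple[bytes, str]:
    """Encode one file as a multipart/form-data body.

    Returns the body bytes and the matching Content-Type header value.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{filename}"\r\n'
        "Content-Type: text/csv\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + content + tail, f"multipart/form-data; boundary={boundary}"


def build_upload_request(base_url: str, filename: str, content: bytes) -> urllib.request.Request:
    """Build a POST request for the /analyze-csv endpoint."""
    body, content_type = build_multipart("file", filename, content)
    return urllib.request.Request(
        f"{base_url}/analyze-csv",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )


# To send it (requires the backend to be running):
#   with open("your-file.csv", "rb") as fh:
#       request = build_upload_request("http://localhost:8000", "your-file.csv", fh.read())
#   with urllib.request.urlopen(request) as response:
#       print(response.read().decode("utf-8"))
```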

AWS Requirements

AWS Account with Bedrock access
IAM permissions for Bedrock model invocation
Bedrock model access to Amazon Titan Text Express v1 (amazon.titan-text-express-v1)
Regional availability: Ensure Bedrock is available in your selected region

Security Considerations

This is a defensive security tool designed to help identify PII in datasets. Key security practices:

Never commit AWS credentials to version control
Use IAM roles with minimal required permissions
Process sensitive data in secure environments
Review detected PII classifications before taking action

Development

Backend Development

cd backend
source venv/bin/activate
uvicorn app.main:app --reload

Frontend Development

cd frontend
npm run dev

Adding New Features

Backend: Add new endpoints in app/main.py
Frontend: Create new components in src/components/
UI Components: Use shadcn/ui for consistent styling

Testing

Backend Tests

cd backend
python -m pytest tests/

Frontend Tests

cd frontend
npm test

Deployment

Backend Deployment

Deploy as Docker container or serverless function
Ensure AWS credentials are configured in production environment

Frontend Deployment

Build: npm run build
Deploy to Vercel, Netlify, or static hosting

Troubleshooting

Common Issues

AWS Authentication Error: Verify AWS credentials and region
CORS Issues: Ensure backend allows frontend origin
Port Conflicts: Change ports in configuration if needed
Model Access: Verify Bedrock model permissions in AWS console

Backend Logs

Check FastAPI logs for detailed error information:

uvicorn app.main:app --log-level debug

Frontend Logs

Check browser console for frontend errors and network issues.
