
cam-feed-ai-classifier

A real-time AI-powered camera feed classifier that uses YOLOv8 to detect and classify whether people are actively working at their desks. Supports both MacBook built-in cameras and iPhone cameras via Continuity Camera.


🚀 Features

  • Real-time AI Classification: Live desk work activity detection using YOLOv8
  • Multi-Camera Support: Works with MacBook cameras and iPhone cameras
  • Person & Equipment Detection: Identifies people, laptops, keyboards, mice, and phones
  • Pose Analysis: Advanced body posture analysis for accurate work state detection
  • Live Statistics: Real-time counts of working vs. idle people
  • High Performance: Optimized frame processing with adjustable quality modes

📊 Classifications

  • 🟢 Working: Person at desk with active working posture
  • 🟡 At Desk: Person at desk but not actively engaged
  • 🟠 At Desk (Idle): Person at desk with minimal activity
  • 🔴 Away from Desk: Person not positioned at workspace
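The four statuses above boil down to a threshold ladder over desk presence and an engagement score. A minimal sketch, assuming hypothetical thresholds (the actual cutoffs live in main.py and may differ):

```python
# Hypothetical sketch: mapping desk presence and an engagement score
# (0.0-1.0) to the four status labels. Thresholds are illustrative.
def classify_status(at_desk: bool, engagement_score: float) -> str:
    if not at_desk:
        return "Away from Desk"
    if engagement_score > 0.5:
        return "Working"
    if engagement_score > 0.2:
        return "At Desk"
    return "At Desk (Idle)"
```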

πŸ› οΈ Installation

Prerequisites

  • Python 3.8+
  • macOS (tested on macOS 13+)
  • Camera access permissions

Setup

  1. Clone the repository

    git clone https://github.com/yourusername/cam-feed-ai-classifier.git
    cd cam-feed-ai-classifier
  2. Install dependencies

    # Using pipenv (recommended)
    pipenv install
    
    # Or using pip
    pip install -r requirements.txt
  3. Grant camera permissions

    • System Settings → Privacy & Security → Camera (System Preferences → Security & Privacy on older macOS)
    • Enable access for Terminal/IDE

🎯 Quick Start

MacBook Camera

pipenv shell
python main.py

iPhone Camera Setup

Method 1: Continuity Camera (macOS 13+)

  1. iPhone: Settings → General → AirPlay & Handoff → Continuity Camera ✓
  2. Mac: System Settings → General → AirPlay & Handoff → iPhone Widgets ✓
  3. Keep iPhone unlocked and nearby

Method 2: Third-party Apps

Use a third-party virtual-camera app to expose the iPhone as a webcam, then run:

pipenv run python main.py

⌨️ Controls

Key   Action
q     Quit application
s     Save current frame
t     Test camera connection
f     Toggle full processing mode
c     Show available cameras
r     Reset/restart detection
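Controls like these are typically dispatched from the OpenCV display loop via a key-to-handler table. A sketch under that assumption (the handler names are illustrative, not main.py's actual API):

```python
# Hypothetical key-dispatch table for the bindings above; handler names
# are stand-ins, not the real main.py methods.
def make_dispatch(app):
    return {
        ord("q"): app.quit,
        ord("s"): app.save_frame,
        ord("t"): app.test_camera,
        ord("f"): app.toggle_full_processing,
        ord("c"): app.list_cameras,
        ord("r"): app.reset_detection,
    }

# Inside the loop:
#   key = cv2.waitKey(1) & 0xFF
#   handler = make_dispatch(app).get(key)
#   if handler:
#       handler()
```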

πŸ“ Project Structure

cam-feed-ai-classifier/
├── LICENSE                   # Project license
├── Pipfile                   # Pipenv dependencies
├── Pipfile.lock              # Locked dependency versions
├── main.py                   # Main application
├── pyproject.toml            # Project configuration
├── .gitignore                # Git ignore rules
├── .pre-commit-config.yaml   # Pre-commit hooks
└── README.md                 # This file

🔧 Configuration

Camera Detection

The system automatically detects available cameras:

  • Index 0: MacBook built-in camera (1280x720)
  • Index 1: iPhone/external camera (1920x1080+)
  • Index 2+: Additional cameras

Adjusting Sensitivity

Edit confidence thresholds in main.py:

# Object detection confidence
obj_results = self.yolo_model.predict(frame, conf=0.3)

# Working pose threshold
is_working = working_score > 0.5  # Adjust 0.1-0.9

Performance Tuning

# Frame processing (every Nth frame)
skip_frames = 2  # Process every 3rd frame

# Camera resolution
self.cap.set(cv2.CAP_PROP_FRAME_WIDTH, 1920)   # Higher = better quality
self.cap.set(cv2.CAP_PROP_FRAME_HEIGHT, 1080)  # Higher = slower processing
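The frame-skipping setting above follows a simple counter pattern: with skip_frames = 2, inference runs on every 3rd frame and the previous result is reused in between. A minimal sketch of that logic:

```python
# Sketch of the frame-skipping pattern: with skip_frames = 2, only every
# 3rd frame (indices 0, 3, 6, ...) goes through the AI pipeline.
def should_process(frame_index: int, skip_frames: int = 2) -> bool:
    return frame_index % (skip_frames + 1) == 0
```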

🧠 How It Works

AI Pipeline

  1. Object Detection (YOLOv8): Detects persons and work equipment
  2. Spatial Analysis: Determines desk positioning relationships
  3. Pose Estimation (YOLOv8-pose): Analyzes body keypoints
  4. Classification Logic: Combines spatial + pose data for final status
  5. Real-time Display: Live visualization with statistics
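The five stages above can be sketched as a per-frame function. This is an illustrative skeleton, assuming injected callables (detect, estimate_pose, score_pose) as stand-ins for the YOLOv8 models and scoring logic in main.py, not its real API:

```python
# Illustrative pipeline skeleton; detect, estimate_pose, and score_pose
# are hypothetical stand-ins for the models and scoring in main.py.
def overlaps(a, b, margin=50):
    """True if two (x1, y1, x2, y2) boxes intersect within a pixel margin."""
    return not (a[2] + margin < b[0] or b[2] + margin < a[0]
                or a[3] + margin < b[1] or b[3] + margin < a[1])

def classify_frame(frame, detect, estimate_pose, score_pose):
    people, equipment = detect(frame)                          # 1. object detection
    statuses = []
    for person in people:
        at_desk = any(overlaps(person, e) for e in equipment)  # 2. spatial analysis
        keypoints = estimate_pose(frame, person)               # 3. pose estimation
        working = at_desk and score_pose(keypoints) > 0.5      # 4. classification
        statuses.append("Working" if working
                        else "At Desk" if at_desk else "Away from Desk")
    return statuses                                            # 5. drawn on the display
```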

Detection Logic

  • Proximity Check: Person within range of work equipment
  • Posture Analysis: Sitting position, shoulder alignment, head orientation
  • Hand Position: Typing posture detection via wrist/elbow keypoints
  • Engagement Score: Combined confidence metric (0.0-1.0)
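One way the signals above could be folded into the 0.0-1.0 engagement score is a clamped weighted sum. A hedged sketch with assumed weights (main.py's actual features and weights may differ):

```python
# Hypothetical weighting of proximity, posture, and hand-position
# confidences (each 0.0-1.0) into one engagement score; the weights
# are illustrative, not main.py's actual values.
def engagement_score(proximity: float, posture: float, hands: float) -> float:
    score = 0.4 * proximity + 0.35 * posture + 0.25 * hands
    return max(0.0, min(1.0, score))
```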

πŸŽ›οΈ Output Display

The interface shows:

  • Live camera feed with bounding boxes
  • Person classifications with confidence scores
  • Equipment detection highlights
  • Real-time statistics (People | At Desk | Working)
  • Processing mode and timestamp
  • Camera resolution info

πŸ› Troubleshooting

Camera Issues

# Test camera availability
python -c "import cv2; print([cv2.VideoCapture(i).isOpened() for i in range(5)])"

Common Solutions:

  • Close other camera apps (Zoom, Teams, etc.)
  • Restart Terminal/IDE after granting permissions
  • Try different camera indices (0, 1, 2)
  • For iPhone: ensure same Apple ID, WiFi/Bluetooth enabled

Performance Issues

  • Use MacBook camera for better performance
  • Reduce resolution in camera settings
  • Increase frame skipping (skip_frames = 3)
  • Toggle full processing mode with f key

Detection Quality

  • Ensure good lighting conditions
  • Position camera to capture full desk area
  • Use iPhone camera for higher resolution
  • Adjust confidence thresholds

🔬 Technical Details

Dependencies

  • ultralytics: YOLOv8 models for detection and pose estimation
  • opencv-python: Computer vision and camera handling
  • numpy: Numerical computations for pose analysis

Models

  • yolov8n.pt: Lightweight object detection (~6MB)
  • yolov8n-pose.pt: Human pose estimation (~6MB)

Models are automatically downloaded on first run.

COCO Classes Used

  • Class 0: Person
  • Class 56: Chair
  • Class 63: Laptop
  • Class 64: Mouse
  • Class 66: Keyboard
  • Class 67: Cell phone
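These class IDs are what a filter over raw YOLOv8 detections would key on. A minimal sketch of that filter (main.py may organize this differently):

```python
# The COCO class IDs listed above, used to keep only desk-relevant
# detections; a sketch, not main.py's actual structure.
DESK_CLASSES = {0: "person", 56: "chair", 63: "laptop",
                64: "mouse", 66: "keyboard", 67: "cell phone"}

def filter_desk_objects(detections):
    """Keep (label, confidence, box) for relevant (class_id, conf, box) tuples."""
    return [(DESK_CLASSES[c], conf, box)
            for c, conf, box in detections if c in DESK_CLASSES]
```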

📊 Performance

Typical Performance:

  • MacBook Camera: ~15-20 FPS (720p)
  • iPhone Camera: ~10-15 FPS (1080p)
  • Detection Accuracy: ~85-90% in good lighting

System Requirements:

  • RAM: 4GB+ recommended
  • CPU: Modern Intel/Apple Silicon
  • Storage: 200MB for models and dependencies

🤝 Contributing

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/amazing-feature)
  3. Commit changes (git commit -m 'Add amazing feature')
  4. Push to branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

Please run pre-commit hooks before submitting:

pre-commit run --all-files

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

πŸ™ Acknowledgments

  • Ultralytics for YOLOv8 models
  • OpenCV for computer vision capabilities
  • COCO dataset for training data

⚠️ Privacy Notice

This tool processes camera feeds locally. Please respect privacy laws and obtain consent when monitoring individuals in workplace environments.


Made with ❤️ at Datum Brain for productivity insights.
