Skip to content

Syed8558/IT-helpdesk-ticket-classifier-rnn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

20 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿง  IT Helpdesk Ticket Classification using RNN

This project implements a Recurrent Neural Network (RNN) to automatically classify IT helpdesk support tickets based on their textual descriptions. It simulates a real-world enterprise IT support system where incoming tickets are automatically routed to the correct support team.

๐Ÿ“Œ Business Problem

Large IT teams receive thousands of support tickets every day. Manually reading and assigning tickets causes:

Delays

SLA violations

Increased workload

This system predicts the issue category of a ticket so it can be routed instantly.

Example categories:

Hardware Issue

Software Issue

Network Problem

Password Reset

Access Request

System Error

Security Incident

๐Ÿš€ Solution

We use an RNN-based deep learning model trained on historical ticket descriptions to predict the category of a new ticket automatically.

The system:

Extracts text from PDF and JSON tickets

Builds a training dataset

Trains an RNN text classifier

Exposes predictions through an API

๐Ÿง  Machine Learning Pipeline PDF / JSON Tickets โ†“ Text Extraction โ†“ Text Cleaning & Tokenization โ†“ RNN Model Training โ†“ Model Evaluation โ†“ Prediction API

๐Ÿ“ Project Structure it-helpdesk-ticket-classifier-rnn/ โ”‚ โ”œโ”€โ”€ src/ โ”‚ โ”œโ”€โ”€ build_dataset.py # Extracts text from PDF & JSON files โ”‚ โ”œโ”€โ”€ preprocess.py # Text cleaning & tokenization โ”‚ โ”œโ”€โ”€ model.py # RNN architecture โ”‚ โ”œโ”€โ”€ train.py # Model training โ”‚ โ””โ”€โ”€ evaluate.py # Performance evaluation โ”‚ โ”œโ”€โ”€ api/ โ”‚ โ””โ”€โ”€ app.py # REST API for predictions โ”‚ โ”œโ”€โ”€ data/ โ”‚ โ”œโ”€โ”€ .gitkeep โ”‚ โ””โ”€โ”€ README.md # Dataset instructions โ”‚ โ”œโ”€โ”€ models/ # Trained models (not committed) โ”œโ”€โ”€ requirements.txt โ”œโ”€โ”€ .gitignore โ””โ”€โ”€ README.md

๐Ÿ“Š Dataset

The project uses IT helpdesk ticket data in PDF and JSON format.

Due to privacy and size constraints, the dataset is not included in this repository. Place your dataset files inside the data/ folder before running the project.

โš™๏ธ Installation pip install -r requirements.txt

โ–ถ๏ธ Build Dataset

Extract text from tickets and create training data:

python src/build_dataset.py

โ–ถ๏ธ Train the RNN Model python src/train.py

๐ŸŒ Run the Prediction API python api/app.py

API runs at:

http://127.0.0.1:8000

๐Ÿ”ฎ Sample API Request { "ticket_text": "User is unable to connect to WiFi network in office" }

Response:

{ "predicted_category": "Network_Problem" }

๐Ÿ›  Tech Stack

Python

TensorFlow (RNN)

Pandas, NumPy

NLTK, spaCy

PyPDF2

Flask / FastAPI

๐Ÿ’ผ Why This Project Matters

This project demonstrates:

Natural Language Processing

Deep Learning with RNN

Data extraction from real documents

End-to-end ML pipeline

Production-style API deployment

This is a real enterprise use case, not a toy example.

๐Ÿ‘จโ€๐Ÿ’ป Author

Syed Sadath G Data Scientist | NLP | Deep Learning

About

RNN-based NLP system that automatically classifies IT helpdesk support tickets into categories like hardware, software, network, and security issues. Built using TensorFlow, Python, and FastAPI to demonstrate real-world enterprise ticket routing and automation.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages