PDF to Audio Conversion Pipeline

Overview

This repository provides a simple pipeline to convert PDF documents into audio files using Python. The process involves extracting text from the PDF, detecting the language of the extracted text, and converting it into an audio file (MP3 format) using Google Text-to-Speech (gTTS).

Key Features:

PDF Text Extraction: Uses PyPDF2 to extract text from PDF files.
Language Detection: Automatically detects the language of the text using the langdetect library.
Text-to-Audio Conversion: Converts text to audio using the gTTS (Google Text-to-Speech) API.
Supports both English and Arabic for text-to-speech conversion.

Requirements

To use this pipeline, you need the following Python libraries:

PyPDF2: For reading and extracting text from PDF files.
langdetect: For detecting the language of the extracted text.
gTTS: For converting text to speech in MP3 format.

Install Dependencies:

To install the necessary dependencies, run the following command:

pip install PyPDF2 langdetect gtts

How It Works

The notebook implements the following steps:

Extract Text from PDF: Read the PDF file and extract text using PyPDF2.
Detect Language: Identify the language of the extracted text using the langdetect library.
Convert Text to Audio: Convert the extracted text into an audio file using Google Text-to-Speech (gTTS).

File Structure

.
PDF_to_Audio_Conversion/
├── app.py
├── requirements.txt
├── static/
│   └── css/
│       └── style.css
├── templates/
│   └── index.html
├── PDF_to_Audio_Conversion.ipynb
│   
└── uploads/
    └── [Uploaded and paraphrased PDFs will be stored here]

Usage

1. Clone the repository

git clone https://github.com/your-username/pdf-to-audio-conversion.git
cd pdf-to-audio-conversion

2. Run the Jupyter Notebook

You can open and run the notebook PDF_to_Audio_Conversion.ipynb in Jupyter. Follow the steps in the notebook to convert your PDF files into audio.

Alternatively, you can use the following example Python script:

3. Example Script

from langdetect import detect
from PyPDF2 import PdfReader
from gtts import gTTS

def detect_language_from_pdf(pdf_path):
    reader = PdfReader(pdf_path)
    text = ""
    for page in reader.pages:
        text += page.extract_text()
    language = detect(text)
    return language, text

def text_to_audio(text, language, output_path):
    lang_code = 'en' if language == 'en' else 'ar'
    tts = gTTS(text=text, lang=lang_code)
    tts.save(output_path)

def pdf_to_audio_pipeline(pdf_path, audio_output_path):
    # Step 1: Detect Language and Extract Text
    language, text = detect_language_from_pdf(pdf_path)

    # Step 2: Convert Text to Audio and Save as MP3
    text_to_audio(text, language, audio_output_path)

# Example usage
pdf_path = "example.pdf"
audio_output_path = "output_audio.mp3"
pdf_to_audio_pipeline(pdf_path, audio_output_path)

Example

To convert a PDF into an MP3 audio file:

Place your PDF file in the same directory as the notebook or script.
Set the pdf_path and audio_output_path in the script or notebook.
Run the pipeline, and it will generate an audio file from the extracted text.

Example:

pdf_path = "example.pdf"
audio_output_path = "output_audio.mp3"
pdf_to_audio_pipeline(pdf_path, audio_output_path)

Customization

Language Support: Currently, the script supports English (en) and Arabic (ar). You can add support for more languages by modifying the text_to_audio function.
Error Handling: The script assumes valid PDFs and text extraction. Add error handling to manage different file formats or empty pages.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Api_code		Api_code
README.md		README.md
convert pdf book into audio.ipynb		convert pdf book into audio.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF to Audio Conversion Pipeline

Overview

Key Features:

Requirements

Install Dependencies:

How It Works

File Structure

Usage

1. Clone the repository

2. Run the Jupyter Notebook

3. Example Script

Example

Customization

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PDF to Audio Conversion Pipeline

Overview

Key Features:

Requirements

Install Dependencies:

How It Works

File Structure

Usage

1. Clone the repository

2. Run the Jupyter Notebook

3. Example Script

Example

Customization

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages