Abstract

Audio descriptions are a form of narration that provide blind and low vision individuals information about key visual elements of a video. This project aims to develop a machine learning model that generates audio descriptions to improve the accessibility of videos. The model analyzes the frames of a video by quantifying their complexity using JPG image size. Hierarchical clustering is performed on the JPG image sizes to identify the most representative frames. These frames are processed by the Contrastive Language-Image Pre-training (CLIP) Interrogator which generates descriptions for each of the selected frames. The descriptions are then added to the video in text and audio. The limitations of this model include lengthy processing time and inaccurate descriptions generated by the CLIP Interrogator model.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Generating Audio Descriptions for Videos Using Contrastive Language-Image Pre-Training (CLIP) Interrogation Presentation.pptx		Generating Audio Descriptions for Videos Using Contrastive Language-Image Pre-Training (CLIP) Interrogation Presentation.pptx
Generating Audio Descriptions for Videos Using Contrastive Language-Image Pre-Training (CLIP) Interrogation Presentation.pptx.pdf		Generating Audio Descriptions for Videos Using Contrastive Language-Image Pre-Training (CLIP) Interrogation Presentation.pptx.pdf
Generating Audio Descriptions for Videos Using Contrastive Language-Image Pre-training (CLIP) Interrogation.docx		Generating Audio Descriptions for Videos Using Contrastive Language-Image Pre-training (CLIP) Interrogation.docx
Generating Audio Descriptions for Videos Using Contrastive Language-Image Pre-training (CLIP) Interrogation.docx.pdf		Generating Audio Descriptions for Videos Using Contrastive Language-Image Pre-training (CLIP) Interrogation.docx.pdf
README.md		README.md
audio_description_generation.ipynb		audio_description_generation.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Abstract

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Abstract

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages