Skip to content

Audiobook alignment for North American Indigenous languages

License

Notifications You must be signed in to change notification settings

finguist/ReadAlong-Studio

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

174 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

readlongs

Audiobook alignment for North American Indigenous languages

end product

The concept is a web application with a series of stages of processing, which ultimately leads to a time-aligned audiobook - i.e. a package of:

  • SMIL file describing time alignments
  • TEI file describing text
  • Audio file (WAV or MP3)

Which can be loaded using the read-along JavaScript component.

Optionally a book can be generated as a standalone HTML page or as an ePub file.

  1. (optional) Pre-segment inputs, consisting of:
    • Single audio file
    • Text with page markings (assume paragraph breaks = pages)
  2. Input pages: each page consists of
    • Image file
    • Audio file
    • Text
  3. Run alignment
  4. View output and download components

roadmap

  • MVP app:
    • Single page (image, audio, text)
    • Select language (crl or atj for now)
    • Run alignment and launch read-along app with output

running the web app

  1. pip install -e .
  2. python
  3. >>> from readalongs.app import app
  4. app.run()

generating an ePub

  1. pip install -e .
  2. readalongs_align --output-xhtml XMLFILE WAVFILE OUTPUTNAME
  3. readalongs_create_epub OUTPUTNAME.smil OUTPUTNAME.epub

About

Audiobook alignment for North American Indigenous languages

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 72.0%
  • Python 17.5%
  • CSS 8.1%
  • HTML 2.4%