Audiobook Factory

The server at the moment is Kali providing conversion for upto ~50KB of text at a time.

This is a quick hack to convert common ebook, image, and text formats into audio books. Using open source OCR and voice synthesis software. Please don't abuse or overload this test service, a giant 100+ page djvu file will destroy the server unless you specify a subset of a few pages covering a chapter.

Please be patient after pressing "Download" as it takes time to process the input and prepare a file for you. Please do not press the "Download" button repeatedly after submitting a job, this will not make anything go faster, and is likely to bring the server to a grinding halt!

A single line of text translates to about a 15-20KiB MP3 file, but the intermediate wave form held in memory is much bigger, so you can imagine massive amounts of text make the server cry. Someone recently uploaded a 50KiB Asimov book in plain text, which produced a 5MiB (and 25 minute long!) MP3, this is the rough upper-limit of what I want to see being uploaded. If anyone wants to offer me some uber-hosting I would be very grateful ;D

Current features include:

  1. Various input formats and sources supported: .doc .rtf .html .pdf .djvu .ps .txt and most kinds of images.
  2. Various encoded audio outputs.
  3. Use of completely open source technology: ImageMagick, mencoder, Lame, OGGEnc, GOCR, DjVu-libre, xpdf, Festival, Antiword and many other command-line GNU/Linux tools.

Future features might include:

Create Audiobook

/!\.epub .mobi and .azw not currently supported, but coming soon!

From An E-Book Or File

Upload an ebook, image, or plain-text file to the server and have it converted to audio. Currently supports the following formats: .djvu .pdf .ps .doc .rtf .html .txt .jpg .png and all these image formats supported by ImageMagick.
Uploading images or raster based documents, rather than text based files, will require Optical Character Recognition to reconstruct flat text. This is not perfect, but should still give reasonable results. In future development the postprocessing of the text generated from OCR will be improved.

Used with PDF and DjVu only.

From A Wikipedia Article

/!\ Under construction, might not read all article text.

From A Website

/!\ Under construction, tags are stripped without much other consideration for readability.

From Plain Text

Paste or type in the text you want to be converted to audio.

The information provided on this and other pages by me, Matt Oates (, is under my own personal responsibility and not that of Aberystwyth University or the University of Bristol. Similarly, any opinions expressed are my own and are in no way to be taken as those of either institute.