[PDF]

Image to Speech System


Hanci Quan

07/12/2022

Supervised by Bailin Deng; Moderated by Víctor Gutiérrez Basulto

In this project, you will build a prototype system that converts the text inside an image to a speech. Such a system can be useful for people with visual impairment. The hardware consists of a computer attached to a camera. The system will use computer vision libraries to extract text that is inside an image captured by the camera, and then use text-to-speech APIs to convert the text to audio. The system is expected to be deployed on a PC with a webcam, or on a raspberry PI with a camera module.


Final Report (07/12/2022) [Zip Archive]

Publication Form