Image to Speech System

Hanci Quan

07/12/2022

Supervised by Bailin Deng; Moderated by Víctor Gutiérrez Basulto

In this project, you will build a prototype system that converts the text inside an image to a speech. Such a system can be useful for people with visual impairment. The hardware consists of a computer attached to a camera. The system will use computer vision libraries to extract text that is inside an image captured by the camera, and then use text-to-speech APIs to convert the text to audio. The system is expected to be deployed on a PC with a webcam, or on a raspberry PI with a camera module.

Final Report (07/12/2022) [Zip Archive]

1-report.pdf
2-Hanci Quan(21112401)-imageToSpeech.zip
3-PGNet recognition results.zip
4-ReadMe.txt

Image to Speech System

Final Report (07/12/2022) [Zip Archive]

Publication Form