Real Time Speech Driven Face Animation
Independent thesis Basic level (professional degree)Student thesisAlternative title
Talstyrd Ansiktsanimering i Realtid (Swedish)
The goal of this project is to implement a system to analyse an audio signal containing speech, and produce a classifcation of lip shape categories (visemes) in order to synchronize the lips of a computer generated face with the speech.
The thesis describes the work to derive a method that maps speech to lip move- ments, on an animated face model, in real time. The method is implemented in C++ on the PC/Windows platform. The program reads speech from pre-recorded audio files and continuously performs spectral analysis of the speech. Neural networks are used to classify the speech into a sequence of phonemes, and the corresponding visemes are shown on the screen.
Some time delay between input speech and the visualization could not be avoided, but the overall visual impression is that sound and animation are synchronized.
Place, publisher, year, edition, pages
Institutionen för systemteknik , 2003. , 50 p.
Technology, phonemes, visemes, real-time, neural networks
Engineering and Technology
IdentifiersURN: urn:nbn:se:liu:diva-2015OAI: oai:DiVA.org:liu-2015DiVA: diva2:19343