liu.seSearch for publications in DiVA
Change search
ReferencesLink to record
Permanent link

Direct link
Voice Activity Detection in the Tiger Platform
Linköping University, Department of Electrical Engineering.
2006 (English)Independent thesis Basic level (professional degree), 20 points / 30 hpStudent thesis
Abstract [en]

Sectra Communications AB has developed a terminal for encrypted communication called the Tiger platform. During voice communication delays have sometimes been experienced resulting in conversational complications.

A solution to this problem, as was proposed by Sectra, would be to introduce voice activity detection, which means a separation of speech parts and non-speech parts of the input signal, to the Tiger platform. By only transferring the speech parts to the receiver, the bandwidth needed should be dramatically decreased. A lower bandwidth needed implies that the delays slowly should disappear. The problem is then to come up with a method that manages to distinguish the speech parts from the input signal. Fortunately a lot of theory on the subject has been done and numerous voice activity methods exist today.

In this thesis the theory of voice activity detection has been studied. A review of voice activity detectors that exist on the market today followed by an evaluation of some of these was performed in order to select a suitable candidate for the Tiger platform. This evaluation would later become the foundation for the selection of a voice activity detector for implementation.

Finally, the implementation of the chosen voice activity detector, including a comfort noise generator, was done on the platform. This implementation was based on the special requirements of the platform. Tests of the implementation in office environments show that possible delays are steadily being reduced during periods of speech inactivity, while the active speech quality is preserved.

Place, publisher, year, edition, pages
Institutionen för systemteknik , 2006. , 63 p.
Keyword [en]
Voice activity detection, Comfort noise generation, G.729B, G.729D, G.729F, Fuzzy VAD, AMR, EFR, HR, FR, LPC., Discontinuous transmission, Speech coding, Linear predictive coding, Tiger, VAD, CNG, DTX, G.729
National Category
Control Engineering
URN: urn:nbn:se:liu:diva-6586ISRN: LITH-ISY-EX--06/3817--SEOAI: diva2:21887
Available from: 2006-06-09 Created: 2006-06-09

Open Access in DiVA

fulltext(478 kB)762 downloads
File information
File name FULLTEXT01.pdfFile size 478 kBChecksum SHA-1
Type fulltextMimetype application/pdf

By organisation
Department of Electrical Engineering
Control Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 762 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 527 hits
ReferencesLink to record
Permanent link

Direct link