Software of speech identification and compression

Authors

Ing. Jiří Mekyska, Ph.D., prof. Ing. Zdeněk Smékal, CSc.

Date

3. listopadu 2016

Description

This software is used for voice activity detection and consequent compression of cleaned speech signal. The voice activity detection algorithm is based on voicing identification with fundamental frequency ranging from 60 to 400 Hz. The swipep function introduced by Arturo Camacho (http://www.cise.ufl.edu/~acamacho/english/) was used for the fundamental frequency estimation. OPUS codec with 16 kb/s bitrate and 8 kHz sampling frequency was used for the consequent compression. Thanks to its selective and compression capabilities, this software significantly supress the memory usage, especially during a long-term logging.

vad
compression

Projects

This work was supported by project TA04031666. The described research was performed in laboratories supported by the SIX project; the registration number CZ.1.05/2.1.00/03.0072, the operational program Research and Development for Innovation.

License

To negotiate the license terms of use of this software please contact the responsible person Ing. Lukas Novak at Technology Transfer Office, Brno University of Technology, Kounicova 966/67a, Veveří, 60200, Brno, Czech Republic, novak@ro.vutbr.cz.