M. Tech. Project

 

The ability to modify the apparent rate of speech is desirable in a number of applications. For example, the bit rate required for medium-band speech coding can be reduced by time-scale compression of the input speech before coding and transmission, followed by time-scale expansion at the receiver to restore the original time scale.

 

There are two main types of speech modification: time-scale modification and pitch-scale modification. Each can be carried out in either the time domain or the frequency domain. In the time-domain approach, the speech is synthesized with the overlap-add (OLA) method. For better accuracy, the synchronized OLA method is used, in which the overlapping segments are synchronized with the pitch of the speech signal. The frequency-domain approach is based on the Short-Time Fourier Transform (STFT). The STFT method is more complex because it involves many calculations, whereas the time-domain method is faster because it requires fewer calculations.
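As an illustration of the time-domain idea, the following is a minimal sketch of plain OLA time-scale modification in Python with NumPy. It is not the implementation used in this project: the function name ola_time_stretch and the parameters rate, frame_len and synth_hop are assumptions chosen for the example, and no pitch synchronization is performed. Frames are read from the input at one hop size and overlap-added to the output at another.

```python
import numpy as np

def ola_time_stretch(x, rate, frame_len=1024, synth_hop=256):
    """Plain OLA time-scale modification (illustrative sketch only).

    rate > 1 shortens the signal (faster speech); rate < 1 lengthens it.
    All parameter names and defaults are assumptions for this example.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    x = np.asarray(x, dtype=float)
    # Frames are read every analysis_hop samples and written every synth_hop.
    analysis_hop = max(1, int(round(synth_hop * rate)))
    if len(x) < frame_len:
        x = np.pad(x, (0, frame_len - len(x)))  # ensure one full frame
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // analysis_hop
    out_len = (n_frames - 1) * synth_hop + frame_len
    y = np.zeros(out_len)
    norm = np.zeros(out_len)
    for m in range(n_frames):
        a = m * analysis_hop   # read position in the input
        s = m * synth_hop      # write position in the output
        y[s:s + frame_len] += window * x[a:a + frame_len]
        norm[s:s + frame_len] += window
    # Divide by the summed windows to undo the overlap gain;
    # guard samples where the window sum is (near) zero.
    norm[norm < 1e-8] = 1.0
    return y / norm
```

Because the frames are placed without regard to pitch periods, this plain version can introduce audible phase discontinuities in voiced speech; aligning the overlapping segments, as the synchronized method does, is intended to reduce that artifact.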

 

In this project, both phase-vocoder-based time-scale modification and synchronized overlap-add time-scale modification are implemented. Both methods are described theoretically and, after successful implementation, compared in three respects: quality at different rates, complexity, and limitations.