Abstract
This paper presents a new method for estimating F0, the fundamental frequency of voiced speech, as a function of time. The algorithm is based upon a simplified version of a general technique for fundamental frequency extraction using digital inverse filtering. It is demonstrated that the simplified inverse filter tracking algorithm (hereafter referred to as the SIFT algorithm) combines the desirable properties of both autocorrelation and cepstral pitch analysis techniques. In addition, the SIFT algorithm requires only a relatively small number of elementary arithmetic operations. Implemented in machine language, SIFT should run in several times real time, while with special-purpose hardware it could easily be realized in real time.
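As an illustration of the general idea (inverse filtering followed by autocorrelation peak picking), the following is a minimal sketch and not the authors' implementation. The abstract does not specify any parameters, so everything here is an assumption: the function names (`sift_f0`, `levinson`, `autocorr`, `synth_voiced`), the block-average low-pass used before decimation, the decimation factor, the filter order of 4, the Hamming window, the 50 to 400 Hz search range, and the voicing threshold of 0.4.

```python
import math

def autocorr(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]

def levinson(r, order):
    """Levinson-Durbin recursion; returns inverse-filter coefficients a[0..order], a[0] = 1."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        a = [a[j] + k * a[i - j] if 1 <= j < i else a[j] for j in range(order + 1)]
        a[i] = k
        err *= 1.0 - k * k
    return a

def sift_f0(frame, fs, decim=4, order=4, fmin=50.0, fmax=400.0, voicing_threshold=0.4):
    """Estimate F0 (Hz) of one frame, or return None if it looks unvoiced.

    All default parameter values are illustrative assumptions.
    """
    # 1. Crude low-pass (block average) and decimation; a stand-in for a proper filter.
    low = [sum(frame[i:i + decim]) / decim
           for i in range(0, len(frame) - decim + 1, decim)]
    fs_d = fs / decim
    n = len(low)
    # 2. Hamming window.
    x = [s * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
         for i, s in enumerate(low)]
    # 3. Low-order linear prediction gives the inverse filter.
    r = autocorr(x, order)
    if r[0] <= 0.0:
        return None
    a = levinson(r, order)
    # 4. Inverse-filter the frame to flatten its spectrum (prediction residual).
    e = [sum(a[j] * x[i - j] for j in range(order + 1) if i - j >= 0)
         for i in range(n)]
    # 5. Autocorrelate the residual and pick the largest peak in the pitch range.
    lo = max(1, int(fs_d / fmax))
    hi = min(n - 1, int(fs_d / fmin))
    re = autocorr(e, hi)
    if re[0] <= 0.0:
        return None
    lag = max(range(lo, hi + 1), key=lambda k: re[k])
    if re[lag] / re[0] < voicing_threshold:
        return None
    return fs_d / lag

def synth_voiced(f0, fs, dur):
    """Hypothetical test signal: impulse train through a two-pole resonator at 500 Hz."""
    period = round(fs / f0)
    r_pole, theta = 0.95, 2 * math.pi * 500.0 / fs
    b1, b2 = 2 * r_pole * math.cos(theta), -r_pole ** 2
    y = []
    for i in range(int(fs * dur)):
        xin = 1.0 if i % period == 0 else 0.0
        y1 = y[-1] if i >= 1 else 0.0
        y2 = y[-2] if i >= 2 else 0.0
        y.append(xin + b1 * y1 + b2 * y2)
    return y

signal = synth_voiced(100.0, 8000, 0.1)
print(sift_f0(signal[240:560], 8000))  # one 40 ms frame
```

The sketch trades accuracy for brevity: the lag resolution is one sample at the decimated rate, so a practical version would presumably interpolate around the autocorrelation peak and use a properly designed low-pass filter.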
