What is cepstrum analysis?
James Stevens
Updated on April 26, 2026
Cepstrum Analysis is a tool for the detection of periodicity in a frequency spectrum, and seems so far to have been used mainly in speech analysis for voice pitch determination and related questions. 3, 4, 5), and determination of these modulation frequencies can be very useful in diagnosis of the fault.
Why is cepstrum analysis used?
The cepstrum is a representation used in homomorphic signal processing, to convert signals combined by convolution (such as a source and filter) into sums of their cepstra, for linear separation. In particular, the power cepstrum is often used as a feature vector for representing the human voice and musical signals.
How do you calculate cepstrum?
The real cepstrum of a signal x, sometimes called simply the cepstrum, is calculated by determining the natural logarithm of magnitude of the Fourier transform of x, then obtaining the inverse Fourier transform of the resulting sequence: c x = 1 2 π ∫ – π π log | X ( e j ω ) | e j ω n d ω .
What is difference between spectrum and cepstrum?
As nouns the difference between spectrum and cepstrum is that spectrum is specter, apparition while cepstrum is (mathematics) the fourier transform of the logarithm of a spectrum; used especially in voice analysis.
What is Cepstral distance?
In general, cepstral distance is applied to measuring the similarity between two frames of signals. Figure 2 shows cepstral distance between one angry utterance and its corresponding neutral utterance.
Is MFCC a spectrogram?
The mel-spectrogram is often log-scaled before. MFCC is a very compressible representation, often using just 20 or 13 coefficients instead of 32-64 bands in Mel spectrogram. The MFCC is a bit more decorrelarated, which can be beneficial with linear models like Gaussian Mixture Models. Take logs of Mel spectrogram.
What is Cepstral frequency?
In sound processing, the mel-frequency cepstrum (MFC) is a representation of the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC.
What is Cepstral peak prominence?
Cepstral peak prominence (CPP) is an acoustic measure of voice quality that has been qualified as the most promising and perhaps robust acoustic measure of dysphonia severity [1].
What are the features of MFCC?
The MFCC feature extraction technique basically includes windowing the signal, applying the DFT, taking the log of the magnitude, and then warping the frequencies on a Mel scale, followed by applying the inverse DCT. The detailed description of various steps involved in the MFCC feature extraction is explained below.
What is a cepstrum analysis in signal processing?
Cepstrum analysis is a nonlinear signal processing technique with a variety of applications in areas such as speech and image processing. The complex cepstrum of a sequence x is calculated by finding the complex natural logarithm of the Fourier transform of x, then the inverse Fourier transform of the resulting sequence:
What is the cepstrum in phonology?
The cepstrum is defined to be the IDFT[log |S(omega)|], with the cepstrum represented as c(n), with units of ms in the quefrency domain. Figure: Examples of cepstrum analysis for voiced and unvoiced speech.
What are the applications of the cepstrum in mechanics?
The cepstrum had been used in speech analysis for determining voice pitch (by accurately measuring the harmonic spacing), but also for separating the formants (transfer function of the vocal tract) from voiced and unvoiced sources, and this led quite early to similar applications in mechanics.
What is Cepstrum pitch determination used for?
It has also been used to determine the fundamental frequency of human speech and to analyze radar signal returns. Cepstrum pitch determination is particularly effective because the effects of the vocal excitation (pitch) and vocal tract (formants) are additive in the logarithm of the power spectrum and thus clearly separate.