In file ../include/sigpr/EST_sigpr_utt.h:

void fbank(EST_Wave &sig, EST_Track &fbank,
  const float factor,
  EST_WindowFunc* wf = EST_Window::creator(DEFAULT_WINDOW_NAME),
  const bool up = false, const bool take_log = true)

Mel scale filter bank analysis.

Documentation

Mel scale filter bank analysis. The Mel scale triangular filters are applied to a spectrum computed via an FFT (see fastFFT). This routine is required for Mel cepstral analysis (see melcep). Each frame is analysed by sig2fbank.

A typical filter bank analysis for speech recognition might use log energy outputs from 20 filters.
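As an illustration of the per-frame step described above, the following is a minimal sketch of triangular Mel scale filtering applied to one frame's spectrum. It is not the library's implementation: the names hz_to_mel, mel_to_hz and mel_fbank_frame are hypothetical, and the Mel formula (2595 log10(1 + f/700)), the filter spacing from 0 Hz to the Nyquist frequency, and the log floor of 1e-10 are all assumptions made for the example.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helpers for illustration; not part of the Edinburgh Speech Tools API.
// Assumed Mel mapping: mel = 2595 * log10(1 + hz / 700).
inline double hz_to_mel(double hz) { return 2595.0 * std::log10(1.0 + hz / 700.0); }
inline double mel_to_hz(double mel) { return 700.0 * (std::pow(10.0, mel / 2595.0) - 1.0); }

// Apply n_filters triangular filters, equally spaced on the Mel scale between
// 0 Hz and the Nyquist frequency, to the spectrum of a single frame.
// spectrum[k] holds the power (or energy) of FFT bin k, for k = 0 .. N/2.
std::vector<double> mel_fbank_frame(const std::vector<double> &spectrum,
                                    double sample_rate,
                                    std::size_t n_filters,
                                    bool take_log = true)
{
    assert(spectrum.size() >= 2 && n_filters >= 1 && sample_rate > 0.0);
    const double nyquist = sample_rate / 2.0;
    const double mel_step = hz_to_mel(nyquist) / (n_filters + 1);
    const double bin_hz = nyquist / (spectrum.size() - 1);

    std::vector<double> out(n_filters, 0.0);
    for (std::size_t m = 0; m < n_filters; ++m) {
        // Lower edge, centre and upper edge of filter m, in Hz.
        const double lo = mel_to_hz(mel_step * m);
        const double mid = mel_to_hz(mel_step * (m + 1));
        const double hi = mel_to_hz(mel_step * (m + 2));
        for (std::size_t k = 0; k < spectrum.size(); ++k) {
            const double f = k * bin_hz;
            double w = 0.0;  // triangular weight of bin k in filter m
            if (f > lo && f <= mid)
                w = (f - lo) / (mid - lo);
            else if (f > mid && f < hi)
                w = (hi - f) / (hi - mid);
            out[m] += w * spectrum[k];
        }
        // Optionally take the log, flooring the value to avoid log(0).
        if (take_log)
            out[m] = std::log(out[m] > 1e-10 ? out[m] : 1e-10);
    }
    return out;
}
```

For example, calling mel_fbank_frame(spectrum, 16000.0, 20) on a 257-bin spectrum would give 20 log filter outputs for that frame, matching the typical configuration mentioned above.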

Parameters:
sig: input waveform
fbank: the output. The number of filters is determined from the number of channels in this track.
factor: the frame length factor, i.e. the analysis frame length will be this times the local pitch period
wf: function for windowing. See Windowing mechanisms
up: whether the filter bank analysis should use power rather than energy
take_log: whether to take logs of the filter outputs
See Also:
sig2fbank
melcep



This page is part of the Edinburgh Speech Tools Library documentation
Copyright University of Edinburgh 1997
Contact: speech_tools@cstr.ed.ac.uk