next up previous contents index
Next: 5.15 Summary Up: 5 Speech Input/Output Previous: 5.13 Copying and Coding using HCOPY

5.14 Version 1.5 Compatibility

 

The redesign of the HTK front-end in version 2.0 has introduced a number of differences in parameter encoding. The main changes are

  1. Source waveform zero mean processing is now performed on a frame-by-frame basis.
  2. Delta coefficients use a modified form of regression rather than simple differences at the start and end of the utterance.
  3. Energy scaling is no longer applied to the zero'th MFCC coefficient.
If a parameter encoding is required which is as close as possible to the version 1.5 encoding, then the compatibility configuration variable V1COMPAT should be set to true.

Note also in this context that the default values for the various configuration values have been chosen to be consistent with the defaults or recommended practice for version 1.5.


next up previous contents index
Next: 5.15 Summary Up: 5 Speech Input/Output Previous: 5.13 Copying and Coding using HCOPY

ECRL HTK_V2.1: email [email protected]