Next: 13.12.4 Tracing
Up: 13.12 HQuant
Previous: 13.12.2 VQ Codebook Format
HQUANT is invoked via the command line:
    HQuant [options] vqFile trainFiles ...
where vqFile is the name of the output VQ table file.
The effect of this command
is to read in training data from each trainFile, cluster the
data and write the final cluster centres into the VQ table file.
The list of training files can be stored in a script file if required
(loaded with the standard option -S).
Furthermore, the data used for training the codebook
can be restricted to the frames of segments carrying a specified label
(option -l). This can be
used, for example, to train phone-specific codebooks. When constructing
a linear codebook, the maximum number of iterations per cluster can be
limited by setting the configuration variable MAXCLUSTITER.
The minimum number of samples in any one cluster can be set using the
configuration variable MINCLUSTSIZE.
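The invocation described above can be sketched as a small Python helper that assembles the argument list. This is only an illustration, not part of HTK: the function names, the file names and the numeric values are hypothetical, and the sketch assumes the usual HTK conventions that -C names a configuration file, -S names a script file, and configuration entries are written as NAME = VALUE.

```python
def render_config(max_clust_iter=None, min_clust_size=None):
    """Return config text in 'NAME = VALUE' form for the two variables.

    The layout is assumed to follow the usual HTK configuration file style.
    """
    lines = []
    if max_clust_iter is not None:
        lines.append(f"MAXCLUSTITER = {max_clust_iter}")
    if min_clust_size is not None:
        lines.append(f"MINCLUSTSIZE = {min_clust_size}")
    return "\n".join(lines) + "\n"


def build_hquant_command(vq_file, train_files=(), script=None,
                         config=None, label=None):
    """Assemble an argv list for 'HQuant [options] vqFile trainFiles ...'."""
    if not train_files and script is None:
        raise ValueError("give training files directly or via a script file")
    argv = ["HQuant"]
    if config is not None:
        argv += ["-C", config]   # configuration file (standard option)
    if script is not None:
        argv += ["-S", script]   # script file listing the training files
    if label is not None:
        argv += ["-l", label]    # use only segments with this label
    argv.append(vq_file)         # output VQ table file
    argv.extend(train_files)     # training files given on the command line
    return argv
```

For example, build_hquant_command("vq.table", script="train.scp", label="aa") yields an argument list that could be passed to subprocess.run on a machine where HQuant is installed.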
The detailed operation of HQUANT is controlled by the following
command line options
- -d
- Use a diagonal-covariance Mahalanobis distance metric for
clustering (default is to use a Euclidean distance metric).
- -f
- Use a full-covariance Mahalanobis distance metric for
clustering (default is to use a Euclidean distance metric).
- -g
- Output the global covariance to the codebook. Normally,
covariances are computed individually for each cluster using
the data in that cluster. This option computes a global covariance
across all the clusters.
- -l s
- The string s must be the name of a
segment label. When this option is used, HQUANT searches
through all of the training files and uses only the speech
frames from segments with the given label. When this option is not
used, HQUANT uses all of the data in each training file.
- -n S N
- Set size of codebook for stream S
to N (default 256).
If tree-structured codebooks are required then N
must be a power of 2.
- -s N
- Set number of streams to N (default 1).
Unless the -w option is used, the width of each stream
is set automatically depending on the size and parameter kind of the
training data.
- -t
- Create tree-structured codebooks (default linear).
- -w S N
- Set width of stream S to N.
This option overrides the default decomposition that HTK normally
uses to divide a parameter file into streams. If this option is used,
it must be repeated for each individual stream.
- -F fmt
- Set the source data format to fmt.
- -G fmt
- Set the label file format to fmt.
- -I mlf
- Load the master label file mlf. This option
may be repeated to load several MLFs.
- -L dir
- Search directory dir for label files (default
is to search the current directory).
- -X ext
- Set label file extension to ext
(default is lab).
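The three distance metrics selected by the default setting, -d and -f can be illustrated with a short Python sketch. It shows the standard textbook definitions of the squared Euclidean and Mahalanobis distances, not HQUANT's internal code; the function names are hypothetical, and the full-covariance case is restricted to two dimensions to keep the example short.

```python
def euclidean_sq(x, centre):
    """Squared Euclidean distance (the default metric)."""
    return sum((a - b) ** 2 for a, b in zip(x, centre))


def diag_mahalanobis_sq(x, centre, variances):
    """Squared Mahalanobis distance with a diagonal covariance (cf. -d).

    Each squared difference is scaled by the variance of that dimension.
    """
    return sum((a - b) ** 2 / v for a, b, v in zip(x, centre, variances))


def full_mahalanobis_sq_2d(x, centre, cov):
    """Squared Mahalanobis distance with a full 2x2 covariance (cf. -f).

    Computes d^T * inverse(cov) * d using the closed-form 2x2 inverse.
    """
    (s00, s01), (s10, s11) = cov
    det = s00 * s11 - s01 * s10
    d0, d1 = x[0] - centre[0], x[1] - centre[1]
    return (d0 * (s11 * d0 - s01 * d1) + d1 * (-s10 * d0 + s00 * d1)) / det
```

With unit variances and no correlation, both Mahalanobis forms reduce to the Euclidean distance; they differ only when the covariance rescales or couples the dimensions.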
HQUANT also supports the standard options -A,
-C, -D, -S, -T, and -V as described
in section 4.4.
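The constraints stated for the stream options (-s, -n, -w and -t) can be checked before HQUANT is run. The following Python sketch is a hypothetical helper, not part of HTK; it assumes that streams are numbered from 1 and only enforces the two rules given above: N must be a power of 2 for tree-structured codebooks, and -w must be given for every stream if it is used at all.

```python
def is_power_of_two(n):
    """True if n is a positive power of 2."""
    return n > 0 and (n & (n - 1)) == 0


def stream_options(sizes, widths=None, tree=False):
    """Return HQuant option arguments for per-stream sizes and widths.

    sizes  -- codebook size N for each stream, in stream order
    widths -- optional width for each stream; if given, one per stream
    tree   -- request tree-structured codebooks (-t)
    """
    if widths is not None and len(widths) != len(sizes):
        raise ValueError("-w must be given for every stream")
    args = ["-s", str(len(sizes))]
    if tree:
        args.append("-t")
    # Stream numbers are assumed to start at 1.
    for stream, size in enumerate(sizes, start=1):
        if tree and not is_power_of_two(size):
            raise ValueError(f"stream {stream}: size {size} is not a power of 2")
        args += ["-n", str(stream), str(size)]
    for stream, width in enumerate(widths or [], start=1):
        args += ["-w", str(stream), str(width)]
    return args
```

The returned list can be spliced into the HQuant argument list ahead of the VQ table file name.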
ECRL HTK_V2.1: email [email protected]