
13.4.1 Function

The HTK tool HDMAN prepares a pronunciation dictionary from one or more source dictionaries. It reads a list of editing commands from a script file and outputs an edited and merged copy of the source dictionaries.

Each source pronunciation dictionary consists of comment lines and definition lines. Comment lines start with the # character (or, optionally, any one of a specified set of comment characters) and are ignored by HDMAN. Each definition line starts with a word, followed by a sequence of symbols (phones) that define its pronunciation. The word and the phones are delimited by spaces or tabs, and the end of the line terminates each definition.
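The line format described above can be sketched in a few lines of Python. This is only an illustration of the format, not HDMAN's own parser: the function name parse_dictionary and the comment_chars parameter are hypothetical, blank lines are assumed to be skipped, and any quoting or escaping rules a real dictionary may use are ignored.

```python
from typing import Iterable, Iterator, List, Tuple


def parse_dictionary(
    lines: Iterable[str], comment_chars: str = "#"
) -> Iterator[Tuple[str, List[str]]]:
    """Yield (word, phones) for each definition line, skipping comments."""
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.strip():
            continue  # assumption: blank lines carry no definition
        if line[0] in comment_chars:
            continue  # comment line, ignored
        fields = line.split()  # splits on runs of spaces or tabs
        yield fields[0], fields[1:]


# Toy input; the words and phones are made up for illustration.
sample = [
    "# toy dictionary",
    "A\tah",
    "ABOUT  ax b aw t",
]
entries = list(parse_dictionary(sample))
```

Here entries holds one (word, phones) pair per definition line, with the comment line dropped.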

Each edit command in the script file must be on a separate line. Lines in the script file starting with a # are comment lines and are ignored. The commands supported are listed below. They can be displayed by HDMAN using the -Q option.
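As a rough sketch of the script layout just described (one command per line, # lines ignored), the following Python reads a script into a list of commands. The function name read_edit_script is hypothetical, the command names CMD1 and CMD2 are placeholders rather than real HDMAN commands, and the assumption that a command is a name followed by whitespace-separated arguments is made only for illustration.

```python
from typing import Iterable, List, Tuple


def read_edit_script(lines: Iterable[str]) -> List[Tuple[str, List[str]]]:
    """Return (command, arguments) pairs, one per non-comment line."""
    commands = []
    for raw in lines:
        if raw.startswith("#"):
            continue  # comment line, ignored
        fields = raw.split()
        if not fields:
            continue  # assumption: blank lines are skipped
        commands.append((fields[0], fields[1:]))
    return commands


# Placeholder script; CMD1 and CMD2 are not real HDMAN commands.
script = [
    "# hypothetical edit script",
    "CMD1 x y",
    "",
    "CMD2 z",
]
commands = read_edit_script(script)
```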

All input dictionaries must be sorted. When no edit files are specified, HDMAN simply merges all of the input dictionaries and outputs the result in sorted order. Each input dictionary xxx may be processed by its own private set of edit commands stored in xxx.ded. After the input dictionaries have been processed by their own edit scripts, the merged dictionary can be processed by the commands in global.ded (or some other specified global edit file).

Dictionaries are processed on a word by word basis in the order that they appear on the command line. Thus, all of the pronunciations for a given word are loaded into a buffer, then all edit commands are applied to these pronunciations. The result is then output and the next word loaded.

Where two or more dictionaries give pronunciations for the same word, the default behaviour is that only the first set of pronunciations encountered is retained and all others are ignored. An option exists to override this so that the pronunciations from all dictionaries are concatenated.
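The two merge behaviours can be illustrated with a small Python sketch. It is an assumption-laden model rather than HDMAN's implementation: merge_dictionaries and keep_all are hypothetical names, the whole input is held in memory instead of being processed word by word, the output uses Python's default string ordering, and duplicate pronunciations are not removed.

```python
from typing import Dict, Iterable, List, Tuple

Entry = Tuple[str, List[str]]


def merge_dictionaries(
    sources: Iterable[Iterable[Entry]], keep_all: bool = False
) -> List[Entry]:
    """Merge dictionaries given in command-line order.

    By default only the pronunciations from the first dictionary that
    defines a word are kept; with keep_all=True, pronunciations from
    later dictionaries are appended as well.
    """
    merged: Dict[str, List[List[str]]] = {}
    owner: Dict[str, int] = {}  # index of the first source defining each word
    for index, source in enumerate(sources):
        for word, phones in source:
            if word not in owner:
                owner[word] = index
                merged[word] = [phones]
            elif keep_all or owner[word] == index:
                merged[word].append(phones)
    return [(word, phones) for word in sorted(merged) for phones in merged[word]]


# Toy dictionaries; the entries are made up for illustration.
first = [("A", ["ah"]), ("A", ["ey"]), ("CAT", ["k", "ae", "t"])]
second = [("A", ["ax"]), ("BOB", ["b", "oh", "b"])]

default_merge = merge_dictionaries([first, second])
full_merge = merge_dictionaries([first, second], keep_all=True)
```

In default_merge the word A keeps only the two pronunciations from the first dictionary, while full_merge also carries the pronunciation contributed by the second.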

Dictionary entries can be filtered by a word list such that all entries not in the list are ignored.
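Word-list filtering amounts to a membership test on each entry's word. The sketch below shows the idea in Python; filter_by_word_list is a hypothetical helper, and exact, case-sensitive matching is an assumption made for the example.

```python
from typing import Iterable, List, Tuple

Entry = Tuple[str, List[str]]


def filter_by_word_list(entries: Iterable[Entry], word_list: Iterable[str]) -> List[Entry]:
    """Keep only the entries whose word appears in word_list."""
    wanted = set(word_list)  # assumption: exact, case-sensitive match
    return [(word, phones) for word, phones in entries if word in wanted]


# Toy data; the entries are made up for illustration.
dictionary = [
    ("A", ["ah"]),
    ("BOB", ["b", "oh", "b"]),
    ("CAT", ["k", "ae", "t"]),
]
kept = filter_by_word_list(dictionary, ["A", "CAT", "DOG"])
```

Words in the list that have no dictionary entry (DOG here) simply produce no output in this sketch.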

The edit commands provided by HDMAN are as follows



ECRL HTK_V2.1: email [email protected]