Department of Cybernetics @ University of West Bohemia in Pilsen

Structural Metadata Annotation for Czech

Data

Currently, we have two MDE-annotated corpora available for Czech: a broadcast news corpus (26 hours of transcribed speech) and a broadcast conversation corpus (33 hours). An example of a metadata-annotated transcript follows:


Figure 1: Example of an MDE-annotated transcript