Structural Metadata Annotation for Czech


Two MDE-annotated corpora are currently available for Czech: a broadcast news corpus (26 hours of transcribed speech) and a broadcast conversation corpus (33 hours). An example of a metadata-annotated transcript follows:

Figure 1: Example of an MDE-annotated transcript