en

Swiss German to Standard German Corpus

The repository includes two directories: • Test Suites: a small collection of test suites for analyzing syntactic phenomena. The segments are organized by six different syntactic phenomena and there are up to 50 segments in each group. • Test Set (TV shows): an evaluation dataset of recent TV shows consisting of 2,580 segments provided by our partners, SRF and Recapp. The data is further divided into different TV shows such as Der Club, Eco Talk, Gesichter und Geschichten, Schweiz Aktuell, among others. Additionally, we provide links to the original videos.

    Organizational unit
    PASSAGE
    Type
    Dataset
    DOI
    10.26037/yareta:ba4z2sjsz5hjbkedsj5nfsgmlq
    License
    Creative Commons Attribution-NonCommercial 4.0 International
    Keywords
    swiss german, standard german, test-suites, machine translation
Publication date20/04/2023
Retention date07/05/2033
accessLevelPublicAccess levelPublic
SensitivityBlue
duaNoneContract on the use of data
Contributors
  • Mutal, Jonathan David orcid
  • Bouillon, Pierrette
  • Gerlach, Johanna
  • Notter, Florian
  • Imseng, David
53
5
  • Quality (0 Reviews)
  • Usefulness (0 Reviews)

Datacite metadata

Packages information

Similar archives

PASSAGE
OBSOLETE
2021 accessLevelClosed Closed 4.8 MB
All rights reserved by DLCM and the University of GenevaunigeBlack