Swiss German to Standard German Corpus
The repository includes two directories: • Test Suites: a small collection of test suites for analyzing syntactic phenomena. The segments are organized by six different syntactic phenomena and there are up to 50 segments in each group. • Test Set (TV shows): an evaluation dataset of recent TV shows consisting of 2,580 segments provided by our partners, SRF and Recapp. The data is further divided into different TV shows such as Der Club, Eco Talk, Gesichter und Geschichten, Schweiz Aktuell, among others. Additionally, we provide links to the original videos.
- Organizational unit
- PASSAGE
- Type
- Dataset
- DOI
- License
- Creative Commons Attribution-NonCommercial 4.0 International
- Keywords
- swiss german, standard german, test-suites, machine translation
Contributors
- Mutal, Jonathan David
- Bouillon, Pierrette
- Gerlach, Johanna
- Notter, Florian
- Imseng, David
Files
Quality (0 Reviews) Usefulness (0 Reviews)
Datacite metadata
Packages information
Similar archives
PASSAGE