To train transcriptor models that produce robust results, a large and diverse
labeled dataset is required. Finding such data with the necessary
characteristics is a challenging task, especially for languages less popular
than English. Moreover, producing such data requires significant