Scripted dialogues such as movie and TV subtitles constitute a widespread source of training data for conversational nlp models. However, the linguistic characteristics of those dialogues are notably different from those observed in corpora of spontaneous interactions. This difference