As more historical texts are digitized, there is interest in applying natural
language processing tools to these archives. However, the performance of these
tools is often unsatisfactory because of language change and genre differences.
Spelling normalization heuristics are the dominant s