Most authorship attribution studies have focused on works that are available in the language used by the original author (Holmes, 1994; Juola, 2006) because this provides a direct way of examining an author's linguistic habits. Sometimes, however, questions of authorship arise regarding a work only surviving in translation. One example is 'Constance', the putative 'last play' of Oscar Wilde, only existing in a supposed French translation of a lost English original. The present study aims to take a step towards dealing with cases of this kind by addressing two related questions: (1) to what extent are authorial differences preserved in translation; (2) to what extent does this carry-over depend on the particular translator? With these aims, we analysed 262 letters written by Vincent van Gogh and by his brother Theo, dated between 1888 and 1890, each available in the original French and in an English translation. We also performed a more intensive investigation of a subset of this corpus, comprising forty-eight letters, for which two different English translations were obtainable. Using three different indices of discriminability (classification accuracy, Hedge's g, and area under the receiver operating characteristic curve), we found that much of the stylistic discriminability between the two brothers was preserved in the English translations. Subsidiary analyses were used to identify which lexical features were contributing most to interauthor discriminability. Discrimination between translation sources was possible, although less effective than between authors. We conclude that 'handprints' of both author and translator can be found in translated texts, using appropriate techniques.
|Number of pages||19|
|Journal||Literary and Linguistic Computing|
|Publication status||Published - Jun 2014|
Scopus Subject Areas
- Information Systems
- Language and Linguistics
- Linguistics and Language