Native language influence detection for forensic authorship analysis: Identifying L1 Persian bloggers


This article demonstrates and examines the potential use of interlingual identifiers for forensic authorship analysis and native language influence detection (NLID). The work focuses on the practical application of native language (L1) identifiers by a human analyst in investigative situations. Using naturally occurring blog posts in which the writer self-identifies as a native Persian speaker, a human analyst derived and coded sets of non-native features. Two logistic regression models were built: the first was used to select features that distinguish L1 Persian speakers from L1 English speakers in their English writings; the second developed a feature list to contrast L1 languages that are geographically and linguistically close to Persian. The results demonstrate that interlingual identifiers have the potential to aid in determining the L1 of an anonymous author and can be applied by a human analyst to a short, forensically realistic example text. This article demonstrates that NLID is possible beyond the more common computational approaches and can form a useful tool in the forensic linguist’s toolbox. This study is not a statistical validation study; instead, it demonstrates how a sociolinguistic approach can complement more traditional computational approaches.
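As a rough illustration of the first modelling step described in the abstract, the sketch below fits a logistic regression on binary, analyst-coded features to separate L1 Persian from L1 English writers. It is only a minimal sketch under stated assumptions: this record does not list the study's features, data, or software, so the feature names, the toy data, and the helper functions `fit_logistic` and `predict_proba` are all hypothetical, and the plain gradient-descent fit stands in for whatever estimation procedure the authors actually used.

```python
import math

# Hypothetical feature codes (1 = feature observed in the text).
# Names and values are invented for illustration, not taken from the study.
FEATURES = ["article_omission", "preposition_substitution", "tense_shift"]

# Each row: (feature vector, label); label 1 = L1 Persian, 0 = L1 English.
TOY_DATA = [
    ([1, 1, 0], 1),
    ([1, 0, 1], 1),
    ([1, 1, 1], 1),
    ([0, 1, 1], 1),
    ([0, 0, 0], 0),
    ([0, 1, 0], 0),
    ([0, 0, 1], 0),
    ([1, 0, 0], 0),
]


def sigmoid(z):
    """Map a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(weights, bias, x):
    """Estimated probability that feature vector x comes from an L1 Persian writer."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


def fit_logistic(data, epochs=2000, lr=0.1, l2=0.01):
    """Fit weights and bias by batch gradient descent on the log-loss.

    A small L2 penalty on the weights keeps them finite on separable toy data.
    """
    n = len(data)
    weights = [0.0] * len(data[0][0])
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, y in data:
            error = predict_proba(weights, bias, x) - y
            for j, xi in enumerate(x):
                grad_w[j] += error * xi
            grad_b += error
        weights = [w - lr * (g / n + l2 * w) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


weights, bias = fit_logistic(TOY_DATA)
for name, w in zip(FEATURES, weights):
    # A positive weight means the feature raises the odds of the L1 Persian label.
    print(f"{name}: {w:.2f}")
```

In a sketch like this, the sign and size of each fitted weight give a simple basis for deciding which coded features to keep, which is the sense of feature selection assumed here. The same setup could in principle be reused for the second model by replacing the L1 English rows with rows from related L1 backgrounds.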

Divisions: College of Business and Social Sciences > School of Social Sciences & Humanities
College of Business and Social Sciences > Aston Institute for Forensic Linguistics
Additional Information: © 2018, Equinox Publishing
Uncontrolled Keywords: Authorship analysis, Linguistic profiling, Native language identification, Native language influence detection, Persian, Linguistics and Language, Law
Publication ISSN: 1748-8893
Last Modified: 11 Mar 2024 08:22
Date Deposited: 29 May 2018 12:15
Related URLs: http://www.scop ... tnerID=8YFLogxK (Scopus URL)
PURE Output Type: Article
Published Date: 2018-09-10
Accepted Date: 2018-04-30
Authors: Perkins, Ria (ORCID Profile 0000-0001-6193-1456)
Grant, Tim (ORCID Profile 0000-0002-5155-8413)



Version: Accepted Version
