Home > Research > Publications & Outputs > Ngrams and Engrams

Links

Text available via DOI:

View graph of relations

Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts. / Franklin, Emma; Oakes, Michael.
In: Corpora, Vol. 11, No. 3, 11.2016, p. 299-341.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Author

Bibtex

@article{789f2add08c046cb98b14db23573f2c6,
title = "Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts",
abstract = "In this paper, we present experiments using the Linguistic Inquiry and Word Count (LIWC) program, a {\textquoteleft}closed-class keyword{\textquoteright} (CCK) analysis and a {\textquoteleft}correspondence analysis{\textquoteright} (CA) to examine whether the Scientology texts of L. Ron Hubbard are linguistically and conceptually like those of other religions. A Kruskal–Wallis test comparing the frequencies of LIWC category words in the Scientology texts and the English translations of the texts of five other religions showed that there were eighteen categories for which the Scientology texts differed from the others, and between one and seventeen for the other religions. In the CCK experiment, keywords typical of each religion were found, both by comparing the religious texts with one another and with the Brown corpus of general English. The most typical keywords were looked up in a concordancer and were manually coded with conceptual tags. The set of categories found for the Scientology texts showed little overlap with those found for the others. Our CA experiments produced fairly clear clusters of texts by religion. Scientology texts were seen at one pole on the first factor, with Christian and Islamic texts at the other. It appears that, in several ways, the Scientology texts are dissimilar to the texts of some of the world's major religions.",
keywords = "Closed-class keywords analysis, Corpora, Correspondence analysis, LIWC, Religion, Religious texts, Scientology, Stylometry",
author = "Emma Franklin and Michael Oakes",
year = "2016",
month = nov,
doi = "10.3366/cor.2016.0098",
language = "English",
volume = "11",
pages = "299--341",
journal = "Corpora",
issn = "1749-5032",
publisher = "Edinburgh University Press",
number = "3",

}

RIS

TY - JOUR

T1 - Ngrams and Engrams

T2 - the use of structural and conceptual features to discriminate between English translations of religious texts

AU - Franklin, Emma

AU - Oakes, Michael

PY - 2016/11

Y1 - 2016/11

N2 - In this paper, we present experiments using the Linguistic Inquiry and Word Count (LIWC) program, a ‘closed-class keyword’ (CCK) analysis and a ‘correspondence analysis’ (CA) to examine whether the Scientology texts of L. Ron Hubbard are linguistically and conceptually like those of other religions. A Kruskal–Wallis test comparing the frequencies of LIWC category words in the Scientology texts and the English translations of the texts of five other religions showed that there were eighteen categories for which the Scientology texts differed from the others, and between one and seventeen for the other religions. In the CCK experiment, keywords typical of each religion were found, both by comparing the religious texts with one another and with the Brown corpus of general English. The most typical keywords were looked up in a concordancer and were manually coded with conceptual tags. The set of categories found for the Scientology texts showed little overlap with those found for the others. Our CA experiments produced fairly clear clusters of texts by religion. Scientology texts were seen at one pole on the first factor, with Christian and Islamic texts at the other. It appears that, in several ways, the Scientology texts are dissimilar to the texts of some of the world's major religions.

AB - In this paper, we present experiments using the Linguistic Inquiry and Word Count (LIWC) program, a ‘closed-class keyword’ (CCK) analysis and a ‘correspondence analysis’ (CA) to examine whether the Scientology texts of L. Ron Hubbard are linguistically and conceptually like those of other religions. A Kruskal–Wallis test comparing the frequencies of LIWC category words in the Scientology texts and the English translations of the texts of five other religions showed that there were eighteen categories for which the Scientology texts differed from the others, and between one and seventeen for the other religions. In the CCK experiment, keywords typical of each religion were found, both by comparing the religious texts with one another and with the Brown corpus of general English. The most typical keywords were looked up in a concordancer and were manually coded with conceptual tags. The set of categories found for the Scientology texts showed little overlap with those found for the others. Our CA experiments produced fairly clear clusters of texts by religion. Scientology texts were seen at one pole on the first factor, with Christian and Islamic texts at the other. It appears that, in several ways, the Scientology texts are dissimilar to the texts of some of the world's major religions.

KW - Closed-class keywords analysis

KW - Corpora

KW - Correspondence analysis

KW - LIWC

KW - Religion

KW - Religious texts

KW - Scientology

KW - Stylometry

U2 - 10.3366/cor.2016.0098

DO - 10.3366/cor.2016.0098

M3 - Journal article

VL - 11

SP - 299

EP - 341

JO - Corpora

JF - Corpora

SN - 1749-5032

IS - 3

ER -