Electronic data

  • accepted_manuscript

    Rights statement: This is an Accepted Manuscript of an article published by Taylor & Francis in Journal of Quantitative Linguistics on 07/10/2016, available online: http://www.tandfonline.com/10.1080/09296174.2016.1226430

    Accepted author manuscript, 659 KB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Sociolinguistic Features for Author Gender Identification: From Qualitative Evidence to Quantitative Analysis.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published
  • Vasiliki Simaki
  • Christina Aravantinou
  • Iosif Mporas
  • Marianna Kondyli
  • Vasileios Megalooikonomou
Journal publication date: 2017
Journal: Journal of Quantitative Linguistics
Issue number: 1
Volume: 24
Number of pages: 20
Pages (from-to): 65-84
Publication Status: Published
Early online date: 7/10/16
Original language: English

Abstract

Theoretical and empirical studies show a strong relationship between social factors and individual linguistic attitudes. Different social categories, such as gender, age, education, profession and social status, are strongly related to the linguistic diversity of people’s everyday spoken and written interaction. In this paper, sociolinguistic studies addressing gender differentiation are reviewed in order to identify how various linguistic characteristics differ between women and men. It is then examined whether and how these qualitative features can become quantitative metrics for the task of gender identification from texts on web blogs. The evaluation results showed that the “syntactic complexity”, “tag questions”, “period length”, “adjectives” and “vocabulary richness” characteristics seem to be significantly distinctive with respect to the author’s gender.
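
As a rough illustration of how qualitative features like those named in the abstract might be turned into numbers, the sketch below computes three simple counts from a text. It is not the authors' method: the abstract does not define its metrics, so everything here is an assumption. "Vocabulary richness" is approximated by a type-token ratio, "period length" is assumed to mean average sentence length in words, and "tag questions" are matched with a deliberately crude, hypothetical regular expression. The function and variable names are invented for this example.

```python
import re

# Hypothetical, deliberately crude pattern for tag questions such as
# ", isn't it?" or ", do you?". Real tag questions are more varied.
TAG_QUESTION = re.compile(
    r",\s*(?:(?:is|are|was|were|do|does|did|have|has|had|could|would|should)"
    r"(?:n't)?|can(?:'t)?|will|won't)"
    r"\s+(?:i|you|he|she|it|we|they)\s*\?",
    re.IGNORECASE,
)


def tokenize(text):
    """Lowercase the text and return runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def split_sentences(text):
    """Split on whitespace that follows '.', '!' or '?' (a naive heuristic)."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def stylometric_features(text):
    """Return three illustrative metrics for one text.

    All three definitions are assumptions made for this sketch, not the
    definitions used in the article.
    """
    tokens = tokenize(text)
    sentences = split_sentences(text)
    return {
        # Assumed proxy for "vocabulary richness": distinct words / all words.
        "type_token_ratio": len(set(tokens)) / len(tokens) if tokens else 0.0,
        # Assumed proxy for "period length": words per sentence.
        "mean_sentence_length": len(tokens) / len(sentences) if sentences else 0.0,
        # Number of matches of the crude tag-question pattern above.
        "tag_questions": len(TAG_QUESTION.findall(text)),
    }


if __name__ == "__main__":
    sample = "It is a nice day, isn't it? I went out. The park was full."
    print(stylometric_features(sample))
```

Feature values of this kind, computed per blog post, could then be compared across author groups or passed to a classifier. Features such as "adjectives" and "syntactic complexity" would additionally require a part-of-speech tagger or parser, which this sketch leaves out.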
