Age Identification of Twitter Users - Research Portal

Linguistics and English Language

Text available via DOI:

https://doi.org/10.1007/978-3-319-75487-1_30
Final published version
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

View graph of relations

Age Identification of Twitter Users: Classification Methods and Sociolinguistic Analysis

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Vasiliki Simaki
Iosif Mporas
Vasileios Megalooikonomou

More...

Publication date	2016
Host publication	Computational Linguistics and Intelligent Text Processing : 17th International Conference, CICLing 2016, Konya, Turkey, April 3–9, 2016, Revised Selected Papers, Part II
Editors	A. Gelbukh
Place of Publication	Cham
Publisher	Springer
Pages	385-395
Number of pages	11
ISBN (electronic)	9783319754871
ISBN (print)	9783319754864
<mark>Original language</mark>	English

Publication series

Name	Lecture Notes in Computer Science
Publisher	Springer
Volume	9624

Abstract

In this article, we address the problem of age identification of Twitter users, after their online text. We used a set of text mining, sociolinguistic-based and content-related text features, and we evaluated a number of well-known and widely used machine learning algorithms for classification, in order to examine their appropriateness on this task. The experimental results showed that Random Forest algorithm offered superior performance achieving accuracy equal to 61%. We ranked the classification features after their informativity, using the ReliefF algorithm, and we analyzed the results in terms of the sociolinguistic principles on age linguistic variation.

Research

Links

Text available via DOI:

Age Identification of Twitter Users: Classification Methods and Sociolinguistic Analysis

Publication series

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us