Home > Research > Publications & Outputs > Automatic Estimation of Web Bloggers’ Age Using...

Links

Text available via DOI:

View graph of relations

Automatic Estimation of Web Bloggers’ Age Using Regression Models

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNChapter (peer-reviewed)peer-review

Published
Close
Publication date2015
Host publicationSpeech and Computer: SPECOM 2015
EditorsA. Ronzhin, R. Potapova, N. Fakotakis
Place of PublicationCham
PublisherSpringer
Pages113-120
Number of pages8
ISBN (electronic)9783319231327
ISBN (print)9783319231310
<mark>Original language</mark>English

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume9319

Abstract

In this article, we address the problem of automatic age estimation of web users based on their posts. Most studies on age identification treat the issue as a classification problem. Instead of following an age category classification approach, we investigate the appropriateness of several regression algorithms on the task of age estimation. We evaluate a number of well-known and widely used machine learning algorithms for numerical estimation, in order to examine their appropriateness on this task. We used a set of 42 text features. The experimental results showed that the Bagging algorithm with RepTree base learner offered the best performance, achieving estimation of web users’ age with mean absolute error equal to 5.44, while the root mean squared error is approximately 7.14.