Final published version
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
}
TY - GEN
T1 - Using Sociolinguistic Inspired Features for Gender Classification of Web Users
AU - Simaki, Vasiliki
AU - Aravantinou, Christina
AU - Mporas, Iosif
AU - Megalooikonomou, Vasileios
PY - 2015
Y1 - 2015
N2 - In this article we present a methodology for classification of text from web authors, using sociolinguistic inspired text features. The proposed methodology uses a baseline text mining based feature set, which is combined with text features that quantify results from theoretical and sociolinguistic studies. Two combination approaches were evaluated and the evaluation results indicated a significant improvement in both combination cases. For the best performing combination approach the accuracy was 84.36%, in terms of percentage of correctly classified web posts.
AB - In this article we present a methodology for classification of text from web authors, using sociolinguistic inspired text features. The proposed methodology uses a baseline text mining based feature set, which is combined with text features that quantify results from theoretical and sociolinguistic studies. Two combination approaches were evaluated and the evaluation results indicated a significant improvement in both combination cases. For the best performing combination approach the accuracy was 84.36%, in terms of percentage of correctly classified web posts.
KW - text classification algorithms
KW - sociolinguistics
KW - gender identification
U2 - 10.1007/978-3-319-24033-6_66
DO - 10.1007/978-3-319-24033-6_66
M3 - Conference contribution/Paper
SN - 9783319240329
T3 - Lecture Notes in Computer Science
SP - 587
EP - 594
BT - Proceedings of the 18th International Conference of Text, Speech and Dialogue
A2 - Kral, P.
A2 - Matousek, V.
PB - Springer
CY - Cham
ER -