Children Online: A survey of child language and CMC corpora

Computing and Communications

Associated organisational units

Electronic data

Article (submitted version)
Rights statement: © 2012 John Benjamins This article has been published in International Journal of Corpus Linguistics, 17:4 2012. The publisher should be contacted for permission to re-use the material in any form.
Submitted manuscript, 806 KB, PDF document

Text available via DOI:

https://doi.org/10.1075/ijcl.17.4.01bar
Final published version

Keywords

child language, CMC, survey

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Children Online: A survey of child language and CMC corpora. / Baron, Alistair ; Rayson, Paul; Greenwood, Phil et al.
In: International Journal of Corpus Linguistics, Vol. 17, No. 4, 2012, p. 443-481.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Harvard

Baron, A , Rayson, P, Greenwood, P, Walkerdine, J & Rashid, A 2012, 'Children Online: A survey of child language and CMC corpora', International Journal of Corpus Linguistics, vol. 17, no. 4, pp. 443-481. https://doi.org/10.1075/ijcl.17.4.01bar

APA

Baron, A., Rayson, P., Greenwood, P., Walkerdine, J., & Rashid, A. (2012). Children Online: A survey of child language and CMC corpora. International Journal of Corpus Linguistics, 17(4), 443-481. https://doi.org/10.1075/ijcl.17.4.01bar

Vancouver

Baron A , Rayson P, Greenwood P, Walkerdine J, Rashid A. Children Online: A survey of child language and CMC corpora. International Journal of Corpus Linguistics. 2012;17(4):443-481. doi: 10.1075/ijcl.17.4.01bar

Author

Baron, Alistair ; Rayson, Paul ; Greenwood, Phil et al. / Children Online: A survey of child language and CMC corpora. In: International Journal of Corpus Linguistics. 2012 ; Vol. 17, No. 4. pp. 443-481.

Bibtex

@article{06d6723ca9014e429be46d3f08fbf11e,

title = "Children Online: A survey of child language and CMC corpora",

abstract = "The collection of representative corpus samples of both child language and online (CMC) language varieties is crucial for linguistic research that is motivated by applications to the protection of children online. In this paper, we present an extensive survey of corpora available for these two areas. Although a significant amount of research has been undertaken both on child language and on CMC language varieties, a much smaller number of datasets are made available as corpora. Especially lacking are corpora which match requirements for verifiable age and gender metadata, although some include self-reported information, which may be unreliable. Our survey highlights the lack of corpus data available for the intersecting area of child language in CMC environments. This lack of available corpus data is a significant drawback for those wishing to undertake replicable studies of child language and online language varieties.",

keywords = "child language, CMC, survey",

author = "Alistair Baron and Paul Rayson and Phil Greenwood and James Walkerdine and Awais Rashid",

note = "{\textcopyright} 2012 John Benjamins This article has been published in International Journal of Corpus Linguistics, 17:4 2012. The publisher should be contacted for permission to re-use the material in any form.",

year = "2012",

doi = "10.1075/ijcl.17.4.01bar",

language = "English",

volume = "17",

pages = "443--481",

journal = "International Journal of Corpus Linguistics",

issn = "1384-6655",

publisher = "John Benjamins Publishing Company",

number = "4",

}

RIS

TY - JOUR

T1 - Children Online: A survey of child language and CMC corpora

AU - Baron, Alistair

AU - Rayson, Paul

AU - Greenwood, Phil

AU - Walkerdine, James

AU - Rashid, Awais

N1 - © 2012 John Benjamins This article has been published in International Journal of Corpus Linguistics, 17:4 2012. The publisher should be contacted for permission to re-use the material in any form.

PY - 2012

Y1 - 2012

N2 - The collection of representative corpus samples of both child language and online (CMC) language varieties is crucial for linguistic research that is motivated by applications to the protection of children online. In this paper, we present an extensive survey of corpora available for these two areas. Although a significant amount of research has been undertaken both on child language and on CMC language varieties, a much smaller number of datasets are made available as corpora. Especially lacking are corpora which match requirements for verifiable age and gender metadata, although some include self-reported information, which may be unreliable. Our survey highlights the lack of corpus data available for the intersecting area of child language in CMC environments. This lack of available corpus data is a significant drawback for those wishing to undertake replicable studies of child language and online language varieties.

AB - The collection of representative corpus samples of both child language and online (CMC) language varieties is crucial for linguistic research that is motivated by applications to the protection of children online. In this paper, we present an extensive survey of corpora available for these two areas. Although a significant amount of research has been undertaken both on child language and on CMC language varieties, a much smaller number of datasets are made available as corpora. Especially lacking are corpora which match requirements for verifiable age and gender metadata, although some include self-reported information, which may be unreliable. Our survey highlights the lack of corpus data available for the intersecting area of child language in CMC environments. This lack of available corpus data is a significant drawback for those wishing to undertake replicable studies of child language and online language varieties.

KW - child language

KW - CMC

KW - survey

U2 - 10.1075/ijcl.17.4.01bar

DO - 10.1075/ijcl.17.4.01bar

M3 - Journal article

VL - 17

SP - 443

EP - 481

JO - International Journal of Corpus Linguistics

JF - International Journal of Corpus Linguistics

SN - 1384-6655

IS - 4

ER -

Research

Associated organisational units

Electronic data

Links

Text available via DOI:

Keywords