Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Multilingual resources for European languages
T2 - Contributions of the CRATER project
AU - McEnery, T.
AU - Wilson, A.
AU - SáNchez-LeóN, F.
AU - Nieto-Serrano, A.
PY - 1997/11/1
Y1 - 1997/11/1
N2 - Here we describe the contributions of the CRATER project to the development of multilingual resources for European languages. The project has developed a trilingual parallel aligned corpus of one million tokens each of Spanish, French, and English. The corpus has been part-of-speech tagged and lemmatized. Tools for the alignment of multi-lingual corpora at the sentence and word levels hae been developed, which are of general significance to multilingual corpus linguistics. The Xerox part-of-speech tagger has also been retrained for Spanish, with important findings for part-of-speech tagging generally.
AB - Here we describe the contributions of the CRATER project to the development of multilingual resources for European languages. The project has developed a trilingual parallel aligned corpus of one million tokens each of Spanish, French, and English. The corpus has been part-of-speech tagged and lemmatized. Tools for the alignment of multi-lingual corpora at the sentence and word levels hae been developed, which are of general significance to multilingual corpus linguistics. The Xerox part-of-speech tagger has also been retrained for Spanish, with important findings for part-of-speech tagging generally.
U2 - 10.1093/llc/12.4.219
DO - 10.1093/llc/12.4.219
M3 - Journal article
AN - SCOPUS:84933481517
VL - 12
SP - 219
EP - 226
JO - Literary and Linguistic Computing
JF - Literary and Linguistic Computing
SN - 0268-1145
IS - 4
ER -