Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - An evaluation of MorphInd's morphological annotation scheme for Indonesian
AU - Prihantoro, Prihantoro
PY - 2021/8/31
Y1 - 2021/8/31
N2 - MorphInd2 (Larasati et al., 2011) is a state-of-the-art morphological analyser for Indonesian. To date, there has not been any comprehensive evaluation of the morphological annotation scheme which MorphInd implements. My evaluation of this annotation scheme reveals a number of significant drawbacks. Some analytical features encoded in MorphInd's tagset seem not to reflect features actually present in Indonesian morphology, while certain common features in the analysis of Indonesian are absent. Likewise, the Part of Speech (POS) hierarchy in the MorphInd tagset does not reflect the usual POS hierarchy used by Indonesian reference grammars. Moreover, the MorphInd output does not link morphological tags to the corresponding morpheme. Finally, a number of issues which might problematise text/corpus querying in the annotation's layout are observable, particularly relating to affixes, reduplication, and the affix-reduplication interface.
AB - MorphInd2 (Larasati et al., 2011) is a state-of-the-art morphological analyser for Indonesian. To date, there has not been any comprehensive evaluation of the morphological annotation scheme which MorphInd implements. My evaluation of this annotation scheme reveals a number of significant drawbacks. Some analytical features encoded in MorphInd's tagset seem not to reflect features actually present in Indonesian morphology, while certain common features in the analysis of Indonesian are absent. Likewise, the Part of Speech (POS) hierarchy in the MorphInd tagset does not reflect the usual POS hierarchy used by Indonesian reference grammars. Moreover, the MorphInd output does not link morphological tags to the corresponding morpheme. Finally, a number of issues which might problematise text/corpus querying in the annotation's layout are observable, particularly relating to affixes, reduplication, and the affix-reduplication interface.
KW - Annotation
KW - Indonesian
KW - MorphInd
KW - Morphology
KW - Morphosyntactic
U2 - 10.3366/COR.2021.0221
DO - 10.3366/COR.2021.0221
M3 - Journal article
VL - 16
SP - 287
EP - 299
JO - Corpora
JF - Corpora
SN - 1749-5032
IS - 2
ER -