I need to build a part-of-speech tagger for a new language
(for which there is no PoS-tagger available). For this, I need
to hand-annotate a minimum amount of text. I would like to know
how much text (minimum of course) I need to hand-tag. Also,
for this much text, what is the reasonable size of the tagset
used for annotation?
This archive was generated by hypermail 2b29 : Tue Nov 12 2002 - 03:06:26 MET