Corpora: Book: Syntactic Wordclass Tagging

Jean Veronis
Fri, 17 Sep 1999 11:33:15 +0200

Volume 9
Series editors: Nancy Ide and Jean Véronis

Syntactic Wordclass Tagging

edited by
Hans van Halteren

Dept. of Language and Speech
University of Nijmegen, The Netherlands

This book provides an in-depth discussion of the field of syntactic
wordclass tagging, i.e. the annotation of the words in a text with tags
indicating their syntactic properties. Represented are the viewpoints of
the two main groups who take an interest in tagging: the users of tagged
text and the developers of tagging software.

The book starts out by examining the field primarily from the user's point
of view. After a brief historical overview, the nature and uses of tagging
are discussed and current practice is described. Here the user will find
out what tagging is, and the software developer what the user wants.

The book then switches to the other point of view and continues with a
detailed explanation of the most common computational techniques for
automatically tagging large amounts of text. Here the software developer
finds the information needed to implement a tagger, while the user
gains insight into the possibilities and limitations of automatic
tagging and how computer-provided tags should be interpreted.

Kluwer Academic Publishers, Dordrecht
Hardbound, ISBN 0-7923-5896-1
August 1999, 300 pp.
NLG 280.00 / USD 149.00 / GBP 93.00


Contents and Contributors

Contributing Authors.

Part I: The User's View.

1. Orientation; A. Voutilainen.
2. A Short History of Tagging; A. Voutilainen.
3. The Use of Tagging; G. Leech, N. Smith.
4. Tagsets; J. Cloeren.
5. Standards for Tagsets; G. Leech, A. Wilson.
6. Performance of Taggers; H. van Halteren.
7. Selection and Operation of Taggers; H. van Halteren.

Part II: The Implementer's View.

8. Automatic Taggers: An Introduction; H. van Halteren, A. Voutilainen.
9. Tokenization; G. Grefenstette.
10. Lexicons for Tagging; A. Schiller, L. Karttunen.
11. Standardization in the Lexicon; M. Monachini, N. Calzolari.
12. Morphological Analysis; K. Oflazer.
13. Tagging Unknown Words; E. Brill.
14. Hand-Crafted Rules; A. Voutilainen.
15. Corpus-Based Rules; E. Brill.
16. Hidden Markov Models; M. El-Beze, B. Merialdo.
17. Machine Learning Approaches; W. Daelemans.

Appendix A: Example tagsets.



