I am a student in the Department of Computational Linguistics at the University of Erlangen-Nuernberg, and I am preparing my MA thesis.
My master's project is to develop, in Java, a query language for a corpus database. I would therefore like to know which kinds of linguistic data can be queried in, or retrieved from, a corpus database.
I have read about SARA (the query language of the BNC), Workbench, Cue and other corpus tools. These query languages mainly offer searches for words, phrases, patterns (using wildcards) and POS tags, as well as frequency lists. What else can a linguist search for in a corpus database?
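To make the query types listed above concrete, here is a minimal sketch in Java of how I picture them over a small in-memory corpus. Everything in it is an assumption made for illustration: the class and method names (`CorpusQuerySketch`, `find`, `findPhrase`, `frequencyList`) are hypothetical, the wildcard syntax (`*` for any run of characters, `?` for exactly one) and the Penn-style tags are my own choices, and none of it reflects how SARA, Workbench or Cue actually work.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Pattern;

// Hypothetical sketch: word, wildcard, POS, phrase and frequency queries
// over a tiny in-memory corpus of word/POS pairs.
final class CorpusQuerySketch {

    /** One corpus position: a word form and its part-of-speech tag. */
    static final class Token {
        final String word;
        final String pos;
        Token(String word, String pos) { this.word = word; this.pos = pos; }
    }

    /** Translates a wildcard pattern (* = any run, ? = one character) into a regex. */
    static Pattern wildcardToRegex(String wildcard) {
        StringBuilder sb = new StringBuilder();
        for (char c : wildcard.toCharArray()) {
            if (c == '*') sb.append(".*");
            else if (c == '?') sb.append('.');
            else sb.append(Pattern.quote(String.valueOf(c)));
        }
        return Pattern.compile(sb.toString(), Pattern.CASE_INSENSITIVE);
    }

    /** Positions whose word matches the wildcard; a null posTag means "any tag". */
    static List<Integer> find(List<Token> corpus, String wildcard, String posTag) {
        Pattern p = wildcardToRegex(wildcard);
        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i < corpus.size(); i++) {
            Token t = corpus.get(i);
            if (p.matcher(t.word).matches() && (posTag == null || posTag.equals(t.pos))) {
                hits.add(i);
            }
        }
        return hits;
    }

    /** Start positions of a phrase given as one wildcard pattern per word. */
    static List<Integer> findPhrase(List<Token> corpus, String... wildcards) {
        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i + wildcards.length <= corpus.size(); i++) {
            boolean ok = true;
            for (int j = 0; j < wildcards.length && ok; j++) {
                ok = wildcardToRegex(wildcards[j]).matcher(corpus.get(i + j).word).matches();
            }
            if (ok) hits.add(i);
        }
        return hits;
    }

    /** Frequency list of lower-cased word forms, sorted alphabetically. */
    static Map<String, Integer> frequencyList(List<Token> corpus) {
        Map<String, Integer> freq = new TreeMap<>();
        for (Token t : corpus) {
            freq.merge(t.word.toLowerCase(Locale.ROOT), 1, Integer::sum);
        }
        return freq;
    }

    /** Invented sample data: "The cat sat on the mat" with assumed Penn-style tags. */
    static List<Token> sampleCorpus() {
        return Arrays.asList(
            new Token("The", "DT"), new Token("cat", "NN"), new Token("sat", "VBD"),
            new Token("on", "IN"), new Token("the", "DT"), new Token("mat", "NN"));
    }

    public static void main(String[] args) {
        List<Token> corpus = sampleCorpus();
        System.out.println(find(corpus, "?at", null));       // wildcard only
        System.out.println(find(corpus, "?at", "NN"));       // wildcard restricted by POS
        System.out.println(findPhrase(corpus, "the", "*at")); // two-word phrase
        System.out.println(frequencyList(corpus));            // frequency list
    }
}
```

A real corpus database would of course use an index instead of scanning every token, so this only shows the query types I already know about, not a design.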
I would be glad to receive your suggestions. Thank you for your help.