I have a question.
There is a term often used in CL - "balanced corpus". Can anybody
tell or point me - is (if) this term strictly defined - and what is
Does the definition "balanced corpus" include the properties
("balance" of them) of the inner structure of a corpus - or there is a
balance of user's demands to its contents?
Or - is it "balance" of both - outer and inner conditions?
YS Vladimir Rykov, PhD in Computational Linguistics
OUR INSTITUTE WEB PAGE: Linguistic Institute
WWW.GOL.RU/~iling 1/12 B.Kislovsky per., Moscow, 103009
M_M_M_M_M_M_M_M_M_M_M_M KREMLIN WALL IS WHERE YOU MAKE IT !