('binary' encoding is not supported, stored as-is)
In message <3DFF01D0.7677.48A18E@localhost> "Anne Harrap" <firstname.lastname@example.org> writes:
> Does anyone else get a lot of duplicated entries when doing a
> concordance in Wordsmith?
> Not sure if this is a bug or we are doing something wrong...
I had this problem some time ago. One possible reason is that the text file(s) loaded into the concordancer is tab-delimited (e.g. the concordances downloaded from the BNCweb). I am not sure why WordWmith is incompatible with this type of text, but it is.
If you are working with concordances downloaded from the BNCweb, another problem is that some lines may contain incomplete POS tag marks (i.e. only tag start mark < or tag stop mark >). In this case, if you do not select the box for 'Tags to ignore' (WordSmith settings), there will also be a problem.
Dr Zhonghua Xiao
Department of Lingusitics
This archive was generated by hypermail 2b29 : Wed Dec 18 2002 - 15:20:10 MET