Thanks for your reply. By the term 'discourse markers' (aka. 'cue
phrases'), I mean words like 'and', 'however', 'on the one hand....on
the other hand', etc,. In other words the types of marker that make
explicit the underlying relation between two spans of text, for example:
John is poor but he is happy.
These propositions are linked by a DENIAL OF EXPECTATION / ANTITHESIS
relation: the reader expects John to be unhappy because he is poor, the
discourse marker 'but' is used to signal to the reader that, although
being poor normally makes someone unhappy, in this case the reverse is
true. Thus, the reader's expectation is denied.
The problem I'm having with the corpus is that some lexis are used as
both discourse markers and as other parts of speech, for example:
He is SO funny (adverb not discourse marker)
I wanted to be funny SO I started telling jokes. (discourse marker of
I have been doing this PhD SINCE the beginning of October. (temporal
adverb not discourse marker)
I decided to do linguistics SINCE I am interested in languages and how
we use them. (discourse marker of 'justification')
Mary PROVIDED THAT food for the party. (verb+determiner not discourse
The party will go ahead PROVIDED THAT Mary brings the food. (discourse
marker of 'condition')
I've included some more examples of what I call 'discourse markers'
below, I hope this makes it more clear, It would be great to hear from
you if you know of any corpora that is tagged for such words, Sarah.
all the same
as a consequence
as long as
at the moment when
for another thing
in that case