>Sujet : Corpora: Welsh Corpus - DTD?
>Date : 9/6/00 3:42:49 W. Europe Daylight Time
>From: email@example.com (Peter Littlechild)
>As part of an initiative to create a corpus of written Welsh
>I'm investigating means to create a machine-processable form
>of the Welsh Bible. This text is heavily enriched with
>cross-references, markers indicating dubious etymologies,
>missing words, and other things I haven't fathomed yet. It's
>the perfect justification for using SGML/XML.
>Does anyone know of previous work on marked-up versions of
>bibles? Are there special DTDs or some appropriate subset of
>the TEI DTD, for example?
People who have done work on Biblical corpora for language engineering
purposes and who would be best qualified to answer your question are:
Mark Olsen, ARTFL Project, University of Chicago
Please note that on the ARTFL project site indicated above, there is the
"The Humanities Text Initative no longer distributes the the SGML version of
the Luther Bible, on request of the copyright holder."
This implies that someone has already developed a DTD. Hopefully Mark Olsen
can be of assistance.
The Polyglot Bible project
Also note that several Bible corpora language engineering reference articles
were indicated in the Corpora list messages dated:
Date: Tue, 2 Mar 1999 11:34:37 -0500 (EST)
From: Philip Resnik
Subject: Re: Corpora: corpora history revisited
Date: Tue, 02 Mar 1999 16:57:35 +0100
From: Jeff ALLEN
Subject: Corpora: corpora history revisited - Bible corpus
The Corpora list messages above can be accessed at:
Mason Integrated Technologies Ltd (MIT2)
P.O. Box 181015
Boston, MA 02118 USA
(617) 247-8885 (office & answering machine)
(617) 262-8923 (FAX)
Mason Integrated Technologies Ltd Home Page:
MIT2 Products Page:
Meet the MIT2 Team:
MIT2's Strategic Partners:
This archive was generated by hypermail 2b29 : Thu Sep 14 2000 - 12:25:11 MET DST