![]() |
ARCADE
Tagging guidelines for word alignment Version 1.0 - Jean Véronis, April 26, 1998 |
| These guidelines are under
development for the ARCADE
project's word
track.They use as a model and starting point the Blinker project's
Style Guide (Melamed, 1998),
with Dan Melamed's permission. A number of modifications and adaptations
have been added because of the different nature of the task: the Blinker
project aimed at aligning all words between the two parallel texts, whereas,
at least for this phase, the ARCADE project needs only alignment of a given
set of words.
As in the Blinker project, these guidelines are being developped in an interactive way: this draft will be revised as the annotation process goes on and new problems are found. We will probably not, however, have the resources to apply a scheme as elaborate as the one used in Blinker where at least four different annotators worked on each fragment. However, the task tackled here is much simpler, and the cases of disagreement should be less numerous. The current goal is annotation of each fragment by two annotators. Note: this document can be
downloaded as a whole as a zip
file (75 ko), which can be useful since it contains large files for
which web reading can be slow.
|