ARCADE     
Tagging guidelines for word alignment 

Version 1.0 - Jean Véronis, April 26, 1998

 

About these guidelines

These guidelines are under development for the ARCADE project's word track.They use as a model and starting point the Blinker project's Style Guide (Melamed, 1998), with Dan Melamed's permission. A number of modifications and adaptations have been added because of the different nature of the task: the Blinker project aimed at aligning all words between the two parallel texts, whereas, at least for this phase, the ARCADE project needs only alignment of a given set of words. 

As in the Blinker project, these guidelines are being developped in an interactive way: this draft will be revised as the annotation process goes on and new problems are found. We will probably not, however, have the resources to apply a scheme as elaborate as the one used in Blinker where at least four different annotators worked on each fragment. However, the task tackled here is much simpler, and the cases of disagreement should be less numerous. The current goal is annotation of each fragment by two annotators. 

Note: this document can be downloaded as a whole as a zip file (75 ko), which can be useful since it contains large files for which web reading can be slow. 
 

 

Contents