ARCADE    
Word track - Sentence alignment
 

Location and format

The alignment of sentences containing the source word and their translation is available on the arcftp site (directory word/align) or from the romanftp site (directory fr/align), file alignsent.txt.gz (access restricted to participants). 

The file has been obtanied by automatic means, followed only by a very rough checking. It is possible that it still contains a few errors. Thanks for reporting them.

If you do use this information in your system, please mention it for fair evaluation. 

Format is as follows: 
 
Cat grammatical category of word
NoOccur occurrence number
Lemma lemma of occurrence
Occur exact form of occurrence
SceSent source sentences
TarSent target sentence
 
 
The combination of the two fist columns uniquely identifies each occurrence. 
 

Example:  
 
 
Cat NoOccur Lemma Occur Source Target
A 1 biologique biologiques En outre, dans le cadre de la directive du Conseil 80/1107/CEE  ( 5 ) , modifiée par la directive du Conseil 88/642/CEE  ( 6 )... Moreover, within the framework of Council Directive 80/1107/EEC  ( 5 )  as amended by Council Directive 88/642/EEC  ( 6 )…
A 2 biologique biologique La Commission est-elle disposée à insister auprès des autorités grecques sur la nécessité de procéder à l'implantation, dans cette région, d'une station d'épuration biologique? Does the Commission intend to draw the attention of the Greek authorities to the need to set up a biological purification plant in this region?