MULTEXTLexical specifications : application to Italian
Document MULTEXT-LEX 1. Title page. Version 0.3. Last modified 12 december 1996.




logo

Lexical specifications : application to Italian

Nicoletta Calzolari and Monica Monachini - PISA-ILC







In this section the MULTEXT set of lexicon specifications is applied to Italian (Calzolari and Monachini 1994). The language-specific values added for Italian are highligthed with the code 'l-spec'.

Furthermore, a tagset for Italian is proposed. This is based on the tagset used by our tagger, but also takes into account the criteria expressed above for the construction of the tagset, and the results of a first cycle of experimentations on the MULTEXT tagger.

A table containing the the translation of the tag into the regular expression and its definition is presented, i.e.

TAG            Reg.expr.            Definition
NMS            Ncms-                Common noun, masc.sing.

A table displaying the mapping between lexicon specifications and corpus tags is provided, along with an examplification.


Contents


1. Italian Nouns (N)

1.1. Lexicon

============ =========== =========== =====
Attribute      Value       Example    Code
============ =========== =========== =====
Type           common     libro        c
               proper     Gianni       p
------------ ----------- ----------- -----
Gender         masculine  uomo         m
               feminine   donna        f
l-spec         common     insegnante   c
------------ ----------- ----------- -----
Number         singular   uomini       s
               plural     donne        p
l-spec         invariant  attivitá     n
------------ ----------- ----------- -----
Case           (n.a.)     (n.a.)       -
============ =========== =========== =====s



1.2. Corpus

TagRegular expressionDefinition
NMSNcms-Common noun, masc. sing.
NMPNcmp-Common noun, masc. plur.
NMNNcmn-Common noun, masc. invar.
NFSNcfs-Common noun, fem. sing.
NFPNcfp-Common noun, fem. plur.
NFNNcfn-Common noun, fem. invar.
NNSNccs-Common noun, comm. sing.
NNPNccp-Common noun, comm. plur.
NNNNccn-Common noun, comm. invar.
NP Np..-Proper noun



1.3. Combinations

LexiconCorpusExample
Ncms-NMSlibro
Ncmp-NMPlibri
Ncmn-NMNre, caffé (il/i)
Ncfs-NFScasa
Ncfp-NFPcase
Ncfn-NFNattivitá (la/le)
Nccs-NNSinsegnante (un/una)
Nccp-NNPinsegnanti (gli/le)
Nccn-NNNsosia (il/la, i/le)
Np..-NPMario, Maria, Borboni



1.4. Observations

The idea of the French group to tag Proper Nouns simply with NP (collapsing the information on gender and number) seems the best solution.



Compare with :


2. Italian Verbs (V)

2.1. Lexicon

============ =========== =========== =====
Attribute      Value       Example    Code
============ =========== =========== =====
Type          main         amare       m
              auxiliary    avere       a
------------ ----------- ----------- -----
Mood/VForm    indicative   amo         i
              subjunctive  ami         s
              imperative   ama         m
              conditional  amerei      c
              infinitive   amare       n
              participle   amato       p
              gerund       amando      g
------------ ----------- ----------- -----
Tense         present      amo         p
              imperfect    amavo       i
              future       ameró       f
              past         amai        s
------------ ----------- ----------- -----
Person        first        amo         1
              second       ami         2
              third        ama         3
------------ ----------- ----------- -----
Number        singular     amo         s
              plural       amiamo      p
------------ ----------- ----------- -----
rGender       masculine    amato       m
              feminine     amata       f
l-spec        common       amante      c
============ =========== =========== =====



2.2. Corpus

TagRegular expressionDefinition
VAS1IPVaip1s- Aux. Verb, 1st pers.sing., pres.indic.
VAS2IPVaip2s-Aux. Verb, 2nd pers.sing., pres.indic.
VAS3IPVaip3s- Aux. Verb, 3rd pers.sing., pres.indic.
VAP2IPVaip2p-Aux. Verb, 2nd pers.plur., pres.indic.
VAP3IPVaip3p-Aux. Verb, 3rd pers.plur., pres.indic.
VAP1ICPVa[is]p1p-Aux. Verb, 1stpers.plur.,pres.indic/cong
VAY2IPVaip(1s|3p)-Aux. Verb, 1st sing./3rd plur., pres.indic
VAS1IIVaii1s-Aux. Verb, 1st pers.sing., impf.indic.
VAS2IIVaii2s-Aux. Verb, 2nd pers.sing., impf.indic.
VAS3IIVaii3s-Aux. Verb, 3rd pers.sing., impf.indic.
VAP1IIVaii1p-Aux. Verb, 1st pers.plur., impf.indic.
VAP2IIVaii2p-Aux. Verb, 2nd pers.plur., impf.indic.
VAP3IIVaii3p-Aux. Verb, 3rd pers.plur., impf.indic.
VAS1IFVaif1s-Aux. Verb, 1st pers.sing., fut. indic.
VAS2IFVaif2s-Aux. Verb, 2nd pers.sing., fut. indic.
VAS3IFVaif3s-Aux. Verb, 3rd pers.sing., fut. indic.
VAP1IFVaif1p-Aux. Verb, 1st pers.plur., fut. indic.
VAP2IFVaif2p-Aux. Verb, 2nd pers.plur., fut. indic.
VAP3IFVaif3p-Aux. Verb, 3rd pers.plur., fut. indic.
VAS1IRVais1s-Aux. Verb, 1st pers.sing., past indic.
VAS2IRVais2s-Aux. Verb, 2nd pers.sing., past indic.
VAS3IRVais3s-Aux. Verb, 3rd pers.sing., past indic.
VAP1IRVais1p-Aux. Verb, 1st pers.plur., past indic.
VAP3IRVais3p-Aux. Verb, 3rd pers.plur., past indic.
VAP2ICRVa(is)|(si)2p-Aux. Verb, 2nd p.pl., past indic./pres.cong
VASXCPVacp.s-Aux. Verb, 1/2/3 p. sing., pres.subj.
VAP2CMPVa[sm]p2p-Aux. Verb, 2nd pers.plur., pres.subj./imper.
VAP3CPVasp3p-Aux. Verb, 3rd pers.plur., pres.subj.
VAS3CIVasi3s-Aux. Verb, 1/2 pers.sing., impf.subj.
VAS3CIVasi3s-Aux. Verb, 3rd pers.sing., impf.subj.
VAP1CIVasi1p-Aux. Verb, 1st pers.plur., impf.subj.
VAP3CIVasi3p-Aux. Verb, 3rd pers.plur., impf.subj.
VAS2MPVamp2s-ux. Verb, 2nd pers.sing., pres.impr.
VAS2MPEVamp2s-yAux. Verb, 2nd pers.sing., pres.impr. + clit.
VAP2MPEVamp2p-yAux. Verb, 2nd pers.plur., pres.impr. + clit.
VAS1DPVacp1s- Aux. Verb, 1st pers.sing., pres.cond.
VAS2DPVacp2s-Aux. Verb, 2nd pers.sing., pres.cond.
VAS3DPVacp3s-Aux. Verb, 3rd pers.sing., pres.cond.
VAP1DPVacp1p-Aux. Verb, 1st pers.plur., pres.cond.
VAP2DPVacp2p-Aux. Verb, 2nd pers.plur., pres.cond.
VAP3DPVacp3p-Aux. Verb, 3rd pers.plur., pres.cond.
VAFVanp---Aux. Verb, infinitive
VAFEVanp--cyAux. Verb, infinitive + clitic
VANSPPVapp-scAux. Verb, comm.sing., pres.part.
VANPPPVapp-pcAux. Verb, comm.plur., pres.part.
VAMSPRVaps-smAux. Verb, masc.sing., past part.
VAMPPRVaps-pm Aux. Verb, masc.plur., past part.
VAFSPRVaps-sfAux. Verb, femm.sing., past part.
VAFPPRVaps-pfAux. Verb, femm.plur., past part.
VAMSPREVaps-smyAux. Verb, masc.sing., past part. + clitic
VAMPPREVaps-pmy Aux. Verb, masc.plur., past part. + clitic
VAFSPREVaps-sfyAux. Verb, femm.sing., past part. + clitic
VAFPPREVaps-pfyAux. Verb, femm.plur., past part. + clitic
VAGVagp---Aux. Verb, gerund
VAGEVagp---yAux. Verb, gerund + clitic
VS1IPVmip1s-Main Verb, 1st pers.sing., pres.indic
VS3IPVmip3s-Main Verb, 3rd pers.sing., pres.indic
VP3IPVmip3p-Main Verb, 3rd pers.plur., pres.indic
VP1ICPVm[is]p1pMain Verb,1stpers.plur.,pres.indic/cong
VP2IMPP Vm([im]p2p-)|(ps-pf)M.V., 2nd pl., pres.indic/imper|pstprt f.pl.
VP2IMPVm([im]p2p)-Main Verb, 2nd pl., pres.indic/imper
VSXICPVm(sp.s)|(ip2s)-M.V., 1/2/3 sg.,pres.subj.|2ndsg. pres.indic.
VS1IMPVm[im]1s-Main Verb, not 1stsg.,pres.indic./imper.
VS2IMPVm[im]p2s-Main Verb, 2nd sg., pres.indic/imper
VP2IMCPPVm([ims]p2p-)|(ps-pf)M.V., 2pl., pr.ind/imp/sub|pst.prt f.pl.
VS1IIVmii1s-Main Verb, 1st pers.sing., impf.indic.
VS2IIVmii2s-Main Verb, 2nd pers.sing., impf.indic.
VS3IIVmii3s-Main Verb, 3rd pers.sing., impf.indic.
VP1IIVmii1p-Main Verb, 1st pers.plur., impf.indic.
VP2IIVmii2p-Main Verb, 2nd pers.plur., impf.indic.
VP3IIVmii3p-Main Verb, 3rd pers.plur., impf.indic.
VS1IFVmif1s-Main Verb, 1st pers.sing., fut. indic.
VS2IFVmif2s-Main Verb, 2nd pers.sing., fut. indic.
VS3IF Vmif3s-Main Verb, 3rd pers.sing., fut. indic.
VP1IFVmif1p- Main Verb, 1st pers.plur., fut. indic.
VP2IFVmif2p-Main Verb, 2nd pers.plur., fut. indic.
VP3IFVmif3p-Main Verb, 3rd pers.plur., fut. indic.
VS1IRVmis1s-Main Verb, 1st pers.sing., past indic.
VS2IRVmis2s-Main Verb, 2nd pers.sing., past indic.
VS3IRVmis3s-Main Verb, 3rd pers.sing., past indic.
VP1IR Vmis1p-Main Verb, 1st pers.plur., past indic.
VP3IRVmis3p-Main Verb, 3rd pers.plur., past indic.
VP2ICRVm(is)|(si)2p-Main Verb, 2nd p.pl., past indic./pres.subj.
VP2CPVmsp2p-Main Verb, 2nd pers.plur., pres.subj. amiate
VP3CPVmsp3p-Main Verb, 3rd pers.plur., pres.subj. amino
VSXCPVmcp.s-Main Verb, 1/2/3 p. sing., pres.subj.
VS3CIVmsi3s-Main Verb, 1/2 pers.sing., impf.subj.
VS3CIVmsi3s-Main Verb, 3rd pers.sing., impf.subj.
VP1CIVmsi1p-Main Verb, 1st pers.plur., impf.subj.
VP3CIVmsi3p- Main Verb, 3rd pers.plur., impf.subj.
VS2MPEVmmp2s-yMain Verb, 2nd pers.sing., pres.impr. + clit.
VP2MPEVmmp2p-y Main Verb, 2nd pers.plur., pres.impr. + clit.
VS1DPVmcp1s-Main Verb, 1st pers.sing., pres.cond.
VS2DP Vmcp2s-Main Verb, 2nd pers.sing., pres.cond.
VS3DPVmcp3s-Main Verb, 3rd pers.sing., pres.cond.
VP1DPVmcp1p-Main Verb, 1st pers.plur., pres.cond.
VP2DPVmcp2p-Main Verb, 2nd pers.plur., pres.cond.
VP3DPVmcp3p- Main Verb, 3rd pers.plur., pres.cond.
VFVmnp---Main Verb, infinitive
VFEVmnp---y Main Verb, infinitive + clitic
VNSPPVmpp-scMain Verb, comm.sing., pres.part.
VNPPPVmpp-pcMain Verb, comm.plur., pres.part.
VMSPRVmps-sm Main Verb, masc.sing., past part.
VMPPRVmps-pmMain Verb, masc.plur., past part.
VFSPRVmps-sfMain Verb, femm.sing., past part.
VFPPRVmps-pfMain Verb, femm.plur., past part.
VMSPREVmps-smyMain Verb, masc.sing., past part. +c
VMPPREVmps-pmyMain Verb, masc.plur., past part. +c
VFSPRE Vmps-sfyMain Verb, femm.sing., past part. +c
VFPPREVmps-pfyMain Verb, femm.plur., past part. +c
VG Vmgp---Main Verb, gerund
VGEVmgp---yMain Verb, gerund + clitic
VA1PVa[iscm][pifs]1p--Aux. verb, 1st person plur.
VA1S Va[iscm][pifs]1s--Aux. verb, 1st person sing.
VA2PVa[iscm][pifs]2p--Aux. verb, 2nd person plur.
VA2SVa[iscm][pifs]2s--Aux. verb, 2nd person sing.
VA3PVa[iscm][pifs]3p--Aux. verb, 3rd person plur.
VA3SVa[iscm][pifs]3s--Aux. verb, 3rd person sing.
VAFPPSVaps-pf-Aux. verb, fem. plur., past part.
VAFSPSVaps-sf-Aux. verb, fem. sing., past part.
VAMPPSVaps-pm-Aux. verb, masc. plur., past part.
VAMSPSVaps-sm-Aux. verb, masc. sing., past part.
VANVanp----Aux. verb, infinitive
VAFEVanp----Aux. Verb, infinitive + enclitic
VAGVagp----Aux. Verb, gerund
VAGEVagp----Aux. Verb, gerund + enclitic
VAPPVapp-..-Aux. verb, pres. participle
V1PVm[iscm][pifs]1p--Main Verb, 1st person plur.
V1S Vm[iscm][pifs]1s--Main Verb, 1st person sing.
V2PVm[iscm][pifs]2p--Main Verb, 2nd person plur.
V2S Vm[iscm][pifs]2s--Main Verb, 2nd person sing.
V3PVm[iscm][pifs]3p--Main Verb, 3rd person plur.
V3S Vm[iscm][pifs]3s--Main Verb, 3rd person sing.
VFPPS Vmps-pf-Main Verb, fem. plur., past part.
VFSPS Vmps-sf-Main Verb, fem. sing., past part.
VMPPS Vmps-pm-Main Verb, masc. plur., past part.
VMSPS Vmps-sm-Main Verb, masc. plur., past part.
VFVmnp----Main Verb, infinitive
VFEVmnp----yMain Verb, infinitive + enclitic
VGVmgp----Main Verb, gerund
VGEVmgp----yMain Verb, gerund + enclitic
VPPVmpp-..-Main Verb, pres. participle



2.3. Combinations

LexiconCorpusExample
Vaip1s-VAS1IPho
Vaip2s-VAS2IPhai, sei
Vaip3s-VAS3IPha, e'
Vaip2p-VAP2IPavete, siete
Vaip3p-VAP3IPhanno
Vaip1p-VAP1ICPabbiamo, siamo
Vaip1s-VAY2IPsono
Vaip3p-VAY2IPsono
Vaii1s-VAS1IIavevo, ero
Vaii2s-VAS2IIavevi, eri
Vaii3s-VAS3IIaveva, era
Vaii1p-VAP1IIavevamo, eravamo
Vaii2p-VAP2IIavevate, eravate
Vaii3p-VAP3IIavevano, erano
Vaif1s-VAS1IFavro', saro'
Vaif2s-VAS2IFavrai, sarai
Vaif3s-VAS3IFavra', sara'
Vaif1p-VAP1IFavremo, saremo
Vaif2p-VAP2IFavrete, sarete
Vaif3p-VAP3IFavranno, saranno
Vais1s-VAS1IRebbi, fui
Vais2s-VAS2IRavesti, fosti
Vais3s-VAS3IRebbe, fu
Vais1p-VAP1IRavemmo, fummo
Vais3p-VAP3IRebbero, furono
Vais2s-VAP2ICRaveste, foste
Vasp1s-VASXCPabbia, sia
Vasp2s-VASXCPabbia, sia
Vasp3s-VASXCPabbia, sia
Vasp1p-VAP1ICPabbiamo, siamo
Vasp2p-VAP2CMPabbiate, siate
Vasp3p-VAP3CPabbiano, siano
Vasi1s-VAS3CIavessi, fossi
Vasi2s-VAS3CIavessi, fossi
Vasi3s-VAS3CIavesse, fosse
Vasi1p-VAP1CIavessimo, fossimo
Vasi2s-VAP2ICRaveste, foste
Vasi3p-VAP3CIavessero, fossero
Vamp2s-VAS2MPabbi, sii
Vamp2s-yVAS2MPEabbilo, siilo
Vamp2p-VAP2CMPabbiate, siate
Vamp2p-yVAP2MPEabbiatelo, siatelo
Vacp1s-VAS1DPavrei, sarei
Vacp2s-VAS2DPavresti, saresti
Vacp3s-VAS3DPavrebbe, sarebbe
Vacp1p-VAP1DPavremmo, saremmo
Vacp2p-VAP2DPavreste, sareste
Vacp3p-VAP3DPavrebbero, sarebbero
Vanp---VAFavere, essere
Vanp--cyVAFEaverlo, esserlo
Vapp-scVANSPPavente, essente
Vapp-pcVANPPPaventi, essenti
Vaps-smVAMSPRavuto, stato
Vaps-pmVAMPPRavuti, stati
Vaps-sfVAFSPRavuta, stata
Vaps-pfVAFPPRavute, state
Vaps-smyVAMSPREavutolo
Vaps-pmyVAMPPREavutili
Vaps-sfyVAFSPREavutala
Vaps-pfyVAFPPREavuteli
Vagp---VAGavendo, essendo
Vagp---yVAGEavendolo, essendolo
Vmip1s-VS1IPamo, leggo, servo
Vmip2s-VSXICPami
Vmip2s-VS2IMPleggi, servi
Vmip3s- VS1IMPama
Vmip3s-VS3IPlegge, serve
Vmip1p-VP1ICPamiamo, leggiamo, serviamo
Vmip2p-VP2IMPPamate, servite
Vmip2p-VP2IMPleggete
Vmip2pVP2IMCPPpremiate
Vmip3p-VP3IPamano, leggono, servono
Vmii1s-VS1IIamavo,
Vmii2s-VS2IIamavi,
Vmii3s-VS3IIamava
Vmii1p-VP1IIamavano
Vmii2p-VP2IIamavate
Vmii3p-VP3IIamavano
Vmif1s-VS1IFamero'
Vmif2s-VS2IFamerai
Vmif3s-VS3IFamera'
Vmif1p-VP1IFameremo
Vmif2p-VP2IFamerete
Vmif3p-VP3IFameranno
Vmis1s-VS1IRamai
Vmis2s-VS2IRamasti
Vmis3s-VS3IRamo'
Vmis1p-VP1IRamammo
Vmis2p-VP2ICRamaste, leggeste, serviste
Vmis3p-VP3IRamarono
Vmsp1s-VSXCPlegga
Vmsp1s-VSXICPami
Vmsp2s-VSXCPlegga
Vmsp2s-VSXICPami
Vmsp3s-VSXCPlegga
Vmsp3s-VSXICPami
Vmsp1p-VP1ICPamiamo, leggiamo, serviamo
Vmsp2p-VP2CPamiate, leggiate, serviate
Vmsp2p-VP2ICMPPpremiate
Vmsp3p-VP3CPamino, leggano, servano
Vmsi1s-VS3CIamassi, leggessi, servissi
Vmsi2s-VS3CIamassi, leggessi, servissi
Vmsi3s-VS3CIamasse, leggesse, servisse
Vmsi1p-VP1CIamassimo
Vmsi2p-VP2ICRamaste, leggeste, serviste
Vmsi3p-VP3CIamassero
Vmmp2s-VS1IMPama
Vmmp2s-VS2IMPleggi, servi
Vmmp2p-VP2IMPPamate
Vmmp2p-VP2IMPleggete, servite
Vmmp2p-VP2IMCPPpremiate
Vmmp2s-yVS2MPeamalo, leggilo, servilo
Vmmp2p-yVP2MPeamatelo, leggetelo, servitelo
Vmcp1s-VS1DPamerei
Vmcp2s-VS2DPameresti
Vmcp3s-VS3DPamarebbe
Vmcp1p-VP1DPameremmo
Vmcp2p-VP2DPamereste
Vmcp3p-VP3DPamerebero
Vmnp---VFamare
Vmnp---yVFEamarlo
Vmpp-scVNSPPamante
Vmpp-pcVNPPPamanti
Vmps-smVMSPRamato, letto, servito
Vmps-pmVMPPRamati, letti, serviti
Vmps-sfVFSPRamata, letta, servita
Vmps-pfVP2IMCPPpremiate
Vmps-pfVP2IMPPamate, servite
Vmps-pfVFPPRlette
Vmps-smyVMSPREamatolo
Vmps-pmyVMPPREamatili
Vmps-sfyVFSPREamatala
Vmps-pfyVFPPREamatele
Vmgp---VGamando
Vmgp---yVGEamandolo
Vaip1s-VA1Sho
Vaip2s-VA2Shai
Vaip3s-VA3Sha
Vaip1p-VA1Pabbiamo
Vaip2p-VA2Pavete
Vaip3p-VA3Phanno
Vaii1s-VA1Savevo
Vaii2s-VA2Savevi
Vaii3s-VA3Saveva
Vaii1p-VA1Pavevamo
Vaii2p-VA2Pavevate
Vaii3p-VA3Pavevano
Vaif1s-VA1Savro'
Vaif2s-VA2Savrai
Vaif3s-VA3Savra'
Vaif1p-VA1Pavremo
Vaif2p-VA2Pavrete
Vaif3p-VA3Pavranno
Vais1s-VA1Sebbi
Vais2s-VA2Savesti
Vais3s-VA3Sebbe
Vais1p-VA1Pavemmo
Vais2p-VA2Paveste
Vais3p-VA3Pebbero
Vasp1s-VA1Sabbia
Vasp2s-VA2Sabbia
Vasp3s-VA3Sabbia
Vasp1p-VA1Pabbiamo
Vasp2p-VA2Pabbiate
Vasp3p-VA3Pabbiano
Vasi1s-VA1Savessi
Vasi2s-VA2Savessi
Vasi3s-VA3Savesse
Vasi1p-VA1Pavessimo
Vasi2p-VA2Paveste
Vasi3p-VA3Pavessero
Vamp2s-VA2Sabbi
Vamp2p-VA2Pabbiate
Vacp1s-VA1Savrei
Vacp2s-VA2Savresti
Vacp3s-VA3Savrebbe
Vacp1p-VA1Pavremmo
Vacp2p-VA2Pavreste
Vacp3p-VA3Pavrebbero
Vanp---VAFavere
Vanp---yVAFEaverlo
Va-csppVANSPPavente
Va-cpppVANPPPaventi
Va-mspsVAMSPRavuto
Va-mppsVAMPPRavuti
Va-fsps VAFSPRavuta
Va-fppsVAFPPRavute
Va-gp--VAGavendo
Va-gp--yVAGEavendolo
Vmip1s-V1Samo
Vmip2s-V2Sami
Vmip3s-V3Sama
Vmip1p-V1Pamiamo
Vmip2p-V2Pamate
Vmip3p-V3Pamano
Vmii1s-V1Samavo
Vmii2s-V2Samavi
Vmii3s-V3Samava
Vmii1p-V1Pamavamo
Vmii2p-V2Pamavate
Vmii3p-V3Pamavano
Vmif1s-V1Samero'
Vmif2s-V2Samerai
Vmif3s-V3Samera'
Vmif1p-V1Pameremo
Vmif2p-V2Pamerete
Vmif3p-V3Pameranno
Vmis1s-V1Samai
Vmis2s-V2Samasti
Vmis3s-V3Samo'
Vmis1p-V1Pamammo
Vmis2p-V2Pamaste
Vmis3p-V3Pamarono
Vmsp1s-V1Sami
Vmsp2s-V2Sami
Vmsp3s-V3Sami
Vmsp1p-V1Pamiamo
Vmsp2p-V2Pamiate
Vmsp3p-V3Pamino
Vmsi1s-V1Samassi
Vmsi2s-V2Samassi
Vmsi3s-V3Samasse
Vmsi1p-V1Pamassimo
Vmsi2p-V2Pamaste
Vmsi3p-V3Pamassero
Vmmp2s-V2Sama
Vmmp2p-V2Pamate
Vmcp1s-V1Samerei
Vmcp2s-V2Sameresti
Vmcp3s-V3Samerebbe
Vmcp1p-V1Pameremmo
Vmcp2p-V2Pamereste
Vmcp3p-V3Pamerebbero
Vmnp---VFamare
Vmnp---yVFEamarlo
Vm-csppVNSPPamante
Vm-cpppVNPPPamanti
Vm-mspsVMSPRamato
Vm-mppsVMPPRamati
Vm-fspsVFSPRamata
Vm-fppsVFPPRamate
Vm-gp--VGamando
Vm-gp--yVGEamandolo


2.4. Observations

An observation concerns the special marking for the auxiliaries: the taggers are in general not able to disambiguate the cases in which the auxiliaries are used as full verbs ("io ho un cane" , "i bambini sono nel prato") from the cases when they are auxiliaries. The distinction of the auxiliaries is used only in order to isolate 'avere' and 'essere' from the other verbs.

For verbs, two different sets of tags are proposed, the first more fine-grained for more accurate distinctions and the latter more coarse-grained, which follows the approach proposed by the French group.

The collapsing proposed by the French group of Moods and Tenses, if considered wrt to the performances of our tagger, appears restrictive: for many unambiguous tenses and moods, the Italian tagger is able to formulate the correct analysis (e.g. conditional, subjunctive imperfect, indicative past etc.) and these distinctions are, in our opinion, worth being maintained. It has to be noticed that the ambiguities between verb forms depend also on different lexical verbs.

In Italian, the major ambiguities concerns the 2nd sing and plur of the present indicative and imperative, ama-amate; leggi-leggete. However, this is again not a general rule.

Another very common ambiguity is between the 2nd pers. of the indicative and the 1st, 2nd, 3rd person of the present subjunctive. Therefore not always it is possible to decide unambiguosly on the person.

Some more frequent typical homographies in Italian are listed below:

                VP1ICP                amiamo
                VP2IMP                leggete
                VP2IMPP               amate
                VP2IMCPP              premiate
                VP2ICR                amaste
                VS3CI                 amassi
                VSXCP                 legga
                VAY2IP                sono
                VSXICP                ami
                VS2IMP                leggi
                VS1IMP                ama
                VS1IMP                ama

In the design of corpus tagsets for verbs careful attention should be given to the enclitic phenomenon: at present our tagger is able to recognize the presence of the clitics which is signalled by the addition of the mark "+E" (plus clitic) to the regular verb tag.

Compare with :



3. Italian Adjectives (A)

3.1. Lexicon

============ =========== =========== =====
Attribute      Value       Example    Code
============ =========== =========== =====
Type             -           -         -
------------ ----------- ----------- -----
Degree       positive	 buono        p
             comparative  migliore     c
             superlative  buonissimo   s
------------ ----------- ----------- -----
Gender       masculine    buono        m
             feminine     buona        f
l-spec       common       dolce        c
------------ ----------- ----------- -----
Number       singular     buono        s
             plural       buoni        p
l-spec       invariant	 pari         n
------------ ----------- ----------- -----
Case         (n.a.)       (n.a.)       -
============ =========== =========== =====



3.2. Corpus

TagRegular expressionDefinition
AFPA-.fp-Adjective fem. plur.
AFSA-.fs-Adjective fem. sing.
AFNA-.fn-Adjective fem. invar.
AMPA-.mp-Adjective masc. plur.
AMSA-.ms-Adjective masc. sing.
AMNA-.mn-Adjective masc. invar.
AMPA-.mp-Adjective comm. plur.
AMSA-.ms-Adjective comm. sing.
AMNA-.mn-Adjective comm. invar.



3.3. Combinations

LexiconCorpusExample
A-pms-AMSvero
A-pmp-AMPveri
A-pmn-AMNoggetto (complemento/i oggetto: grammatical language)
A-pfs-AFSvera
A-pfp-AFPvere
A-pfn-AFNvalore (clausola valore: juridical language)
A-pcs-ANSdolce (biscotto, torta)
A-pcp-ANPdolci (biscotti, dolci)
A-pcn-ANNpari (risultato/i, somma/e)
A-sms-AMSverissimo
A-smp-AMPverissimi
A-sfs-AFSverissima
A-sfp-AFPverissime



3.4. Observations

The comparative Degree applies only to a close set of adjectives (e.g. maggiore, migliore, etc). All other adjectives form their comparatives with "piu'" + adjective (e.g., piu' forte). Superlative is also an analytical form (il piu' forte), but can be also synthetically formed: grandissimo, massimo.


Compare with :



4. Italian Pronouns (P)

4.1. Lexicon

============ =========== =========== =====
Attribute      Value       Example    Code
============ =========== =========== =====
Type         personal      io          p
             demonstrat.   quello      d
             indefinite    chiunque    i
             possessive    mio         s
             interrog.     chi         t
             relative      che         r
             exclamative   quanto      e
------------ ----------- ----------- -----
Person       first         io          1
             second        tu          2
             third         egli        3
------------ ----------- ----------- -----
Gender       masculine     questo      m
             feminine      questa      f
l-spec       common        io          c
------------ ----------- ----------- -----
Number       singular      questo      s
             plural        questi      p
l-spec       invariant     che         n
------------ ----------- ----------- -----
Case         (n.a.)        (n.a.)      -
------------ ----------- ----------- -----
Possessor        -           -         -
============ =========== =========== =====



4.2. Corpus

TagRegular expressionDefinition
PDMSPd-ms--Demonstrative pronoun masc.sing.
PDMPPd-mp--Demonstrative pronoun masc.plur.
PDFSPd-fs--Demonstrative pronoun femm.sing.
PDFPPd-fp--Demonstrative pronoun femm.plur.
PDNSPd-cs--Demonstrative pronoun comm.sing.
PDNPPd-cp--Demonstrative pronoun comm.plur.
PIMSPi-ms--Indefinite pronoun masc.sing.
PIMPPi-mp--Indefinite pronoun masc.plur.
PIFSPi-fs--Indefinite pronoun femm.sing.
PIFPPi-fp--Indefinite pronoun femm.plur.
PINSPi-cs--Indefinite pronoun comm.sing.
PINPPi-cp--Indefinite pronoun comm.plur.
PPMSPs.ms--Possessive pronoun, masc.sing.
PPMPPs.mp--Possessive pronoun, masc.plur.
PPFSPs.fs--Possessive pronoun, femm.sing.
PPFPPs.fp--Possessive pronoun, femm.plur.
PPNPPs.cp--Possessive pronoun, comm.plur.
PWNSP[tre]-cs--Interr./Rel./Escl. pronoun, comm.sing.
PWNPP[tre]-cp--Interr./Rel./Escl. pronoun, comm.plur.
PWNNP[tre]-cn--Interr./Rel./Escl. pronoun, comm.plur.
PWMSP[tre]-ms--Interr./Rel./Escl. pronoun, masc.sing.
PWMPP[tre]-mp--Interr./Rel./Escl. pronoun, masc.plur.
PWFSP[tre]-fs--Interr./Rel./Escl. pronoun, femm.sing.
PWFPP[tre]-fp--Interr./Rel./Escl. pronoun, femm.plur.
PQNS1Pp1cs--Personal pronoun, 1st pers., comm.sing.
PQNS2Pp2cs--Personal pronoun, 2nd pers., comm.sing.
PQMS3Pp3ms--Personal pronoun, 3rd pers., masc.sing.
PQFS3Pp3fs--Personal pronoun, 3rd pers., femm.sing.
PQNN3Pp3cn--Personal pronoun, 3rd pers., comm.inv.
PQNP1Pp1cp--Personal pronoun, 1st pers., comm.plur.
PQNP2Pp2cp--Personal pronoun, 2nd pers., comm.plur.
PQNP3Pp3cp--Personal pronoun, 3rd pers., comm.plur.
PQMP3Pp3mp--Personal pronoun, 3rd pers., masc.plur.
PQFP3Pp3fp--Personal pronoun, 3rd pers., femm.plur.
PFPP..fp---Pronoun, fem. plur.
PFSP..fs---Pronoun, fem. plur.
PMPP..mp---Pronoun, masc. plur.
PMSP..ms---Pronoun, masc. sing.
PNSP..cs---Pronoun, comm. sing.
PNPP..cp---Pronoun, comm. plur.
PNNP..cn---Pronoun, comm. inv.



4.3. Combinations

LexiconCorpusExample
Pd-ms--PDMSquello, costui
Pd-mp--PDMPquelli
Pd-fs--PDFSquella
Pd-fp--PDFPquelle
Pd-cs--PDNScio'
Pd-cp--PDNPcoloro
Pi-ms--PIMSognuno
Pi-mp--PIMPalcuni
Pi-fs--PIFSognuna
Pi-fp--PIFPalcune
Pi-cs--PINSchiunque, tale
Pi-cp--PINPtali
Ps1ms--PPMSmio, nostro
Ps1mp--PPMPmiei
Ps1fs--PPFSmia
Ps1fp--PPFPmie
Ps2ms--PPMStuo, vostro
Ps2mp--PPMPtuoi
Ps2fs--PPFStua
Ps2fp--PPFPtue
Ps3ms--PPMSsuo
Ps3mp--PPMPsuoi
Ps3fs--PPFSsua
Ps3fp--PPFPsue
Ps3cp--PPNPloro
Pt-cs--PWNSchi? quale?
Pt-cp--PWNPquali?
Pt-cn--PWNNche?
Pt-ms--PWMSquanto?
Pt-mp--PWMPquanti?
Pt-fs--PWFSquanta?
Pt-fp--PWFPquante?
Pr-cn--PWNNcui
Pr-ms--PWMSquanto
Pr-mp--PWMPquanti
Pr-fs--PWFSquanta
Pr-fp--PWFPquante
Pr-cs--PWNSchi, quale
Pr-cp--PWNPquali
Pe-ms--PWMSquanto!
Pe-mp--PWMPquanti!
Pe-fs--PWFSquanta!
Pe-fp--PWFPquante!
Pe-cs--PWNSquale!
Pe-cp--PWNPquali!
Pe-cn--PWNNche!
Pp1cs--PQNS1io, me, mi
Pp2cs--PQNS2tu, te, ti,
Pp3ms--PQMS3egli, lui, esso, gli, lo
Pp3fs--PQFS3ella, lei, essa, le, la
Pp3cn--PQNN3si
Pp1cp--PQNP1noi, ci
Pp2cp--PQNP2voi, vi
Pp3cp--PQNP3loro,
Pp3mp--PQMP3essi, li
P..fp---PFPmie, queste, quante etc.
P..fs---PFSmia, questa, quanta etc.
P..mp---PMPmiei, questi, quanti etc.
P..ms---PMSmio, questo, quanto etc.
P..cs---PNSquale
P..cp---PNPquali
P..cn---PNNche, cui, altrui



4.3. Observations

For pronouns, the strategy of proposing two different tagsets, the one more collapsed and the other more fine-grained is followed.

As far as the pronominal paradigm is concerned, Case is not encoded at present in our DMI (Calzolari et al. 1983). Personal pronouns are not lemmatized: 'gli' is not considered the dative form of the base pronoun 'egli' (he), but constitutes a separate entry.

The Italian pronominal paradigm is the following:

  • 'forme toniche' (strong forms): subj (io, egli), compl (me, lui)
            ama me / da' a me
           dir-obj / prep-obj
      (he loves me / he gives to me)
    
            ama lui / da' a lui
            dir-obj / prep-obj
     (she loves him / she gives to him)
    

  • 'forme atone' (weak forms): compl (mi, gli/lo)
            mi da' / mi ama
           ind-obj / dir-obj
      (he gives me / he loves me)
    
    
           gli da'
           ind-obj
        (he gives him)
    
           lo ama
           dir-obj
        (she loves him)
    
    

    This paradigm can be mapped on the Case system proposed by the French group, in the following way:

    io, egli               = subj                          = nom
    mi/me                  = dir-obj/ind-obj/prep-obj      = obj -] acc, dat, prep+obl
    lui                    = dir-obj/prep-obj              = obj -] acc, prep+obl
    gli                    = ind-obj                       = dat
    lo                     = dir-obj                       = acc
    



    Compare with :



    5. Italian Determiners (Pronominal Adjectives) (D)

    5.1. Lexicon

    ============ =========== =========== =====
    Attribute      Value       Example    Code
    ============ =========== =========== =====
    Type         demonstrat.   questo      d
                 indefinite    ogni        i
                 possessive    mio         s
                 interrogat.   che         t
                 exclamative   quanto      e    this value has been added
                 relative      quanto      r    this value has been added
    ------------ ----------- ----------- -----
    Person       first         mio         1
                 second        tuo         2
                 third         suo         3
    ------------ ----------- ----------- -----
    Gender       masculine     questo      m
                 feminine      questa      f
    l-spec       common        ogni        c
    ------------ ----------- ----------- -----
    Number       singular      quello      s
                 plural        quelli      p
    l-spec       invariant     altrui      n
    ------------ ----------- ----------- -----
    Case         (n.a.)        (n.a.)      -
    ------------ ----------- ----------- -----
    Possessor      -              -        -
    ============ =========== =========== =====
    



    5.2. Corpus

    TagRegular expressionDefinition
    DDNSDd-ns--Demonstrative pron.adj. comm.inv.
    DDNPDd-np--Demonstrative pron.adj. comm.plur.
    DDMSDd-ms--Demonstrative pron.adj. masc.sing.
    DDMPDd-mp--Demonstrative pron.adj. masc.plur.
    DDFSDd-fs--Demonstrative pron.adj. femm.sing.
    DDFPDd-fp--Demonstrative pron.adj. femm.plur.
    DIMSDi-ms--Indefinite pron.adj. masc.sing.
    DIMPDi-mp--Indefinite pron.adj. masc.plur.
    DIFSDi-fs--Indefinite pron.adj. femm.sing.
    DIFPDi-fp--Indefinite pron.adj. femm.plur.
    DINSDi-cs--Indefinite pron.adj. comm.sing.
    DINPDi-cp--Indefinite pron.adj. comm.plur.
    DPMSDs.ms--Possessive pron.adj., masc.sing.
    DPMPDs.mp--Possessive pron.adj., masc.plur.
    DPFSDs.fs--Possessive pron.adj., femm.sing.
    DPFPDs.fp--Possessive pron.adj., femm.plur.
    DPNNDs-cn--Possessive pron.adj., comm.inv.
    DWNND[tre]-cn--Interr/Relat./escl. pron.adj., comm.inv.
    DWMSD[tre]-ms--Interr/Relat./escl. pron.adj., masc.sing.
    DWMPD[tre]-mp--Interr/Relat./escl. pron.adj., masc.plur.
    DWFSD[tre]-fs--Interr/Relat./escl. pron.adj., femm.sing.
    DWFPD[tre]-fp--Interr/Relat./escl. pron.adj., femm.plur.
    DWNSD[tre]-cs--Interr/Relat./escl. pron.adj., comm.sing.
    DWNPD[tre]-cp--Interr/Relat./escl. pron.adj., comm.plur.
    DFPD..fp---Determiner, fem. plur.
    DFSD..fs---Determiner, fem. plur.
    DMPD..mp---Determiner, masc. plur.
    DMSD..ms---Determiner, masc. sing.
    DNSD..cs---Determiner, comm. sing.
    DNPD..cp---Determiner, comm. plur.
    DNND..cn---Determiner, comm. inv.



    5.3. Combinations

    LexiconCorpusExample
    Dd-cs--DDNStale
    Dd-cp--DDNPtali
    Dd-ms--DDMSquello
    Dd-mp--DDMPquelli
    Dd-fs--DDFSquella
    Dd-fp--DDFPquelle
    Di-ms--DIMSnessun
    Di-mp--DIMPalcuni
    Di-fs--DIFSnessuna
    Di-fp--DIFPalcune
    Di-cs--DINSogni
    Di-cp--DINPquali
    Ds1ms--DPMSmio, nostro
    Ds1mp--DPMPmiei
    Ds1fs--DPFSmia
    Ds1fp--DPFPmie
    Ds2ms--DPMStuo, vostro
    Ds2mp--DPMPtuoi
    Ds2fs--DPFStua
    Ds2fp--DPFPtue
    Ds3ms--DPMSsuo
    Ds3mp--DPMPsuoi
    Ds3fs--DPFSsua
    Ds3fp--DPFPsue
    Ds-cn--DPNNaltrui
    Dr-cn--DWNNcui
    Dr-ms--DWMSquanto
    Dr-mp--DWMPquanti
    Dr-fs--DWFSquante
    Dr-fp--DWFPquanti
    Dr-cs--DWNSquale
    Dr-cp--DWNPquale
    Dt-cn--DWNNche
    Dt-ms--DWMSquanto
    Dt-mp--DWMPquanti
    Dt-fs--DWFS quante
    Dt-fp--DWFPquanti
    Dt-cs--DWNSquale
    Dt-cp--DWNPquale
    De-cn--DWNNche
    De-cp--DWNPquali
    De-cs--DWNSquale
    De-ms--DWMSquanto
    De-mp--DWMPquanti
    De-fs--DWFSquanta
    De-fp--DWFPquante
    D..fp---DFPmie, queste, quante etc.
    D..fs---DFSmia, questa, quanta etc.
    D..mp---DMPmiei, questi, quanti etc.
    D..ms---DMSmio, questo, quanto etc.
    D..cs---DNSquale
    D..cp---DNPquali
    D..cn---DNNaltrui

    Note On the basis of the strategy adopted for Pronouns, also for Determiners two tagsets are proposed.


    Compare with :



    6. Italian Articles (T)

    6.1. Lexicon

    ============ =========== =========== =====
    Attribute      Value       Example    Code
    ============ =========== =========== =====
    Type          definite       il        d
                  indefinite     un        i
    ------------ ----------- ----------- -----
    Gender        masculine      il        m
                  feminine       la        f
    l-spec        common         l'        c
    ------------ ----------- ----------- -----
    Number        singular       la        s
                  plural         le        p
    ------------ ----------- ----------- -----
    Case          (n.a)        (n.a.)        -
    ============ =========== =========== =====
    



    6.2. Corpus

    TagRegular expressionDefinition
    RMSTdms-Article, definite, masc.sing.
    RMPTdmp-Article, definite, masc.plur.
    RFSTdfs-Article, definite, femm.sing.
    RFPTdfp-Article, definite, femm.plur.
    RNSTdcs-Article, definite, comm.sing.
    RIMS Tims-Article, indefinite, masc.sing.
    RIFS Tifs-Article, indefinite, femm.sing.



    6.3. Combinations

    LexiconCorpusExample
    Tdms-RMSil, lo
    Tdmp-RMPi, gli
    Tdfs-RFSla
    Tdfp-RFPle
    Tdcs-RNSl' (amico/a)
    Tims-RIMSun, uno
    Tifs-RIFSuna, un'



    Compare with :



    7. Italian Adverbs (R)

    7.1. Lexicon

    ============ =========== =========== =====
    Attribute      Value       Example    Code
    ============ =========== =========== =====
    Type             -            -        -
    ------------ ----------- ----------- -----
    Degree       positive     bene         p
                 superlative  benissimo    s
    ============ =========== =========== =====
    



    7.2. Corpus

    TagRegular expressionDefinition
    BR-pAdverb positive
    BSR-sAdverb superaltive



    7.3. Combinations

    LexiconCorpusExample
    R-pBfortemente
    R-sBSfortissimamente

    Note

    The feature Type is not encoded in the Italian lexicon.


    Compare with :



    8. Italian Adpositions (S)

    8.1. Lexicon

    ============ =========== =========== =====
    Attribute      Value       Example    Code
    ============ =========== =========== =====
    Type         preposition  di, a, da    p
    ------------ ----------- ----------- -----
    Formation    simple       di           s
                 compound     dello        c
    ------------ ----------- ----------- -----
    Gender       masculine    dello        m    This attribute and values have been added
                 feminine     alla         f    
    l-spec       common       dell'        c
    ------------ ----------- ----------- -----
    Number       singular     al           s    This attribute and values have been added
                 plural       ai           p    
    ============ =========== =========== =====
    




    8.2. Corpus

    TagRegular expressionDefinition
    ESp-Preposition simple
    EASpc..Preposition compound



    8.3. Combinations

    LexiconCorpusExample
    SpE di
    SpcfsEAdella
    SpcfpEAdelle
    SpcmsEAdel, dello
    SpcmpEAdei, degli
    SpccnEAdell'

    Note

    The Italian policy for encoding fused prepositions foresees to attach the morphological information of the article to the preposition tag.


    Compare with :



    9. Italian Conjunctions (C)

    9.1. Lexicon

    ============ =========== =========== =====
    Attribute      Value       Example    Code
    ============ =========== =========== =====
    Type          coordinat.    e          c
                  subordinat.   perche'    s
    ============ =========== =========== =====
    




    9.2. Corpus

    TagRegular expressionDefinition
    CCCcCoordinative conjunction
    CSCsSubordinative conjunction



    9.3. Combinations

    LexiconCorpusExample
    CcCCma
    CsCSperche'




    Compare with :



    10. Italian Numerals (M)

    10.1. Lexicon

    ============ =========== =========== =====
    Attribute      Value       Example    Code
    ============ =========== =========== =====
    Type          cardinal    cento        c
                  ordinal     primo        o
    ------------ ----------- ----------- -----
    Gender        masculine   primo        m
                  feminine    prima        f
    ------------ ----------- ----------- -----
    Number        singular    secondo      s
                  plural      secondi      p
    ------------ ----------- ----------- -----
    Case          (n.a.)       (n.a.)      -
    ============ =========== =========== =====
    




    10.2. Corpus

    TagRegular expressionDefinition
    NMSM.ms-Numeral, masc.sing.
    NFSM.fs-Numeral, femm.sing.
    NMPM.mp-Numeral, masc.plur.
    NFPM.fp-Numeral, femm.plur.
    NMc---Numeral cardinal




    10.3. Combinations

    LexiconCorpusExample
    M.ms-NMSprimo
    M.fs-NFSprima
    M.mp-NMPprimi
    M.fp-NFPprime
    Mc---Nzero, cento



    Compare with :



    11. Italian Interjections (I)

    11.1. Corpus

    TagRegular expressionDefinition
    IIInterjection



    11.2.Combinations

    LexiconCorpusExample
    IIoh



    Compare with :



    12. Italian Unique membership class (U)

    None.


    Compare with :



    13. Italian Residual (X)

    13.1. Corpus

    TagRegular expressionDefinition
    NY???"Guessed" Noun
    AY???"Guessed" Adjective



    13.2. Combinations

    LexiconCorpusExample
    ???NYbit
    ???AYcomputerizzato



    13.3. Observations

    At corpus level, we have the tag SY which is used to mark symbols, letters, acronyms, foreign words, toponyms etc., in general unknown words, for which a "guess" is provided.

    Compare with :



    14. Italian Punctuation

    TagExample
    punct.,;:?! etc.



    Compare with :





    Top | LPL/CNRS | Multext home page | General Content | Other Languages Description
    Copyright © Centre National de la Recherche Scientifique, 1996.
    This page will undergo frequent modification. Therefore, please do not mirror this page.