The Swedish Multext comparable corpus
Eva Ejerhed (University of Umea)
This report briefly describes the comparable corpus of Swedish financial newstext contributed by the University of Umea to the MULTEXT project, the different versions of this corpus, and the way in which they were processed.
Unfortunately, the Swedish MULTEXT project provides no parallel Swedish corpus aligned with English, mainly because there are no Swedish translations of the European Parliamentary debates used by other MULTEXT partners. However, work on multilingual parallel corpora that include Swedish is in progress at the Department of Swedish, University of Gothenburg (D. Ridings, P. Danielsson), using tools developed in Edinburgh for the European MULTEXT/MLCC projects. There is also recent work on the alignment of parallel corpora in the context of two translation projects, carried out at the University of Uppsala (A. Sagvall Hein) and at the University of Linkoping (L. Ahrenberg, M. Merkel) respectively.
The comparable corpus, Dagens Industri 1993 (DI93), consists of all 14 180 articles from the Swedish daily financial newspaper Dagens Industri, that were electronically archived and published by Aff{rsdata AB in 1993. The size of the corpus is roughly 5 M words. For more precise statistics, see Appendix I.
The Swedish MULTEXT project has been greatly helped by the generosity of Aff{rsdata AB in making these and other texts available for research and experiments in text analysis at the University of Umea.
Dagens Industri is a newspaper directed at the Swedish business community. It is published in hard copy (pink paper) on weekdays, except during industrial vacations in July, and it currently has a daily circulation of 95 200 copies.
The newspaper is published by Dagens Industri AB, Stockholm. The work on the DI93 corpus was carried out in the General Linguistics research laboratory of the Department of Linguistics during the academic years 1994/95 and 1995/96, under the direction and with the active participation of the author, and with the invaluable assistance of research engineers Magnus Astrom and Fredrick Backman.
All funding for the Swedish MULTEXT project has been provided by the Swedish agency NUTEK.
The DI93 corpus contributed to the MULTEXT project consists of 274 files. The name of each file is DI followed by a date (year, month, day), e.g. DI930107, DI930320. Each file consists of all material that was electronically published that day. Each file is divided into a sequence of articles (here called blocks), and each article/block is preceded by a tag that identifies it uniquely. The tag for each block is composed of the date of the file in which it occurs followed by a hyphen and a number, corresponding to the unique identifier of the article in Aff{rsdata's text archive, e.g. <<<<930320-119827>>>>. In the versions submitted, block elements have no end tags, but they could easily be added, e.g. <<<</930320-119827>>>>. The files are provided in the same Unix Record/Field format as was used in Umea in processing the SUC corpus. A program developed in Umea by Fredrick Backman for fully automatic conversion of this Record/Field format to an informationally equivalent SGML format is included with the delivered DI93 corpus data.
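As an illustration of the naming scheme just described, the following Python sketch splits a block tag into the file name, the date and the archive identifier. The function name `parse_block_tag` is hypothetical and is not part of the delivered tools; the pattern is deliberately tolerant about the number of angle brackets, and the reading of two-digit years as 19yy is an assumption that holds for the 1993 material.

```python
import re
from datetime import date

# Tolerant pattern: one or more angle brackets on each side,
# and an optional "/" for a (possible) end tag.
BLOCK_TAG = re.compile(r"^<+(/?)(\d{2})(\d{2})(\d{2})-(\d+)>+$")

def parse_block_tag(tag):
    """Return (is_end_tag, file_name, day, article_id), or None if
    the string is not a block tag (e.g. a structural tag like <h>)."""
    m = BLOCK_TAG.match(tag)
    if m is None:
        return None
    slash, yy, mm, dd, article_id = m.groups()
    # Assumption: all DI93 material is from 1993, so yy means 19yy.
    day = date(1900 + int(yy), int(mm), int(dd))
    return (slash == "/", "DI" + yy + mm + dd, day, int(article_id))
```

For example, the tag for article 119827 yields the file name DI930320 and the date 20 March 1993, while a structural tag such as `<h>` is rejected.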
Four versions of the data are provided, grouped into four directories named respectively DI93.raw, DI93.tok, DI93.str and DI93.dis. The first three directories contain all 274 files: DI93.raw contains the initial raw data, DI93.tok contains tokenized data, and DI93.str contains tokenized data with typographically generated structural markup that identifies heads (<h>, </h>) and paragraphs (<p>, </p>). The last directory, DI93.dis, contains 12 files of data, corresponding to approximately 200 000 words. These 12 files have undergone a sequence of fully automatic processing steps consisting of tokenization, structural markup, lexical lookup that adds a morphosyntactic analysis and a lemma to each token, and probabilistic disambiguation by a trigram tagger that results in a unique analysis assigned to each token. The trigram tagger is the result of joint efforts of Magnus Astrom and the author, and an early version of this tagger is described in Astrom (1994).
These processing steps were actually applied to all 274 files in order to test a set of strategies for analyzing unknown words that have been developed at Umea, but in accordance with MULTEXT project specifications, only 12 of the resulting POS annotated files (i.e. 200 000 words) are included in the data currently provided. The tagset used in DI93 (168 tags) is virtually the same as the tagset used in the Stockholm-Umea Corpus (SUC) (160 tags). The set of DI93 tags, the frequency of occurrence of each tag in the complete material of 274 files, and an example of each tag are provided in Appendix II. The considerations that went into creating the SUC tagset are described in Ejerhed (1994), and an early version of the SUC tagset is described in Ejerhed et al (1992). The construction of this tagset took into consideration the recommendations of TEI AI 1W2, 1992.
SUMMARY STATISTICS OF THE DI93 CORPUS:
The properties of each of the DI93 versions, and the tools by which they were created are briefly described in the following sections of the report. The report ends with a section on the results of one quantitative evaluation of the processing methods that were used to create the DI93.dis files.
The data in the DI93.raw files consists of unformatted text in 7-bit ASCII characters. The last three characters of the Swedish alphabet are represented by }, {, | for the lower case single characters here rendered as a^o, a^e, o^e, and ], [, \ for the upper case single characters A^o, A^e, O^e.
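The character mapping above can be sketched as a small Python helper for readers who want to view the data with the actual Swedish letters. The function name `decode_di93` is hypothetical. Note that the mapping is lossy in one direction: any genuine brace, bracket, bar or backslash in the source text would also be converted, so the sketch is only suitable for display purposes.

```python
# Mapping stated in the report: } { | are the lower case letters,
# ] [ \ the corresponding upper case letters.
SWEDISH_7BIT = str.maketrans({
    "}": "å", "{": "ä", "|": "ö",
    "]": "Å", "[": "Ä", "\\": "Ö",
})

def decode_di93(text):
    """Replace the 7-bit stand-ins by the Swedish letters."""
    return text.translate(SWEDISH_7BIT)
```

With this helper, `f|rs{ljningen` is displayed as `försäljningen`.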
We observed that the raw data contained some illformed input. There were a number of misspelled single words, many cases of two words written as a single word with space omitted, and cases of single words broken into two words by a space, due to the presence of a foreign character. Examples are provided below. We have no statistics on the number of these cases of illformedness. The cases of misspelled words, and run-in words were left as such, because we had no fully automatic means of either locating, counting or correcting them, and we had committed ourselves to a general strategy of fully automatic processing for this project. Broken words that were located by automatic means were mended.
EXAMPLES OF ILLFORMED INPUT:
MISSPELLED WORDS
utl{ndsa for utl{ndska 'foreign'
f|rs{ljingen for f|rs{ljningen 'the_sale'
ill for till 'to'
tll for till 'to'
gic for gick 'went'
ocn for och 'and'
oxh for och 'and'
pch for och 'and'
RUN-IN WORDS
("<fasti>" <11> (NN UTR SIN IND NOM "fasti")) for ("<fast>" (KN "fast")) 'although' ("<i>" (PP "i")) 'in'
("<Till|kningi>" <51> (PM NOM "Till|kningi")) for ("<Till|kning>" (NN UTR SIN IND NOM "till|kning")) 'growth' ("<i>" (PP "i")) 'in'
BROKEN WORDS
Su ede for Su#e2de
Dep ots for Dep#o3ts
Fran ois for Fran#c5ois
The effects of the measures described above on subsequent processing were as follows. Misspelled single words received either incorrect or correct POS analyses; correct analyses were possible because the strategies for dealing with unknown words were flexible enough to analyze some illformed input correctly. All cases of two words incorrectly written as one predictably led to incorrect POS analyses, and for this reason they were the most serious cases of illformedness in the input. The mended broken words were either correctly or incorrectly analyzed, depending on properties of the lexical lookup and disambiguation processes.
The data in the DI93.tok files consists of the output of a new, fully automatic tokenizer for Swedish, implemented in C by Magnus Astrom. In the default case, a token equals a character sequence preceded and followed by space. A general strategy of maximizing token length was followed, resulting in sequences like "AB/Kvaerner" and "VA/IMU-barometern" being considered as tokens. For words that were separated by a hyphen at the end of a line, tokenization removed the hyphen, making the result a single token, placed on one line. Punctuation symbols were separated from the character sequences to which they initially belonged and treated as separate tokens. No human interaction was used to decide whether periods were sentence final punctuation or not, in contrast to the processing of the SUC corpus, where such interaction had been used.
Special routines were written to cover all kinds of numerical tokens that occurred in the DI93 corpus, resulting in tokens such as "500_000", "1435,9", "-1,07" and "+0,7".
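A minimal Python sketch of one such routine is given below. It assumes, by analogy with the treatment of abbreviations, that the underscore in a token like "500_000" stands for the space that separates groups of three digits in the running text; this is an assumption on our part, and the names `GROUPED_NUMBER` and `join_grouped_numbers` are hypothetical. Signs and decimal commas need no special treatment in this sketch, since forms like "-1,07" contain no space.

```python
import re

# A number written in groups of three digits separated by single spaces,
# e.g. "500 000" or "5 000 000" (assumed input form).
GROUPED_NUMBER = re.compile(r"(?<![\w,])\d{1,3}(?: \d{3})+(?!\d)")

def join_grouped_numbers(text):
    """Turn space-separated digit groups into one underscore-joined token."""
    return GROUPED_NUMBER.sub(lambda m: m.group(0).replace(" ", "_"), text)
```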
Abbreviations in Swedish text can be written either with or without periods according to current norms (either "Kgl." or "Kgl" abbreviates some form of "Kunglig" 'Royal'). Abbreviations of sequences of words like "till exempel" 'for example' can be written either "t.ex.", "t. ex." or "t ex", and for such multi-word abbreviations, an updated version of Astrom's abbreviations tool (AN-tool) that was used in processing the SUC corpus was created for the DI93 corpus, resulting in all versions of such abbreviations, including the ones containing an internal space, or spaces, becoming single tokens with an underscore added to represent space ("t.ex.", "t._ex.", and "t_ex").
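The effect described above can be sketched in Python as follows. This is not the AN-tool itself: the two-entry abbreviation list and the function names are hypothetical, and the sketch makes no attempt to decide whether a trailing period also ends the sentence.

```python
import re

# Hypothetical mini-inventory; each abbreviation is given as its parts.
ABBREVIATIONS = [["t", "ex"], ["p", "g", "a"]]

def abbreviation_regex(parts):
    # Between two parts there must be a period, spaces, or both.
    sep = r"(?:\.[ ]*|[ ]+)"
    body = sep.join(re.escape(p) for p in parts)
    # No word character directly before or after; optional final period.
    return re.compile(r"(?<!\w)" + body + r"\.?(?!\w)")

def join_abbreviations(text, abbreviations=ABBREVIATIONS):
    """Replace the internal spaces of known multi-word abbreviations
    by underscores, so that each abbreviation becomes one token."""
    for parts in abbreviations:
        text = abbreviation_regex(parts).sub(
            lambda m: re.sub(r" +", "_", m.group(0)), text)
    return text
```

The three written variants of "till exempel" come out as "t.ex.", "t._ex." and "t_ex", while ordinary words that merely contain the same letters are left untouched.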
Only multi-word sequences that are abbreviations are treated as single tokens by the DI93 tokenizer, as well as the SUC tokenizer. All other multi-word sequences are treated as sequences of single tokens, e.g. "d{rf|r att" lit. 'because that' is tokenized "d{rf|r" "att", "vad som" lit. 'what that' is tokenized "vad" "som", "i fr}ga om" lit. 'in question of' is tokenized "i" "fr}ga" "om", etc. The reason for this general policy, which was also adopted in the SUC project, is that it is not easy to formulate a principled basis for when to consider a given word sequence to be a single token, and when not to do so. Based on experiences with the SUC corpus, the strategy of considering words in such sequences as separate tokens has not had any adverse effect on lexical lookup, disambiguation or partial parsing. More detailed investigations of the frequency of occurrence of multi-word sequences, and of their appropriate morphosyntactic analyses in different contexts of occurrence, are necessary in order to construct lists of multi-word sequences amenable to treatment as single units.
The data in the DI93.str files consists of the result of tokenization plus the added result of a C-program for fully automatic, typographically driven, gross structural markup, created by Fredrick Backman. The only units recognized are heads <h> and paragraphs (i.e. non-heads) <p>.
Both <h> and <p> units are recognized on the basis of an empty line (or lines) and/or the indentation of a line. What distinguishes these two units is that the initial "orthographic paragraph" in each article/block is classified as head, regardless of whether it is written in upper, lower or mixed case; further, any block internal "orthographic paragraph" that is written in upper case throughout is classified as head, since a study of the initial DI93 data revealed that this convention was adhered to consistently. All remaining block internal "orthographic paragraphs" are classified as paragraphs, rather than as heads.
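The classification rule can be summarized in a short Python sketch. It is not the C program by Fredrick Backman; the function names are hypothetical, and the sketch assumes that the orthographic paragraphs of one block have already been separated on the basis of empty lines and indentation. Because the data is in the 7-bit encoding, the lower case characters }, { and | have to be checked explicitly when testing for upper case.

```python
# The 7-bit stand-ins for the lower case Swedish letters.
LOWER_7BIT = set("}{|")

def is_all_upper(text):
    """True if the text has cased letters, all of them upper case,
    and none of the 7-bit lower case stand-ins."""
    return text.isupper() and not (LOWER_7BIT & set(text))

def classify_units(orthographic_paragraphs):
    """Label the orthographic paragraphs of one block as 'h' or 'p'."""
    return [
        "h" if index == 0 or is_all_upper(text) else "p"
        for index, text in enumerate(orthographic_paragraphs)
    ]
```

A paragraph that merely begins in upper case, like the one starting "F\RETAGSLEDNINGARNA [R STILBILDARE f|r de", is labelled 'p', in line with the rule that a head must be in upper case throughout.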
EXAMPLES
BLOCK INITIAL HEAD
("<<<<<930320-119770>>>>>" <17243>)
("<<h>>" <17244>)
("<Fondhandlare>" <17245> (NN UTR PLU IND NOM "Fondhandlare"))
("<i>" <17246> (PP "i"))
("<krism|te>" <17247> (NN NEU SIN IND NOM "krism|te"))
("<p_g_a>" <17248> (AB AN "p}_grund_av"))
("<Orion>" <17249> (PM NOM "Orion"))
("<</h>>" <17250>)
BLOCK INTERNAL HEAD
("<</p>>" <984>)
("<<h>>" <985>)
("<SM]F\RETAGEN>" <986> (NN NEU PLU DEF NOM "sm}f|retag"))
("<NISCHEN>" <987> (NN UTR SIN DEF NOM "nisch"))
("<</h>>" <988>)
("<<p>>" <989>)
("<</p>>" <1100>)
("<<h>>" <1101>)
("<INTE>" <1102> (AB "inte"))
("<UTL[NDSA>" <1103> (JJ POS UTR/NEU PLU IND/DEF NOM "utl{nds"))
("<AKTIER>" <1104> (NN UTR PLU IND NOM "aktie"))
("<</h>>" <1105>)
("<<p>>" <1106>)
BLOCK INTERNAL PARAGRAPH
("<<p>>" <989>)
("<$">" <990> (DL PAD "$""))
("<Vi>" <991> (PN UTR PLU DEF SUB "vi"))
("<har>" <992> (VB PRS AKT "ha"))
("<alltid>" <993> (AB "alltid"))
("<f|rs|kt>" <994> (VB SUP AKT "f|rs|ka"))
("<f|lja>" <995> (VB INF AKT "f|lja"))
("<marknaden>" <996> (NN UTR SIN DEF NOM "marknad"))
("<ur>" <997> (PP "ur"))
("<v}rt>" <998> (PS NEU SIN DEF "v}r"))
("<perspektiv>" <999> (NN NEU SIN IND NOM "perspektiv"))
("<<p>>" <1106>)
("<Matteus>" <1107> (PM GEN "Matteus"))
("<20>" <1108> (RG NOM "20"))
("<anst{llda>" <1109> (NN UTR PLU IND NOM "anst{lld"))
("<handlar>" <1110> (VB PRS AKT "handla"))
("<inte>" <1111> (AB "inte"))
("<med>" <1112> (PP "med"))
("<utl{ndska>" <1113> (JJ POS UTR/NEU PLU IND/DEF NOM "utl{ndsk"))
("<aktier>" <1114> (NN UTR PLU IND NOM "aktie"))
("<.>" <1115> (DL MAD "."))
("<<p>>" <19489>)
("<F\RETAGSLEDNINGARNA>" <19490> (NN UTR PLU DEF NOM "f|retagsledning"))
("<[R>" <19491> (VB PRS AKT "vara"))
("<STILBILDARE>" <19492> (NN UTR PLU IND NOM "stilbildare"))
("<f|r>" <19493> (PP "f|r"))
("<de>" <19494> (DT UTR/NEU PLU DEF "den"))
("<anst{llda>" <19495> (NN UTR PLU IND NOM "anst{lld"))
("<i>" <19496> (PP "i"))
("<f|retagen>" <19497> (NN NEU PLU DEF NOM "f|retag"))
("<.>" <19498> (DL MAD "."))
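For readers who want to process records of the kind shown in the examples above, the following Python sketch reads them into simple tuples. The regular expression is inferred from the examples alone and is an assumption about the Record/Field format, not a specification of it; the names `Record` and `parse_records` are hypothetical. Structural records such as `("<<h>>" <17244>)`, which carry no analysis, are returned with an empty tag tuple and no lemma.

```python
import re
from typing import NamedTuple, Optional, Tuple

class Record(NamedTuple):
    token: str
    number: int
    tags: Tuple[str, ...]
    lemma: Optional[str]

# ("<token>" <number> (TAG FEATURE ... "lemma")), analysis part optional.
RECORD = re.compile(
    r'\("<(\S+?)>" <(\d+)>(?: \(([^"]*?) "(.*?)"\))?\)'
)

def parse_records(line):
    """Return all records found on one line of text."""
    records = []
    for m in RECORD.finditer(line):
        token, number, tags, lemma = m.groups()
        records.append(
            Record(token, int(number), tuple(tags.split()) if tags else (), lemma)
        )
    return records
```

The first element of the tag tuple is the part of speech (e.g. NN, VB, PP) and the remaining elements are the morphosyntactic features.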
The names of authors of articles, where such names occur, are always written in upper case throughout, and they are located either last in an article, or immediately preceding sections headed by BILDTEXT 'Picture Text', and based on these criteria they can be located by automatic means. However, typographically, author names are neither separated from the previous text by an empty line, nor are the lines on which they occur indented, and for these reasons author names do not constitute separate <h> or <p> elements of the text.
EXAMPLES
AUTHOR NAMES
("<.>" <14805> (DL MAD "."))
("<EVA-LENA>" <14806> (PM NOM "EVA-LENA"))
("<AHLQVIST>" <14807> (PM NOM "Ahlqvist"))
("<</p>>" <14808>)
("<<<<<930320-119780>>>>>" <14809>)
("<.>" <18692> (DL MAD "."))
("<PATRIK>" <18693> (PM NOM "PATRIK"))
("<ENGELLAU>" <18694> (PM NOM "Engellau"))
("<</p>>" <18695>)
("<<p>>" <18696>)
("<BILDTEXT>" <18697> (NN UTR SIN IND NOM "bildtext"))
("<:>" <18698> (DL MID ":"))
("<</p>>" <18699>)
("<,>" <18719> (DL MID ","))
("<skriver>" <18720> (VB PRS AKT "skriva"))
("<PATRIK>" <18721> (PM NOM "PATRIK"))
("<ENGELLAU>" <18722> (PM NOM "Engellau"))
("<</p>>" <18723>)
The data in the DI93.dis files consists of the results of tokenization, structural markup, lexical lookup and disambiguation. As mentioned in the introduction, these processes were applied to all 274 files, in order to test strategies for deriving or guessing the lexical analyses of the unknown word types. We were also interested in measuring the size of the unknown word problem in Swedish. By the term known word, we here mean one of the 107 103 word types observed in the 1 M word annotated SUC corpus, and by unknown word we mean any word not observed in that corpus.
The DI93 corpus of 5 M tokens consists of a total of 232 429 unique word types. Of these word types, 82 436 were known words also occurring in the SUC corpus, and 149 993 were unknown words. The known words covered 90% of the 5 M tokens, but only 35.5% of the total DI93 vocabulary. Analyses of known words were obtained by lookup in the frequency dictionary based on the SUC corpus. By contrast, the unknown words, while covering only 10% of the total corpus, constituted as much as 64.5% of the total DI93 vocabulary. The large number of unknown words is due to Swedish compounds being written as single orthographic words, rather than as several orthographic words separated by spaces.
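The vocabulary shares quoted above follow directly from the type counts, as the following short Python calculation shows. Only the three type counts are taken from the report; the variable names are ours.

```python
known_types = 82_436      # DI93 word types also observed in SUC
unknown_types = 149_993   # DI93 word types not observed in SUC

total_types = known_types + unknown_types

# Share of the DI93 vocabulary (types, not tokens), in percent.
known_share = 100 * known_types / total_types
unknown_share = 100 * unknown_types / total_types

print(total_types, round(known_share, 1), round(unknown_share, 1))
```

Note that these are shares of word types; the 90% and 10% figures refer to running tokens and cannot be derived from the type counts alone.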
The multiple strategies for deriving or guessing the lexical analyses of the 149 993 unknown word types consisted of several steps.
One step was preprocessing the entire corpus in order to locate highly probable proper names. Recognizing proper names in unrestricted text is a hard problem in any language, and in Swedish it is compounded by a large number of words, or sequences of words, that can be either proper names or ordinary definite descriptions, on the basis of their linguistic form. Examples are: "Handelsbanken", "Kanslihuset", "Matteus Fondkommission". While all names have initial upper case, the property of beginning with an upper case letter does not belong only to proper names, but also to compound words such as "A-aktien" and "OMX-index", which are clearly not proper names, but compound nouns. In the case of compounds like "Stockholmsb|rsen" 'the Stockholm stock exchange', where the first part of the compound is a proper name, it is an open question, or a matter of social conventions, whether to consider the whole compound to be a proper name or a singular, definite noun.
Other steps in providing analyses for unknown words consisted of lexical lookup in lexicons specially constructed by the author, consisting of variables followed by the final parts of observed compounds, variables followed by all possible derivational and inflectional morphemes, and finally variables followed by all possible non-empty inflectional morphemes.
The last step consisted of lexical lookup by an intelligent guesser, constructed by Magnus Astrom, which also has the property of deriving information about an unknown word from its final subsequence.
Following lexical lookup, which was total in the sense that it left no token unanalyzed, we applied probabilistic disambiguation using lexical probabilities, contextual probabilities (bi- and trigrams for tags) and added word specific statistics <wt,t>, <wt,wt>, all estimated from the SUC corpus.
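To make the disambiguation step more concrete, the following Python sketch shows a generic Viterbi search that combines lexical probabilities with trigram tag probabilities. It is not the tagger described in Astrom (1994): the function name `disambiguate`, the data layout, the floor value for unseen trigrams and all probabilities in the usage example are assumptions, and the word specific statistics <wt,t> and <wt,wt> are not modelled.

```python
import math

START = "<s>"  # padding tag for the positions before the first token

def disambiguate(cohorts, trigram, floor=1e-6):
    """Choose one tag per token.

    cohorts: one dict per token, mapping each candidate tag to its
             lexical probability (must be greater than zero).
    trigram: dict mapping (t1, t2, t3) to P(t3 | t1, t2); unseen
             trigrams receive the small probability `floor`.
    """
    # best maps the last two tags to (log score, best path so far).
    best = {(START, START): (0.0, [])}
    for cohort in cohorts:
        new_best = {}
        for (t1, t2), (score, path) in best.items():
            for tag, p_lex in cohort.items():
                p_ctx = trigram.get((t1, t2, tag), floor)
                cand = score + math.log(p_lex) + math.log(p_ctx)
                key = (t2, tag)
                if key not in new_best or cand > new_best[key][0]:
                    new_best[key] = (cand, path + [tag])
        best = new_best
    # Highest scoring complete path.
    return max(best.values(), key=lambda item: item[0])[1]
```

In a toy case where the first token is lexically more likely to be a pronoun (PN) than a determiner (DT), but a following noun strongly favours the determiner reading, the contextual probabilities override the lexical preference:

```python
cohorts = [{"PN": 0.6, "DT": 0.4}, {"NN": 1.0}]
trigram = {
    (START, START, "PN"): 0.3,
    (START, START, "DT"): 0.3,
    (START, "DT", "NN"): 0.8,
    (START, "PN", "NN"): 0.05,
}
print(disambiguate(cohorts, trigram))
```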
The result of applying all processing steps to the DI93 files was evaluated in the following way. One file, DI930320, which was of length 19 608 units and consisted of 62 articles/blocks, was picked at random, and the author did manual disambiguation of the prefinal version of that file, which had undergone all lexical lookup, but not the final, automatic disambiguation step. The manual disambiguation was done before the author had looked at the result of the automatic disambiguation, so as not to be influenced by that. In order for the result of manual disambiguation to be exactly comparable to the automatic disambiguation, which forces a choice of one analysis to be made for each token, the manual disambiguation was also done under the same constraint. In those relatively few cases where the lexical lookup had resulted in a set of analyses, of which none was appropriate, manual disambiguation picked the least bad analysis of those offered, instead of picking no analysis at all.
Using the manually disambiguated file as the reference file, we then applied a program that gives an automatic measure of agreement between an automatically disambiguated file and a manually disambiguated reference version. The result is reproduced below:
DI930320.dis - n=19608 (13101) correct : 95.45 % (894 errors, category : 67.89 % feature : 32.1 %)
--------------------
Correctness : 95.45 %
Categorial errors : 67.89 %
Feature errors : 32.1 %
The main result is thus that the fully automatic disambiguation agrees with human disambiguation in 95.45% of all cases, and disagrees in 4.55%. This is an encouraging result. However, a closer examination of the diff file by the author revealed that, in the author's own opinion, the author's manual disambiguation performance was not entirely error free. Measuring correctness is a difficult problem.
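For illustration, the kind of comparison made by an agreement program can be sketched in Python as follows. This is not the program that produced the figures above. In particular, the sketch assumes that a "categorial error" is a disagreement in the part of speech (the first element of a tag such as NN or VB) and that a "feature error" is a disagreement in the remaining features only; the report itself does not define the two terms, and the function name `agreement` is hypothetical.

```python
def agreement(reference, automatic):
    """Compare two equally long lists of tag strings such as
    'NN UTR SIN DEF NOM' and return summary figures."""
    if len(reference) != len(automatic):
        raise ValueError("the two files must have the same number of units")
    category_errors = 0
    feature_errors = 0
    for ref, auto in zip(reference, automatic):
        if ref == auto:
            continue
        # Assumption: the first element of a tag is its category.
        if ref.split()[:1] != auto.split()[:1]:
            category_errors += 1
        else:
            feature_errors += 1
    errors = category_errors + feature_errors
    n = len(reference)
    return {
        "n": n,
        "errors": errors,
        "correct_pct": 100 * (n - errors) / n if n else 0.0,
        "category_pct": 100 * category_errors / errors if errors else 0.0,
        "feature_pct": 100 * feature_errors / errors if errors else 0.0,
    }
```

The two error percentages are taken relative to the number of errors, not to the number of units, which matches the way the category and feature figures above sum to roughly 100%.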
We plan to have additional persons doing manual disambiguation of the same file in order to get a coefficient of agreement between human annotators of the same data, but this was not possible to do in the time that was available for the evaluation work.
Since defects in the input also affect the total outcome, further evaluation work will locate the errors in the input (misspelled "utl{ndsa", mistokenized words, etc.) and also locate defective cohorts that lack a correct analysis. On the basis of this, a more fine grained evaluation of the various processing components will be produced.
Astrom, M. (1994), A probabilistic tagger for Swedish using the SUC tagset. To appear in Proceedings of the Conference on Lexicon + Text, Lexicographica Series Maior, Niemeyer, Tuebingen.
Ejerhed, E., Kallgren, G., Wennstedt, O. and Astrom, M. (1992), The linguistic annotation system of the Stockholm-Umea Corpus project. Report 33 from the Department of General Linguistics, University of Umea (DGL-UUM-R-33), Umea.
Ejerhed, E. (1994), Design principles for a Swedish corpus annotation system. To appear in Proceedings of the Conference on Lexicon + Text, Lexicographica Series Maior, Niemeyer, Tuebingen.
TEI AI 1W2 (1991), List of common morphological features for inclusion in TEI starter set of grammatical annotation tags, TEI.
DI930107.dis - 16681 tokens: 13725 words, 1818 delims, 1138 meta (0 diffs).
1138 unanalyzed (1138 trivial), 15543 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930108.dis - 16487 tokens: 13661 words, 1672 delims, 1154 meta (0 diffs).
1154 unanalyzed (1154 trivial), 15333 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930109.dis - 16838 tokens: 13966 words, 1682 delims, 1190 meta (0 diffs).
1190 unanalyzed (1190 trivial), 15648 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930111.dis - 16127 tokens: 13426 words, 1663 delims, 1038 meta (0 diffs).
1038 unanalyzed (1038 trivial), 15089 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930112.dis - 19273 tokens: 15947 words, 1955 delims, 1371 meta (0 diffs).
1371 unanalyzed (1371 trivial), 17902 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930113.dis - 18681 tokens: 15486 words, 1922 delims, 1273 meta (0 diffs).
1273 unanalyzed (1273 trivial), 17408 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930114.dis - 17986 tokens: 14931 words, 1781 delims, 1274 meta (0 diffs).
1274 unanalyzed (1274 trivial), 16712 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930115.dis - 21832 tokens: 17944 words, 2283 delims, 1605 meta (0 diffs).
1605 unanalyzed (1605 trivial), 20227 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930116.dis - 18465 tokens: 15154 words, 1950 delims, 1361 meta (0 diffs).
1361 unanalyzed (1361 trivial), 17104 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930118.dis - 15903 tokens: 13291 words, 1618 delims, 994 meta (0 diffs).
994 unanalyzed (994 trivial), 14909 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930119.dis - 19351 tokens: 16004 words, 1974 delims, 1373 meta (0 diffs).
1373 unanalyzed (1373 trivial), 17978 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930120.dis - 19079 tokens: 15620 words, 2086 delims, 1373 meta (0 diffs).
1373 unanalyzed (1373 trivial), 17706 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930121.dis - 18450 tokens: 15306 words, 1829 delims, 1315 meta (0 diffs).
1315 unanalyzed (1315 trivial), 17135 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930122.dis - 20623 tokens: 16916 words, 2244 delims, 1463 meta (0 diffs).
1463 unanalyzed (1463 trivial), 19160 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930123.dis - 20525 tokens: 16831 words, 2179 delims, 1515 meta (0 diffs).
1515 unanalyzed (1515 trivial), 19010 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930125.dis - 16601 tokens: 13888 words, 1664 delims, 1049 meta (0 diffs).
1049 unanalyzed (1049 trivial), 15552 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930126.dis - 18409 tokens: 15126 words, 1972 delims, 1311 meta (0 diffs).
1311 unanalyzed (1311 trivial), 17098 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930127.dis - 19290 tokens: 15918 words, 1972 delims, 1400 meta (0 diffs).
1400 unanalyzed (1400 trivial), 17890 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930128.dis - 21157 tokens: 17378 words, 2239 delims, 1540 meta (0 diffs).
1540 unanalyzed (1540 trivial), 19617 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930129.dis - 20060 tokens: 16599 words, 2052 delims, 1409 meta (0 diffs).
1409 unanalyzed (1409 trivial), 18651 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930130.dis - 21606 tokens: 17820 words, 2273 delims, 1513 meta (0 diffs).
1513 unanalyzed (1513 trivial), 20093 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930201.dis - 13734 tokens: 11463 words, 1387 delims, 884 meta (0 diffs).
884 unanalyzed (884 trivial), 12850 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930202.dis - 20029 tokens: 16485 words, 2068 delims, 1476 meta (0 diffs).
1476 unanalyzed (1476 trivial), 18553 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930203.dis - 20578 tokens: 16864 words, 2248 delims, 1466 meta (0 diffs).
1466 unanalyzed (1466 trivial), 19112 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930204.dis - 18591 tokens: 15259 words, 1955 delims, 1377 meta (0 diffs).
1377 unanalyzed (1377 trivial), 17214 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930205.dis - 19406 tokens: 15970 words, 2067 delims, 1369 meta (0 diffs).
1369 unanalyzed (1369 trivial), 18037 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930206.dis - 18161 tokens: 14936 words, 1936 delims, 1289 meta (0 diffs).
1289 unanalyzed (1289 trivial), 16872 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930208.dis - 17559 tokens: 14461 words, 1899 delims, 1199 meta (0 diffs).
1199 unanalyzed (1199 trivial), 16360 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930209.dis - 20261 tokens: 16688 words, 2079 delims, 1494 meta (0 diffs).
1494 unanalyzed (1494 trivial), 18767 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930210.dis - 34657 tokens: 28527 words, 3664 delims, 2466 meta (0 diffs).
2466 unanalyzed (2466 trivial), 32191 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930211.dis - 18296 tokens: 15036 words, 1864 delims, 1396 meta (0 diffs).
1396 unanalyzed (1396 trivial), 16900 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930212.dis - 17753 tokens: 14600 words, 1869 delims, 1284 meta (0 diffs).
1284 unanalyzed (1284 trivial), 16469 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930213.dis - 17369 tokens: 14352 words, 1744 delims, 1273 meta (0 diffs).
1273 unanalyzed (1273 trivial), 16096 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930215.dis - 14110 tokens: 11825 words, 1360 delims, 925 meta (0 diffs).
925 unanalyzed (925 trivial), 13185 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930216.dis - 19731 tokens: 16168 words, 2063 delims, 1500 meta (0 diffs).
1500 unanalyzed (1500 trivial), 18231 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930217.dis - 20509 tokens: 16782 words, 2227 delims, 1500 meta (0 diffs).
1500 unanalyzed (1500 trivial), 19009 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930218.dis - 22135 tokens: 18116 words, 2300 delims, 1719 meta (0 diffs).
1719 unanalyzed (1719 trivial), 20416 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930219.dis - 18524 tokens: 15288 words, 1929 delims, 1307 meta (0 diffs).
1307 unanalyzed (1307 trivial), 17217 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930220.dis - 19220 tokens: 15841 words, 2010 delims, 1369 meta (0 diffs).
1369 unanalyzed (1369 trivial), 17851 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930222.dis - 16403 tokens: 13669 words, 1699 delims, 1035 meta (0 diffs).
1035 unanalyzed (1035 trivial), 15368 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930223.dis - 22567 tokens: 18559 words, 2365 delims, 1643 meta (0 diffs).
1643 unanalyzed (1643 trivial), 20924 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930224.dis - 25538 tokens: 20720 words, 2878 delims, 1940 meta (0 diffs).
1940 unanalyzed (1940 trivial), 23598 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930225.dis - 18143 tokens: 14837 words, 1958 delims, 1348 meta (0 diffs).
1348 unanalyzed (1348 trivial), 16795 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930226.dis - 18035 tokens: 14897 words, 1904 delims, 1234 meta (0 diffs).
1234 unanalyzed (1234 trivial), 16801 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930227.dis - 18254 tokens: 14865 words, 1976 delims, 1413 meta (0 diffs).
1413 unanalyzed (1413 trivial), 16841 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930301.dis - 13251 tokens: 11182 words, 1279 delims, 790 meta (0 diffs).
790 unanalyzed (790 trivial), 12461 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930302.dis - 16230 tokens: 13568 words, 1561 delims, 1101 meta (0 diffs).
1101 unanalyzed (1101 trivial), 15129 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930303.dis - 16991 tokens: 13909 words, 1773 delims, 1309 meta (0 diffs).
1309 unanalyzed (1309 trivial), 15682 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930304.dis - 16580 tokens: 13708 words, 1690 delims, 1182 meta (0 diffs).
1182 unanalyzed (1182 trivial), 15398 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930305.dis - 16945 tokens: 14020 words, 1736 delims, 1189 meta (0 diffs).
1189 unanalyzed (1189 trivial), 15756 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930306.dis - 17365 tokens: 14348 words, 1799 delims, 1218 meta (0 diffs).
1218 unanalyzed (1218 trivial), 16147 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930308.dis - 15243 tokens: 12717 words, 1580 delims, 946 meta (0 diffs).
946 unanalyzed (946 trivial), 14297 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930309.dis - 20388 tokens: 16781 words, 2170 delims, 1437 meta (0 diffs).
1437 unanalyzed (1437 trivial), 18951 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930310.dis - 19753 tokens: 16297 words, 2032 delims, 1424 meta (0 diffs).
1424 unanalyzed (1424 trivial), 18329 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930311.dis - 22450 tokens: 18590 words, 2221 delims, 1639 meta (0 diffs).
1639 unanalyzed (1639 trivial), 20811 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930312.dis - 20144 tokens: 16625 words, 2134 delims, 1385 meta (0 diffs).
1385 unanalyzed (1385 trivial), 18759 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930313.dis - 19712 tokens: 16166 words, 2135 delims, 1411 meta (0 diffs).
1411 unanalyzed (1411 trivial), 18301 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930315.dis - 18641 tokens: 15581 words, 1868 delims, 1192 meta (0 diffs).
1192 unanalyzed (1192 trivial), 17449 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930316.dis - 20750 tokens: 17008 words, 2157 delims, 1585 meta (0 diffs).
1585 unanalyzed (1585 trivial), 19165 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930317.dis - 20536 tokens: 16800 words, 2155 delims, 1581 meta (0 diffs).
1581 unanalyzed (1581 trivial), 18955 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930318.dis - 20859 tokens: 17107 words, 2193 delims, 1559 meta (0 diffs).
1559 unanalyzed (1559 trivial), 19300 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930319.dis - 21412 tokens: 17834 words, 2135 delims, 1443 meta (0 diffs).
1443 unanalyzed (1443 trivial), 19969 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930320.dis - 19608 tokens: 15922 words, 2212 delims, 1474 meta (0 diffs).
1474 unanalyzed (1474 trivial), 18134 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930322.dis - 16884 tokens: 14097 words, 1693 delims, 1094 meta (0 diffs).
1094 unanalyzed (1094 trivial), 15790 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930323.dis - 24507 tokens: 20260 words, 2582 delims, 1665 meta (0 diffs).
1665 unanalyzed (1665 trivial), 22842 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930324.dis - 21302 tokens: 17362 words, 2344 delims, 1596 meta (0 diffs).
1596 unanalyzed (1596 trivial), 19706 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930325.dis - 21031 tokens: 17306 words, 2182 delims, 1543 meta (0 diffs).
1543 unanalyzed (1543 trivial), 19488 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930326.dis - 20981 tokens: 17358 words, 2153 delims, 1470 meta (0 diffs).
1470 unanalyzed (1470 trivial), 19511 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930327.dis - 17632 tokens: 14611 words, 1787 delims, 1234 meta (0 diffs).
1234 unanalyzed (1234 trivial), 16398 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930329.dis - 13649 tokens: 11261 words, 1480 delims, 908 meta (0 diffs).
908 unanalyzed (908 trivial), 12741 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930330.dis - 18614 tokens: 15367 words, 1912 delims, 1335 meta (0 diffs).
1335 unanalyzed (1335 trivial), 17279 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930331.dis - 19717 tokens: 16328 words, 2022 delims, 1367 meta (0 diffs).
1367 unanalyzed (1367 trivial), 18350 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930401.dis - 21544 tokens: 17557 words, 2278 delims, 1709 meta (0 diffs).
1709 unanalyzed (1709 trivial), 19835 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930402.dis - 22114 tokens: 18038 words, 2422 delims, 1654 meta (0 diffs).
1654 unanalyzed (1654 trivial), 20460 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930403.dis - 18170 tokens: 15029 words, 1841 delims, 1300 meta (0 diffs).
1300 unanalyzed (1300 trivial), 16870 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930405.dis - 16417 tokens: 13630 words, 1658 delims, 1129 meta (0 diffs).
1129 unanalyzed (1129 trivial), 15288 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930406.dis - 17232 tokens: 14115 words, 1810 delims, 1307 meta (0 diffs).
1307 unanalyzed (1307 trivial), 15925 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930407.dis - 15404 tokens: 12536 words, 1681 delims, 1187 meta (0 diffs).
1187 unanalyzed (1187 trivial), 14217 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930408.dis - 15260 tokens: 12562 words, 1560 delims, 1138 meta (0 diffs).
1138 unanalyzed (1138 trivial), 14122 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930413.dis - 13328 tokens: 11143 words, 1329 delims, 856 meta (0 diffs).
856 unanalyzed (856 trivial), 12472 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI930414.dis - 21530 tokens: 18046 words, 2035 delims, 1449 meta (0 diffs).
1449 unanalyzed (1449 trivial), 20081 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930415.dis - 20442 tokens: 16825 words, 2097 delims, 1520 meta (0 diffs). 1520 unanalyzed (1520 trivial), 18922 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930416.dis - 19950 tokens: 16435 words, 2068 delims, 1447 meta (0 diffs). 1447 unanalyzed (1447 trivial), 18503 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930417.dis - 16218 tokens: 13374 words, 1697 delims, 1147 meta (0 diffs). 1147 unanalyzed (1147 trivial), 15071 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930419.dis - 17207 tokens: 14444 words, 1709 delims, 1054 meta (0 diffs). 1054 unanalyzed (1054 trivial), 16153 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930420.dis - 20313 tokens: 16766 words, 2080 delims, 1467 meta (0 diffs). 1467 unanalyzed (1467 trivial), 18846 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930421.dis - 21457 tokens: 17684 words, 2244 delims, 1529 meta (0 diffs). 1529 unanalyzed (1529 trivial), 19928 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930422.dis - 21663 tokens: 17781 words, 2234 delims, 1648 meta (0 diffs). 1648 unanalyzed (1648 trivial), 20015 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930423.dis - 21587 tokens: 17805 words, 2242 delims, 1540 meta (0 diffs). 1540 unanalyzed (1540 trivial), 20047 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930424.dis - 18883 tokens: 15547 words, 1978 delims, 1358 meta (0 diffs). 1358 unanalyzed (1358 trivial), 17525 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930426.dis - 18143 tokens: 14998 words, 1916 delims, 1229 meta (0 diffs). 
1229 unanalyzed (1229 trivial), 16914 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930427.dis - 19199 tokens: 15834 words, 1983 delims, 1382 meta (0 diffs). 1382 unanalyzed (1382 trivial), 17817 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930428.dis - 21544 tokens: 17816 words, 2166 delims, 1562 meta (0 diffs). 1562 unanalyzed (1562 trivial), 19982 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930429.dis - 19132 tokens: 15592 words, 2053 delims, 1487 meta (0 diffs). 1487 unanalyzed (1487 trivial), 17645 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930430.dis - 35287 tokens: 28888 words, 3694 delims, 2705 meta (0 diffs). 2705 unanalyzed (2705 trivial), 32582 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930503.dis - 12528 tokens: 10444 words, 1309 delims, 775 meta (0 diffs). 775 unanalyzed (775 trivial), 11753 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930504.dis - 20526 tokens: 17016 words, 2023 delims, 1487 meta (0 diffs). 1487 unanalyzed (1487 trivial), 19039 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930505.dis - 21252 tokens: 17595 words, 2112 delims, 1545 meta (0 diffs). 1545 unanalyzed (1545 trivial), 19707 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930506.dis - 20424 tokens: 16779 words, 2114 delims, 1531 meta (0 diffs). 1531 unanalyzed (1531 trivial), 18893 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930507.dis - 21381 tokens: 17570 words, 2337 delims, 1474 meta (0 diffs). 1474 unanalyzed (1474 trivial), 19907 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930508.dis - 17998 tokens: 14870 words, 1883 delims, 1245 meta (0 diffs). 
1245 unanalyzed (1245 trivial), 16753 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930510.dis - 15725 tokens: 13108 words, 1685 delims, 932 meta (0 diffs). 932 unanalyzed (932 trivial), 14793 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930511.dis - 20546 tokens: 16860 words, 2141 delims, 1545 meta (0 diffs). 1545 unanalyzed (1545 trivial), 19001 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930512.dis - 20302 tokens: 16622 words, 2189 delims, 1491 meta (0 diffs). 1491 unanalyzed (1491 trivial), 18811 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930513.dis - 23026 tokens: 18888 words, 2408 delims, 1730 meta (0 diffs). 1730 unanalyzed (1730 trivial), 21296 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930514.dis - 19838 tokens: 16357 words, 2081 delims, 1400 meta (0 diffs). 1400 unanalyzed (1400 trivial), 18438 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930515.dis - 20245 tokens: 16650 words, 2096 delims, 1499 meta (0 diffs). 1499 unanalyzed (1499 trivial), 18746 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930517.dis - 17367 tokens: 14450 words, 1772 delims, 1145 meta (0 diffs). 1145 unanalyzed (1145 trivial), 16222 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930518.dis - 19769 tokens: 16273 words, 2074 delims, 1422 meta (0 diffs). 1422 unanalyzed (1422 trivial), 18347 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930519.dis - 17691 tokens: 14576 words, 1788 delims, 1327 meta (0 diffs). 1327 unanalyzed (1327 trivial), 16364 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930524.dis - 17913 tokens: 15053 words, 1753 delims, 1107 meta (0 diffs). 
1107 unanalyzed (1107 trivial), 16806 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930525.dis - 17021 tokens: 14078 words, 1736 delims, 1207 meta (0 diffs). 1207 unanalyzed (1207 trivial), 15814 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930526.dis - 20865 tokens: 17280 words, 2193 delims, 1392 meta (0 diffs). 1392 unanalyzed (1392 trivial), 19473 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930527.dis - 21435 tokens: 17795 words, 2133 delims, 1507 meta (0 diffs). 1507 unanalyzed (1507 trivial), 19928 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930528.dis - 18221 tokens: 14982 words, 1953 delims, 1286 meta (0 diffs). 1286 unanalyzed (1286 trivial), 16935 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930529.dis - 21137 tokens: 17459 words, 2151 delims, 1527 meta (0 diffs). 1527 unanalyzed (1527 trivial), 19610 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930601.dis - 17232 tokens: 14425 words, 1769 delims, 1038 meta (0 diffs). 1038 unanalyzed (1038 trivial), 16194 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930602.dis - 21598 tokens: 17971 words, 2260 delims, 1367 meta (0 diffs). 1367 unanalyzed (1367 trivial), 20231 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930603.dis - 20650 tokens: 17113 words, 2141 delims, 1396 meta (0 diffs). 1396 unanalyzed (1396 trivial), 19254 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930604.dis - 19359 tokens: 16016 words, 2084 delims, 1259 meta (0 diffs). 1259 unanalyzed (1259 trivial), 18100 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930605.dis - 20793 tokens: 17274 words, 2127 delims, 1392 meta (0 diffs). 
1392 unanalyzed (1392 trivial), 19401 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930607.dis - 13902 tokens: 11658 words, 1327 delims, 917 meta (0 diffs). 917 unanalyzed (917 trivial), 12985 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930608.dis - 20958 tokens: 17245 words, 2296 delims, 1417 meta (0 diffs). 1417 unanalyzed (1417 trivial), 19541 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930609.dis - 21789 tokens: 18027 words, 2327 delims, 1435 meta (0 diffs). 1435 unanalyzed (1435 trivial), 20354 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930610.dis - 18270 tokens: 15257 words, 1796 delims, 1217 meta (0 diffs). 1217 unanalyzed (1217 trivial), 17053 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930611.dis - 19685 tokens: 16237 words, 2065 delims, 1383 meta (0 diffs). 1383 unanalyzed (1383 trivial), 18302 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930612.dis - 18820 tokens: 15695 words, 1888 delims, 1237 meta (0 diffs). 1237 unanalyzed (1237 trivial), 17583 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930614.dis - 16198 tokens: 13578 words, 1652 delims, 968 meta (0 diffs). 968 unanalyzed (968 trivial), 15230 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930615.dis - 20583 tokens: 16856 words, 2304 delims, 1423 meta (0 diffs). 1423 unanalyzed (1423 trivial), 19160 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930616.dis - 14581 tokens: 12056 words, 1505 delims, 1020 meta (0 diffs). 1020 unanalyzed (1020 trivial), 13561 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930617.dis - 18530 tokens: 15248 words, 1922 delims, 1360 meta (0 diffs). 
1360 unanalyzed (1360 trivial), 17170 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930618.dis - 20931 tokens: 17378 words, 2105 delims, 1448 meta (0 diffs). 1448 unanalyzed (1448 trivial), 19483 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930619.dis - 17076 tokens: 14156 words, 1744 delims, 1176 meta (0 diffs). 1176 unanalyzed (1176 trivial), 15900 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930621.dis - 16345 tokens: 13676 words, 1678 delims, 991 meta (0 diffs). 991 unanalyzed (991 trivial), 15354 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930622.dis - 17156 tokens: 14189 words, 1764 delims, 1203 meta (0 diffs). 1203 unanalyzed (1203 trivial), 15953 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930623.dis - 19770 tokens: 16252 words, 2163 delims, 1355 meta (0 diffs). 1355 unanalyzed (1355 trivial), 18415 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930624.dis - 19072 tokens: 15923 words, 1856 delims, 1293 meta (0 diffs). 1293 unanalyzed (1293 trivial), 17779 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930628.dis - 14993 tokens: 12634 words, 1432 delims, 927 meta (0 diffs). 927 unanalyzed (927 trivial), 14066 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930629.dis - 14111 tokens: 11699 words, 1447 delims, 965 meta (0 diffs). 965 unanalyzed (965 trivial), 13146 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930630.dis - 30369 tokens: 25016 words, 3152 delims, 2201 meta (0 diffs). 2201 unanalyzed (2201 trivial), 28168 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930701.dis - 16269 tokens: 13303 words, 1778 delims, 1188 meta (0 diffs). 
1188 unanalyzed (1188 trivial), 15081 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930702.dis - 14574 tokens: 12109 words, 1512 delims, 953 meta (0 diffs). 953 unanalyzed (953 trivial), 13621 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930709.dis - 13401 tokens: 11187 words, 1343 delims, 871 meta (0 diffs). 871 unanalyzed (871 trivial), 12530 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930716.dis - 14772 tokens: 12341 words, 1490 delims, 941 meta (0 diffs). 941 unanalyzed (941 trivial), 13831 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930723.dis - 14836 tokens: 12144 words, 1589 delims, 1103 meta (0 diffs). 1103 unanalyzed (1103 trivial), 13733 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930730.dis - 14169 tokens: 11724 words, 1404 delims, 1041 meta (0 diffs). 1041 unanalyzed (1041 trivial), 13128 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930731.dis - 14655 tokens: 12166 words, 1480 delims, 1009 meta (0 diffs). 1009 unanalyzed (1009 trivial), 13646 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930802.dis - 12658 tokens: 10663 words, 1245 delims, 750 meta (0 diffs). 750 unanalyzed (750 trivial), 11908 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930803.dis - 12996 tokens: 10788 words, 1302 delims, 906 meta (0 diffs). 906 unanalyzed (906 trivial), 12090 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930804.dis - 14564 tokens: 12026 words, 1478 delims, 1060 meta (0 diffs). 1060 unanalyzed (1060 trivial), 13504 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930805.dis - 14873 tokens: 12318 words, 1542 delims, 1013 meta (0 diffs). 
1013 unanalyzed (1013 trivial), 13860 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930806.dis - 12513 tokens: 10399 words, 1280 delims, 834 meta (0 diffs). 834 unanalyzed (834 trivial), 11679 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930807.dis - 14186 tokens: 11878 words, 1362 delims, 946 meta (0 diffs). 946 unanalyzed (946 trivial), 13240 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930809.dis - 8672 tokens: 7182 words, 891 delims, 599 meta (0 diffs). 599 unanalyzed (599 trivial), 8073 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930810.dis - 13996 tokens: 11629 words, 1432 delims, 935 meta (0 diffs). 935 unanalyzed (935 trivial), 13061 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930811.dis - 14679 tokens: 12178 words, 1477 delims, 1024 meta (0 diffs). 1024 unanalyzed (1024 trivial), 13655 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930812.dis - 17825 tokens: 14801 words, 1772 delims, 1252 meta (0 diffs). 1252 unanalyzed (1252 trivial), 16573 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930813.dis - 18230 tokens: 15078 words, 1866 delims, 1286 meta (0 diffs). 1286 unanalyzed (1286 trivial), 16944 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930814.dis - 16910 tokens: 14034 words, 1639 delims, 1237 meta (0 diffs). 1237 unanalyzed (1237 trivial), 15673 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930816.dis - 2969 tokens: 2464 words, 349 delims, 156 meta (0 diffs). 156 unanalyzed (156 trivial), 2813 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930817.dis - 17210 tokens: 14218 words, 1725 delims, 1267 meta (0 diffs). 
1267 unanalyzed (1267 trivial), 15943 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930818.dis - 48719 tokens: 40254 words, 5063 delims, 3402 meta (0 diffs). 3402 unanalyzed (3402 trivial), 45317 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930819.dis - 18643 tokens: 15378 words, 1907 delims, 1358 meta (0 diffs). 1358 unanalyzed (1358 trivial), 17285 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930820.dis - 18788 tokens: 15436 words, 2024 delims, 1328 meta (0 diffs). 1328 unanalyzed (1328 trivial), 17460 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930821.dis - 1961 tokens: 1654 words, 179 delims, 128 meta (0 diffs). 128 unanalyzed (128 trivial), 1833 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930823.dis - 14444 tokens: 12179 words, 1412 delims, 853 meta (0 diffs). 853 unanalyzed (853 trivial), 13591 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930824.dis - 17624 tokens: 14629 words, 1781 delims, 1214 meta (0 diffs). 1214 unanalyzed (1214 trivial), 16410 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930825.dis - 20644 tokens: 17004 words, 2199 delims, 1441 meta (0 diffs). 1441 unanalyzed (1441 trivial), 19203 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930826.dis - 19745 tokens: 16290 words, 2057 delims, 1398 meta (0 diffs). 1398 unanalyzed (1398 trivial), 18347 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930827.dis - 19331 tokens: 16018 words, 2044 delims, 1269 meta (0 diffs). 1269 unanalyzed (1269 trivial), 18062 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930828.dis - 19110 tokens: 15871 words, 1953 delims, 1286 meta (0 diffs). 
1286 unanalyzed (1286 trivial), 17824 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930830.dis - 14650 tokens: 12349 words, 1421 delims, 880 meta (0 diffs). 880 unanalyzed (880 trivial), 13770 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930831.dis - 18097 tokens: 14882 words, 1884 delims, 1331 meta (0 diffs). 1331 unanalyzed (1331 trivial), 16766 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930901.dis - 23180 tokens: 19100 words, 2471 delims, 1609 meta (0 diffs). 1609 unanalyzed (1609 trivial), 21571 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930902.dis - 18061 tokens: 15096 words, 1751 delims, 1214 meta (0 diffs). 1214 unanalyzed (1214 trivial), 16847 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930903.dis - 19239 tokens: 15947 words, 1999 delims, 1293 meta (0 diffs). 1293 unanalyzed (1293 trivial), 17946 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930904.dis - 19534 tokens: 16120 words, 2068 delims, 1346 meta (0 diffs). 1346 unanalyzed (1346 trivial), 18188 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930906.dis - 15872 tokens: 13260 words, 1614 delims, 998 meta (0 diffs). 998 unanalyzed (998 trivial), 14874 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930907.dis - 18540 tokens: 15340 words, 1985 delims, 1215 meta (0 diffs). 1215 unanalyzed (1215 trivial), 17325 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930908.dis - 19166 tokens: 15981 words, 1932 delims, 1253 meta (0 diffs). 1253 unanalyzed (1253 trivial), 17913 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930909.dis - 21671 tokens: 17977 words, 2220 delims, 1474 meta (0 diffs). 
1474 unanalyzed (1474 trivial), 20197 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930910.dis - 21242 tokens: 17809 words, 2107 delims, 1326 meta (0 diffs). 1326 unanalyzed (1326 trivial), 19916 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930911.dis - 19600 tokens: 16351 words, 2102 delims, 1147 meta (0 diffs). 1147 unanalyzed (1147 trivial), 18453 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930913.dis - 14355 tokens: 12105 words, 1415 delims, 835 meta (0 diffs). 835 unanalyzed (835 trivial), 13520 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930914.dis - 19961 tokens: 16634 words, 2031 delims, 1296 meta (0 diffs). 1296 unanalyzed (1296 trivial), 18665 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930915.dis - 21378 tokens: 17939 words, 2205 delims, 1234 meta (0 diffs). 1234 unanalyzed (1234 trivial), 20144 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930916.dis - 21050 tokens: 17504 words, 2232 delims, 1314 meta (0 diffs). 1314 unanalyzed (1314 trivial), 19736 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930917.dis - 20649 tokens: 17275 words, 2085 delims, 1289 meta (0 diffs). 1289 unanalyzed (1289 trivial), 19360 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930918.dis - 17536 tokens: 14538 words, 1881 delims, 1117 meta (0 diffs). 1117 unanalyzed (1117 trivial), 16419 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930920.dis - 11749 tokens: 9828 words, 1208 delims, 713 meta (0 diffs). 713 unanalyzed (713 trivial), 11036 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930921.dis - 17900 tokens: 14861 words, 1908 delims, 1131 meta (0 diffs). 
1131 unanalyzed (1131 trivial), 16769 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930922.dis - 22589 tokens: 18867 words, 2396 delims, 1326 meta (0 diffs). 1326 unanalyzed (1326 trivial), 21263 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930923.dis - 17141 tokens: 14393 words, 1733 delims, 1015 meta (0 diffs). 1015 unanalyzed (1015 trivial), 16126 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930924.dis - 19793 tokens: 16318 words, 2261 delims, 1214 meta (0 diffs). 1214 unanalyzed (1214 trivial), 18579 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930925.dis - 19288 tokens: 15963 words, 2071 delims, 1254 meta (0 diffs). 1254 unanalyzed (1254 trivial), 18034 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930927.dis - 16095 tokens: 13640 words, 1630 delims, 825 meta (0 diffs). 825 unanalyzed (825 trivial), 15270 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930928.dis - 19265 tokens: 16063 words, 2031 delims, 1171 meta (0 diffs). 1171 unanalyzed (1171 trivial), 18094 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930929.dis - 20541 tokens: 17131 words, 2184 delims, 1226 meta (0 diffs). 1226 unanalyzed (1226 trivial), 19315 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI930930.dis - 38941 tokens: 32162 words, 4080 delims, 2699 meta (0 diffs). 2699 unanalyzed (2699 trivial), 36242 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931001.dis - 22287 tokens: 18636 words, 2296 delims, 1355 meta (0 diffs). 1355 unanalyzed (1355 trivial), 20932 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931002.dis - 19177 tokens: 16164 words, 1846 delims, 1167 meta (0 diffs). 
1167 unanalyzed (1167 trivial), 18010 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931004.dis - 15755 tokens: 13266 words, 1559 delims, 930 meta (0 diffs). 930 unanalyzed (930 trivial), 14825 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931005.dis - 18588 tokens: 15443 words, 2019 delims, 1126 meta (0 diffs). 1126 unanalyzed (1126 trivial), 17462 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931006.dis - 22065 tokens: 18534 words, 2198 delims, 1333 meta (0 diffs). 1333 unanalyzed (1333 trivial), 20732 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931007.dis - 20591 tokens: 17254 words, 2060 delims, 1277 meta (0 diffs). 1277 unanalyzed (1277 trivial), 19314 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931008.dis - 20418 tokens: 16970 words, 2179 delims, 1269 meta (0 diffs). 1269 unanalyzed (1269 trivial), 19149 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931009.dis - 19609 tokens: 16367 words, 2056 delims, 1186 meta (0 diffs). 1186 unanalyzed (1186 trivial), 18423 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931011.dis - 15500 tokens: 12849 words, 1682 delims, 969 meta (0 diffs). 969 unanalyzed (969 trivial), 14531 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931012.dis - 20400 tokens: 16900 words, 2210 delims, 1290 meta (0 diffs). 1290 unanalyzed (1290 trivial), 19110 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931013.dis - 20032 tokens: 16723 words, 2082 delims, 1227 meta (0 diffs). 1227 unanalyzed (1227 trivial), 18805 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931014.dis - 20407 tokens: 17002 words, 2091 delims, 1314 meta (0 diffs). 
1314 unanalyzed (1314 trivial), 19093 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931015.dis - 28485 tokens: 23462 words, 3347 delims, 1676 meta (0 diffs). 1676 unanalyzed (1676 trivial), 26809 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931016.dis - 17990 tokens: 15170 words, 1822 delims, 998 meta (0 diffs). 998 unanalyzed (998 trivial), 16992 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931018.dis - 15161 tokens: 12842 words, 1477 delims, 842 meta (0 diffs). 842 unanalyzed (842 trivial), 14319 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931019.dis - 20194 tokens: 16886 words, 2096 delims, 1212 meta (0 diffs). 1212 unanalyzed (1212 trivial), 18982 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931020.dis - 20047 tokens: 16717 words, 2182 delims, 1148 meta (0 diffs). 1148 unanalyzed (1148 trivial), 18899 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931021.dis - 22524 tokens: 18974 words, 2249 delims, 1301 meta (0 diffs). 1301 unanalyzed (1301 trivial), 21223 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931022.dis - 20502 tokens: 17176 words, 2133 delims, 1193 meta (0 diffs). 1193 unanalyzed (1193 trivial), 19309 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931023.dis - 16925 tokens: 14177 words, 1775 delims, 973 meta (0 diffs). 973 unanalyzed (973 trivial), 15952 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931025.dis - 15987 tokens: 13433 words, 1707 delims, 847 meta (0 diffs). 847 unanalyzed (847 trivial), 15140 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931026.dis - 20463 tokens: 17136 words, 2156 delims, 1171 meta (0 diffs). 
1171 unanalyzed (1171 trivial), 19292 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931027.dis - 21096 tokens: 17615 words, 2233 delims, 1248 meta (0 diffs). 1248 unanalyzed (1248 trivial), 19848 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931028.dis - 23682 tokens: 19896 words, 2385 delims, 1401 meta (0 diffs). 1401 unanalyzed (1401 trivial), 22281 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931029.dis - 20695 tokens: 17336 words, 2147 delims, 1212 meta (0 diffs). 1212 unanalyzed (1212 trivial), 19483 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931030.dis - 19642 tokens: 16521 words, 2014 delims, 1107 meta (0 diffs). 1107 unanalyzed (1107 trivial), 18535 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931101.dis - 16345 tokens: 13752 words, 1704 delims, 889 meta (0 diffs). 889 unanalyzed (889 trivial), 15456 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931102.dis - 23814 tokens: 19924 words, 2473 delims, 1417 meta (0 diffs). 1417 unanalyzed (1417 trivial), 22397 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931103.dis - 21770 tokens: 18103 words, 2312 delims, 1355 meta (0 diffs). 1355 unanalyzed (1355 trivial), 20415 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931104.dis - 20287 tokens: 16932 words, 2143 delims, 1212 meta (0 diffs). 1212 unanalyzed (1212 trivial), 19075 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931105.dis - 22045 tokens: 18535 words, 2291 delims, 1219 meta (0 diffs). 1219 unanalyzed (1219 trivial), 20826 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931108.dis - 15421 tokens: 12920 words, 1620 delims, 881 meta (0 diffs). 
881 unanalyzed (881 trivial), 14540 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931109.dis - 20175 tokens: 16810 words, 2115 delims, 1250 meta (0 diffs). 1250 unanalyzed (1250 trivial), 18925 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931110.dis - 23037 tokens: 19417 words, 2259 delims, 1361 meta (0 diffs). 1361 unanalyzed (1361 trivial), 21676 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931111.dis - 21338 tokens: 17881 words, 2196 delims, 1261 meta (0 diffs). 1261 unanalyzed (1261 trivial), 20077 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931112.dis - 19720 tokens: 16509 words, 2108 delims, 1103 meta (0 diffs). 1103 unanalyzed (1103 trivial), 18617 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931113.dis - 20979 tokens: 17544 words, 2222 delims, 1213 meta (0 diffs). 1213 unanalyzed (1213 trivial), 19766 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931115.dis - 15947 tokens: 13404 words, 1695 delims, 848 meta (0 diffs). 848 unanalyzed (848 trivial), 15099 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931116.dis - 21014 tokens: 17470 words, 2267 delims, 1277 meta (0 diffs). 1277 unanalyzed (1277 trivial), 19737 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931117.dis - 19134 tokens: 16174 words, 1916 delims, 1044 meta (0 diffs). 1044 unanalyzed (1044 trivial), 18090 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931118.dis - 24259 tokens: 20350 words, 2519 delims, 1390 meta (0 diffs). 1390 unanalyzed (1390 trivial), 22869 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last DI931119.dis - 25388 tokens: 21392 words, 2666 delims, 1330 meta (0 diffs). 
1330 unanalyzed (1330 trivial), 24058 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931120.dis - 21414 tokens: 18005 words, 2177 delims, 1232 meta (0 diffs).
1232 unanalyzed (1232 trivial), 20182 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931122.dis - 17257 tokens: 14577 words, 1780 delims, 900 meta (0 diffs).
900 unanalyzed (900 trivial), 16357 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931123.dis - 22692 tokens: 18912 words, 2431 delims, 1349 meta (0 diffs).
1349 unanalyzed (1349 trivial), 21343 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931124.dis - 21859 tokens: 18129 words, 2455 delims, 1275 meta (0 diffs).
1275 unanalyzed (1275 trivial), 20584 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931125.dis - 20213 tokens: 17066 words, 2049 delims, 1098 meta (0 diffs).
1098 unanalyzed (1098 trivial), 19115 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931126.dis - 20254 tokens: 16961 words, 2102 delims, 1191 meta (0 diffs).
1191 unanalyzed (1191 trivial), 19063 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931127.dis - 18321 tokens: 15390 words, 1839 delims, 1092 meta (0 diffs).
1092 unanalyzed (1092 trivial), 17229 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931129.dis - 17924 tokens: 15294 words, 1719 delims, 911 meta (0 diffs).
911 unanalyzed (911 trivial), 17013 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931130.dis - 32001 tokens: 26826 words, 3260 delims, 1915 meta (0 diffs).
1915 unanalyzed (1915 trivial), 30086 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931201.dis - 23537 tokens: 19726 words, 2431 delims, 1380 meta (0 diffs).
1380 unanalyzed (1380 trivial), 22157 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931202.dis - 18194 tokens: 15133 words, 1850 delims, 1211 meta (0 diffs).
1211 unanalyzed (1211 trivial), 16983 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931203.dis - 18248 tokens: 15204 words, 1973 delims, 1071 meta (0 diffs).
1071 unanalyzed (1071 trivial), 17177 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931204.dis - 19114 tokens: 16030 words, 2009 delims, 1075 meta (0 diffs).
1075 unanalyzed (1075 trivial), 18039 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931206.dis - 16541 tokens: 13887 words, 1689 delims, 965 meta (0 diffs).
965 unanalyzed (965 trivial), 15576 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931207.dis - 20236 tokens: 16837 words, 2186 delims, 1213 meta (0 diffs).
1213 unanalyzed (1213 trivial), 19023 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931208.dis - 22485 tokens: 18881 words, 2316 delims, 1288 meta (0 diffs).
1288 unanalyzed (1288 trivial), 21197 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931209.dis - 19255 tokens: 16212 words, 1926 delims, 1117 meta (0 diffs).
1117 unanalyzed (1117 trivial), 18138 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931210.dis - 17670 tokens: 14843 words, 1820 delims, 1007 meta (0 diffs).
1007 unanalyzed (1007 trivial), 16663 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931211.dis - 17122 tokens: 14312 words, 1737 delims, 1073 meta (0 diffs).
1073 unanalyzed (1073 trivial), 16049 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931213.dis - 15360 tokens: 12952 words, 1563 delims, 845 meta (0 diffs).
845 unanalyzed (845 trivial), 14515 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931214.dis - 18666 tokens: 15627 words, 1991 delims, 1048 meta (0 diffs).
1048 unanalyzed (1048 trivial), 17618 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931215.dis - 19345 tokens: 16193 words, 1982 delims, 1170 meta (0 diffs).
1170 unanalyzed (1170 trivial), 18175 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931216.dis - 37804 tokens: 31808 words, 3774 delims, 2222 meta (0 diffs).
2222 unanalyzed (2222 trivial), 35582 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931217.dis - 17082 tokens: 14409 words, 1685 delims, 988 meta (0 diffs).
988 unanalyzed (988 trivial), 16094 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931218.dis - 15685 tokens: 13254 words, 1517 delims, 914 meta (0 diffs).
914 unanalyzed (914 trivial), 14771 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931220.dis - 12944 tokens: 11003 words, 1252 delims, 689 meta (0 diffs).
689 unanalyzed (689 trivial), 12255 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931221.dis - 16754 tokens: 14166 words, 1633 delims, 955 meta (0 diffs).
955 unanalyzed (955 trivial), 15799 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931222.dis - 14774 tokens: 12290 words, 1601 delims, 883 meta (0 diffs).
883 unanalyzed (883 trivial), 13891 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931223.dis - 13504 tokens: 11264 words, 1442 delims, 798 meta (0 diffs).
798 unanalyzed (798 trivial), 12706 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931227.dis - 17254 tokens: 14559 words, 1705 delims, 990 meta (0 diffs).
990 unanalyzed (990 trivial), 16264 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931228.dis - 11779 tokens: 9955 words, 1196 delims, 628 meta (0 diffs).
628 unanalyzed (628 trivial), 11151 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931229.dis - 12773 tokens: 10760 words, 1300 delims, 713 meta (0 diffs).
713 unanalyzed (713 trivial), 12060 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
DI931230.dis - 16107 tokens: 13501 words, 1701 delims, 905 meta (0 diffs).
905 unanalyzed (905 trivial), 15202 unmarked (0 trivial), 0 poly-marked, consecutively numbered, lemmas last
TOTAL: 5198279 tokens: 4315295 words 538850 delims 344134 meta (0 diffs).
TOTAL: 344134 unanalyzed (344134 trivial), 4854145 unmarked (0 trivial), 0 poly-marked, consecutively numbered
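The per-file figures above can be checked mechanically. The following is a minimal Python sketch, not part of the original processing tools: the function name `check_record` and both regular expressions are illustrative assumptions. It tests that words, delims and meta sum to the token count, and that unanalyzed plus unmarked tokens do the same. It also tests that the unanalyzed count equals the meta count, which holds for the records listed here but is an observation about these figures rather than a documented property.

```python
import re

# First line of a record, e.g.
# "DI931230.dis - 16107 tokens: 13501 words, 1701 delims, 905 meta (0 diffs)."
# The TOTAL line omits the commas, so they are optional here.
COUNTS = re.compile(
    r"(?P<tokens>\d+) tokens: (?P<words>\d+) words,? "
    r"(?P<delims>\d+) delims,? (?P<meta>\d+) meta"
)

# Second line of a record, e.g.
# "905 unanalyzed (905 trivial), 15202 unmarked (0 trivial), ..."
ANALYSIS = re.compile(
    r"(?P<unanalyzed>\d+) unanalyzed .*?(?P<unmarked>\d+) unmarked"
)


def check_record(counts_line: str, analysis_line: str) -> bool:
    """Return True if the two lines of one record are internally consistent."""
    c = {k: int(v) for k, v in COUNTS.search(counts_line).groupdict().items()}
    a = {k: int(v) for k, v in ANALYSIS.search(analysis_line).groupdict().items()}
    return (
        c["words"] + c["delims"] + c["meta"] == c["tokens"]
        and a["unanalyzed"] + a["unmarked"] == c["tokens"]
        # Assumption drawn from the listed figures only:
        and a["unanalyzed"] == c["meta"]
    )
```

Applied to the TOTAL record, the check confirms that 4315295 + 538850 + 344134 = 5198279 and that 344134 + 4854145 = 5198279.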
225487 AB "<H{r>" <18> DI930107.dis
8627 AB AN "<bl_a>" <244> DI930107.dis
15244 AB KOM "<mer>" <9> DI930107.dis
57773 AB POS "<kostnadsm{ssigt>" <105> DI930107.dis
85 AB SMS "<in->" <8051> DI930114.dis
8479 AB SUV "<mest>" <21> DI930107.dis
1 AN "<MD>" <15761> DI930617.dis
281294 DL MAD "<.>" <31> DI930107.dis
170066 DL MID "<,>" <27> DI930107.dis
87490 DL PAD "<$">" <11> DI930107.dis
33 DT MAS SIN DEF "<denne>" <8158> DI930113.dis
20 DT MAS SIN IND "<samme>" <7212> DI930116.dis
17776 DT NEU SIN DEF "<det>" <126> DI930107.dis
32848 DT NEU SIN IND "<ett>" <151> DI930107.dis
69 DT NEU SIN IND/DEF "<allt>" <1864> DI930107.dis
43903 DT UTR SIN DEF "<den>" <132> DI930107.dis
70574 DT UTR SIN IND "<en>" <39> DI930107.dis
54 DT UTR SIN IND/DEF "<all>" <165> DI930120.dis
31098 DT UTR/NEU PLU DEF "<de>" <75> DI930107.dis
4018 DT UTR/NEU PLU IND "<inga>" <682> DI930107.dis
3771 DT UTR/NEU PLU IND/DEF "<alla>" <2589> DI930107.dis
13 DT UTR/NEU SIN DEF "<vardera>" <10569> DI930109.dis
1630 DT UTR/NEU SIN IND "<varje>" <1342> DI930107.dis
3357 DT UTR/NEU SIN/PLU IND "<samma>" <1226> DI930107.dis
20334 HA "<N{r>" <1363> DI930107.dis
146 HD NEU SIN IND "<vilket>" <1809> DI930108.dis
359 HD UTR SIN IND "<vilken>" <7810> DI930107.dis
430 HD UTR/NEU PLU IND "<vilka>" <16774> DI930109.dis
43131 HP - - - "<som>" <95> DI930107.dis
6965 HP NEU SIN IND "<vilket>" <370> DI930107.dis
791 HP UTR SIN IND "<vem>" <10274> DI930107.dis
349 HP UTR/NEU PLU IND "<vilka>" <1952> DI930109.dis
580 HS DEF "<vars>" <3693> DI930107.dis
53830 IE "<Att>" <119> DI930107.dis
871 IN "<Nej>" <664> DI930107.dis
18 IN SMS "<ja->" <829> DI930122.dis
520 JJ AN "<t_f>" <63> DI930107.dis
4 JJ KOM UTR/NEU SIN/PLU IND/DEF GEN "<{ldres>" <17185> DI930428.dis
21099 JJ KOM UTR/NEU SIN/PLU IND/DEF NOM "<yngre>" <198> DI930107.dis
2 JJ MAS SIN IND NOM "<Sjelledske>" <9782> DI930817.dis
1 JJ NEU SIN IND NOM "<Lettiskt>" <9340> DI931218.dis
2 JJ POS MAS - - SMS "<olje->" <3575> DI930222.dis
57 JJ POS MAS SIN DEF GEN "<Banksveriges>" <13775> DI930111.dis
2720 JJ POS MAS SIN DEF NOM "<f|rre>" <5463> DI930107.dis
1 JJ POS NEU - - SMS "<naturligt->" <20339> DI930311.dis
26148 JJ POS NEU SIN IND NOM "<ot{nkbart>" <166> DI930107.dis
1082 JJ POS NEU SIN IND/DEF NOM "<eget>" <8173> DI930107.dis
79 JJ POS UTR - - SMS "<}ng->" <11483> DI930123.dis
12 JJ POS UTR SIN IND GEN "<gottskrivs>" <15445> DI930306.dis
62278 JJ POS UTR SIN IND NOM "<gammal>" <93> DI930107.dis
1252 JJ POS UTR SIN IND/DEF NOM "<egen>" <1256> DI930107.dis
80 JJ POS UTR/NEU - - SMS "<sm}->" <15658> DI930113.dis
5126 JJ POS UTR/NEU PLU IND NOM "<Flera>" <3435> DI930107.dis
44 JJ POS UTR/NEU PLU IND/DEF GEN "<konservativas>" <13019> DI930115.dis
77057 JJ POS UTR/NEU PLU IND/DEF NOM "<svenska>" <420> DI930107.dis
10 JJ POS UTR/NEU SIN DEF GEN "<allm{nnas>" <4135> DI930327.dis
55137 JJ POS UTR/NEU SIN DEF NOM "<luxu|sa>" <22> DI930107.dis
1 JJ POS UTR/NEU SIN/PLU IND NOM "<r{tt>" <5744> DI931016.dis
276 JJ POS UTR/NEU SIN/PLU IND/DEF NOM "<genomtr{ngande>" <1187> DI930107.dis
330 JJ SUV MAS SIN DEF NOM "<b{ste>" <12107> DI930107.dis
2 JJ SUV UTR/NEU - - SMS "<krist->" <10471> DI930129.dis
1202 JJ SUV UTR/NEU PLU DEF NOM "<flesta>" <5675> DI930107.dis
75 JJ SUV UTR/NEU PLU IND NOM "<Flest>" <12069> DI930107.dis
13449 JJ SUV UTR/NEU SIN/PLU DEF NOM "<finaste>" <127> DI930107.dis
2117 JJ SUV UTR/NEU SIN/PLU IND NOM "<S{mst>" <2405> DI930107.dis
1 JJ UTR SIN IND GEN "<Norsks>" <5914> DI930113.dis
12 JJ UTR SIN IND NOM "<PESSIMISTISK>" <6933> DI930211.dis
168864 KN "<som>" <62> DI930107.dis
6243 KN AN "<&>" <1978> DI930107.dis
1 LAS" PM NOM "<LAS>" <19733> DI930325.dis
3353 NN - - - - "<salu>" <659> DI930107.dis
205 NN - - - SMS "<bygg->" <2617> DI930107.dis
20703 NN AN "<Jr>" <4> DI930107.dis
832 NN NEU - - SMS "<olje->" <8829> DI930107.dis
1866 NN NEU PLU DEF GEN "<SKOGSBOLAGENS>" <4444> DI930107.dis
19891 NN NEU PLU DEF NOM "<prisl{gena>" <1114> DI930107.dis
731 NN NEU PLU IND GEN "<}rs>" <279> DI930107.dis
49743 NN NEU PLU IND NOM "<}r>" <209> DI930107.dis
7531 NN NEU SIN DEF GEN "<hotellets>" <496> DI930107.dis
81727 NN NEU SIN DEF NOM "<hotellet>" <82> DI930107.dis
1372 NN NEU SIN IND GEN "<}rs>" <2665> DI930107.dis
119508 NN NEU SIN IND NOM "<ansikte>" <25> DI930107.dis
5 NN SMS "<S->" <14361> DI930127.dis
25 NN UTR - - - "<Dags>" <9644> DI930114.dis
2863 NN UTR - - SMS "<sommar->" <240> DI930107.dis
4387 NN UTR PLU DEF GEN "<Studiernas>" <299> DI930107.dis
2 NN UTR PLU DEF GRAF "<biografierna>" <558> DI931013.dis
117 NN UTR PLU DEF METR "<Volymerna>" <6939> DI930115.dis
54803 NN UTR PLU DEF NOM "<organisationsf|r{ndringarna>" <78> DI930107.dis
2054 NN UTR PLU IND GEN "<procents>" <4319> DI930107.dis
2 NN UTR PLU IND GRAF "<serigrafier>" <678> DI931206.dis
25 NN UTR PLU IND METR "<GL[DJEKALKYLER>" <14952> DI930112.dis
200787 NN UTR PLU IND NOM "<studier>" <280> DI930107.dis
18843 NN UTR SIN DEF GEN "<Dagens>" <113> DI930107.dis
7 NN UTR SIN DEF METR "<Parfymen>" <1899> DI930217.dis
209245 NN UTR SIN DEF NOM "<g}ngen>" <38> DI930107.dis
6 NN UTR SIN DEG NOM "<skruvmejseln>" <17934> DI930204.dis
3270 NN UTR SIN IND GEN "<hotells>" <23> DI930107.dis
71 NN UTR SIN IND METR "<logotyp>" <933> DI930111.dis
306036 NN UTR SIN IND NOM "<hotellkung>" <6> DI930107.dis
1 NN UTR SIN INSD NOM "<pilau>" <1404> DI931230.dis
17 PC PRF - - - SMS "<lombard->" <285> DI930208.dis
13 PC PRF MAS SIN DEF GEN "<anst{lldes>" <19170> DI930129.dis
403 PC PRF MAS SIN DEF NOM "<fuskbourgogne>" <1502> DI930107.dis
5317 PC PRF NEU SIN IND NOM "<l{ttsm{lt>" <1853> DI930107.dis
1 PC PRF UTR SIN IND GEN "<|verordnads>" <16534> DI930113.dis
12986 PC PRF UTR SIN IND NOM "<tidsanpassad>" <85> DI930107.dis
179 PC PRF UTR/NEU PLU IND/DEF GEN "<omg{rdas>" <9960> DI930111.dis
12921 PC PRF UTR/NEU PLU IND/DEF NOM "<p}b|rjade>" <77> DI930107.dis
33 PC PRF UTR/NEU SIN DEF GEN "<besk}das>" <6696> DI930109.dis
5388 PC PRF UTR/NEU SIN DEF NOM "<familje{gda>" <44> DI930107.dis
13992 PC PRS UTR/NEU SIN/PLU IND/DEF NOM "<ledande>" <475> DI930107.dis
30458 PL "<in>" <61> DI930107.dis
47 PL SMS "<in->" <14831> DI930121.dis
26653 PM GEN "<Sveriges>" <5> DI930107.dis
317257 PM NOM "<Wallenberg>" <3> DI930107.dis
27 PM SMS "<B->" <2836> DI930121.dis
75 PN MAS SIN DEF SUB/OBJ "<denne>" <15259> DI930109.dis
57697 PN NEU SIN DEF SUB/OBJ "<Det>" <34> DI930107.dis
5726 PN NEU SIN IND SUB/OBJ "<ett>" <472> DI930107.dis
1961 PN UTR PLU DEF OBJ "<oss>" <3196> DI930107.dis
13969 PN UTR PLU DEF SUB "<Vi>" <446> DI930107.dis
1550 PN UTR SIN DEF OBJ "<honom>" <12296> DI930107.dis
19252 PN UTR SIN DEF SUB "<Han>" <7> DI930107.dis
6756 PN UTR SIN DEF SUB/OBJ "<den>" <1731> DI930107.dis
9099 PN UTR SIN IND SUB "<man>" <128> DI930107.dis
2244 PN UTR SIN IND SUB/OBJ "<en>" <494> DI930107.dis
2517 PN UTR/NEU PLU DEF OBJ "<dem>" <816> DI930107.dis
9075 PN UTR/NEU PLU DEF SUB "<de>" <3346> DI930107.dis
1411 PN UTR/NEU PLU DEF SUB/OBJ "<Dessa>" <1803> DI930107.dis
4347 PN UTR/NEU PLU IND SUB/OBJ "<alla>" <737> DI930107.dis
13146 PN UTR/NEU SIN/PLU DEF OBJ "<sig>" <214> DI930107.dis
587176 PP "<p}>" <10> DI930107.dis
8 PP AN "<ink>" <10450> DI930213.dis
1 PP SMS "<pro->" <5494> DI930811.dis
4919 PS NEU SIN DEF "<sitt>" <1392> DI930107.dis
10050 PS UTR SIN DEF "<sin>" <220> DI930107.dis
6837 PS UTR/NEU PLU DEF "<sina>" <206> DI930107.dis
4272 PS UTR/NEU SIN/PLU DEF "<Deras>" <2805> DI930107.dis
21 RG GEN "<1s>" <16161> DI930123.dis
59 RG MAS SIN DEF NOM "<ene>" <15775> DI930123.dis
129 RG NEU SIN IND NOM "<Ett>" <3696> DI930108.dis
155041 RG NOM "<33>" <208> DI930107.dis
69 RG SMS "<1->" <8637> DI930113.dis
62 RG UTR SIN IND NOM "<en>" <11985> DI930107.dis
1066 RG UTR/NEU PLU IND/DEF NOM "<b}da>" <3501> DI930107.dis
218 RG UTR/NEU SIN DEF NOM "<ena>" <14949> DI930109.dis
1112 RG UTR/NEU SIN IND/DEF NOM "<enda>" <1240> DI930107.dis
1 RO MAS SIN IND/DEF GEN "<enes>" <17875> DI930924.dis
101 RO MAS SIN IND/DEF NOM "<f|rste>" <2564> DI930109.dis
2507 RO NOM "<sj{tte>" <2399> DI930107.dis
5 RO SMS "<Fj{rde->" <1420> DI930528.dis
6455 RO UTR/NEU SIN/PLU IND/DEF NOM "<f|rsta>" <37> DI930107.dis
1 RO UTR/NEU SIN/PLU IND/DEF SMS "<andra->" <1528> DI930528.dis
59353 SN "<att>" <72> DI930107.dis
10279 UO "<bed>" <12> DI930107.dis
6 VB AN "<Obs>" <14452> DI930303.dis
1806 VB IMP AKT "<Betala>" <8098> DI930107.dis
122809 VB INF AKT "<genomf|ra>" <74> DI930107.dis
14963 VB INF SFO "<anpassas>" <99> DI930107.dis
75 VB KON PRS AKT "<vare>" <11581> DI930107.dis
447 VB KON PRT AKT "<vore>" <3698> DI930108.dis
3 VB KON PRT SFO "<funnes>" <9531> DI930323.dis
321914 VB PRS AKT "<tror>" <8> DI930107.dis
36518 VB PRS SFO "<hoppas>" <525> DI930107.dis
100688 VB PRT AKT "<startade>" <219> DI930107.dis
14799 VB PRT SFO "<invigdes>" <829> DI930107.dis
87 VB SMS "<balett->" <1010> DI930108.dis
51520 VB SUP AKT "<g}tt>" <60> DI930107.dis
9442 VB SUP SFO "<talats>" <491> DI930107.dis
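The rows of the table above can be read programmatically. The following is a minimal Python sketch under two assumptions that the report does not state explicitly. First, each row is taken to consist of a frequency count, a tag, an example word form in `"<...>"`, a token number in `<...>`, and the name of the file containing the example. Second, the characters `{ | } [ \ ]` in word forms are taken to follow the usual Swedish 7-bit convention for ä ö å Ä Ö Å. The names `TagRow`, `parse_row` and `decode_swedish` are hypothetical.

```python
import re
from typing import NamedTuple

# Assumed Swedish 7-bit convention: [ \ ] { | }  ->  Ä Ö Å ä ö å
SWEDISH_7BIT = str.maketrans("[\\]{|}", "ÄÖÅäöå")

# count, tag, "<form>", <index>, file; whitespace between the quoted
# form and the index is optional because some rows omit it.
ROW = re.compile(
    r'^(?P<count>\d+)\s+(?P<tag>.*?)\s+"\s*<(?P<form>.*)>"\s*'
    r"<(?P<index>\d+)>\s+(?P<file>\S+)$"
)


class TagRow(NamedTuple):
    count: int
    tag: str
    form: str
    index: int
    file: str


def decode_swedish(text: str) -> str:
    """Replace the 7-bit stand-ins with the corresponding Swedish letters."""
    return text.translate(SWEDISH_7BIT)


def parse_row(line: str) -> TagRow:
    """Split one table row into its fields, decoding the example word form."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognized row: {line!r}")
    return TagRow(
        count=int(m["count"]),
        tag=m["tag"],
        form=decode_swedish(m["form"]),
        index=int(m["index"]),
        file=m["file"],
    )
```

For example, `parse_row('43131 HP - - - "<som>" <95> DI930107.dis')` gives the tag `HP - - -` with a count of 43131. The decoding should only be applied to word forms and lemmas, since the same characters may be literal elsewhere.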
APPENDICES A. B. DI93 Tagset C. Sample from the file DI930320.dis.diff
This file was used in the evaluation described in the report. It highlights the differences between fully automatic POS disambiguation, which is presented first, and manual POS disambiguation, which is presented second. Differences with respect to syntactic categories are highlighted with ***, and differences with respect to inflectional features are highlighted with +++.
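The layout just described lends itself to simple extraction of the disagreements. The following is a minimal Python sketch, assuming one entry per line in the form `("<word>" <number> (TAG "lemma"))` and assuming that each `***` or `+++` marker is followed by exactly two entries, the automatic analysis first and the manual one second. The names `Entry`, `Diff`, `parse_entry` and `iter_diffs` are hypothetical and do not come from the project's own tools.

```python
import re
from typing import Iterable, Iterator, List, NamedTuple, Optional

# One analysis line, e.g. ("<veta>" <294> (VB PRS AKT "veta"))
ENTRY = re.compile(
    r'^\("<(?P<form>.*)>" <(?P<index>\d+)> \((?P<tag>.*) "(?P<lemma>.*)"\)\)$'
)

# Marker meaning as stated in the text: *** category, +++ inflectional features
MARKERS = {"***": "category", "+++": "features"}


class Entry(NamedTuple):
    form: str
    index: int
    tag: str
    lemma: str


class Diff(NamedTuple):
    kind: str
    automatic: Entry
    manual: Entry


def parse_entry(line: str) -> Optional[Entry]:
    """Parse one analysis line; return None for markers and separators."""
    m = ENTRY.match(line.strip())
    if m is None:
        return None
    return Entry(m["form"], int(m["index"]), m["tag"], m["lemma"])


def iter_diffs(lines: Iterable[str]) -> Iterator[Diff]:
    """Yield one Diff per marker, pairing the two entries that follow it."""
    marker: Optional[str] = None
    pending: List[Entry] = []
    for line in lines:
        text = line.strip()
        if text in MARKERS:
            marker, pending = MARKERS[text], []
            continue
        entry = parse_entry(text)
        if marker is None:
            continue  # token on which both analyses agree
        if entry is None:
            marker = None  # unexpected line; drop the incomplete pair
            continue
        pending.append(entry)
        if len(pending) == 2:
            yield Diff(marker, pending[0], pending[1])
            marker = None
```

Counting the yielded items by `kind` gives separate tallies of category errors and feature errors for a file, which is one way such a file could feed into an evaluation.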
("<Folk>" <270> (NN NEU PLU IND NOM "folk"))
("<i>" <271> (PP "i"))
("<farten>" <272> (NN UTR SIN DEF NOM "fart"))
("<:>" <273> (DL MID ":"))
("<En>" <274> (DT UTR SIN IND "en"))
("<sanndjurg}rdare>" <275> (NN UTR SIN IND NOM "sanndjurg}rdare"))
("<Handelsbankens>" <278> (NN UTR SIN DEF GEN "handelsbank"))
("<penningmarknadsavdelning>" <279> (NN UTR SIN IND NOM "penningmarknadsavdelning"))
("<{r>" <280> (VB PRS AKT "vara"))
("<ofta>" <281> (AB POS "ofta"))
("<i>" <282> (PP "i"))
("<rampljuset>" <283> (NN NEU SIN DEF NOM "rampljus"))
("<n{r>" <284> (HA "n{r"))
("<det>" <285> (PN NEU SIN DEF SUB/OBJ "det"))
("<hettar>" <286> (VB PRS AKT "hetta"))
("<till>" <287> (PL "till"))
("<inom>" <288> (PP "inom"))
("<politiken>" <289> (NN UTR SIN DEF NOM "politik"))
("<.>" <290> (DL MAD "."))
("<D}>" <291> (AB "d}"))
("<vill>" <292> (VB PRS AKT "vilja"))
("<alla>" <293> (PN UTR/NEU PLU IND SUB/OBJ "alla"))
+++
("<veta>" <294> (VB PRS AKT "veta"))
("<veta>" <294> (VB INF AKT "veta"))
("<hur>" <295> (HA "hur"))
("<marknaden>" <296> (NN UTR SIN DEF NOM "marknad"))
("<reagerar>" <297> (VB PRS AKT "reagera"))
("<och>" <298> (KN "och"))
("<massmedia>" <299> (NN NEU PLU IND NOM "massmedia"))
("<flockas>" <300> (VB PRS SFO "flockas"))
("<bl_a>" <301> (AB AN "bland_annat"))
("<p}>" <302> (PP "p}"))
("<Handelsbanken>" <303> (PM NOM "Handelsbanken"))
("<.>" <304> (DL MAD "."))
("<AIK-affischer>" <307> (NN UTR PLU IND NOM "AIK-affisch"))
("<och>" <308> (KN "och"))
("<Leksands-vimplar>" <309> (NN UTR PLU IND NOM "Leksands-vimpla"))
("<fick>" <310> (VB PRT AKT "f}"))
("<h{rom>" <311> (AB "h{rom"))
+++
("<sistens>" <312> (NN NEU SIN IND NOM "sistens"))
("<sistens>" <312> (NN UTR SIN DEF GEN "sist"))
("<en>" <313> (DT UTR SIN IND "en"))
("<bes|kande>" <314> (PC PRS UTR/NEU SIN/PLU IND/DEF NOM "bes|kande"))
("<reporter>" <315> (NN UTR SIN IND NOM "reporter"))
("<att>" <316> (IE "att"))
("<utbrista>" <317> (VB INF AKT "utbrista"))
+++
("<:>" <318> (DL MAD ":"))
("<:>" <6> (DL MID ":"))
("<$">" <321> (DL PAD "$""))
("<Finns>" <322> (VB PRS SFO "finna"))
("<det>" <323> (PN NEU SIN DEF SUB/OBJ "det"))
("<inga>" <324> (DT UTR/NEU PLU IND "ingen"))
("<djurg}rdare>" <325> (NN UTR PLU IND NOM "djurg}rdare"))
("<h{r>" <326> (AB "h{r"))
("<?>" <327> (DL MAD "?"))
("<$">" <328> (DL PAD "$""))
("<.>" <329> (DL MAD "."))
("<$">" <332> (DL PAD "$""))
("<Jo>" <333> (IN "jo"))
("<,>" <334> (DL MID ","))
("<jag>" <335> (PN UTR SIN DEF SUB "jag"))
("<$">" <336> (DL PAD "$""))
("<,>" <337> (DL MID ","))
("<h|rdes>" <338> (VB PRT SFO "h|ra"))
("<en>" <339> (DT UTR SIN IND "en"))
("<r|st>" <340> (NN UTR SIN IND NOM "r|st"))
("<,>" <341> (DL MID ","))
("<som>" <342> (HP - - - "som"))
("<visade>" <343> (VB PRT AKT "visa"))
("<sig>" <344> (PN UTR/NEU SIN/PLU DEF OBJ "sig"))
("<komma>" <345> (VB INF AKT "komma"))
("<fr}n>" <346> (PP "fr}n"))
("<h|gste>" <347> (JJ SUV MAS SIN DEF NOM "h|g"))
("<chefen>" <348> (NN UTR SIN DEF NOM "chef"))
("<sj{lv>" <349> (JJ POS UTR SIN IND NOM "sj{lv"))
("<,>" <350> (DL MID ","))
("<Jan>" <351> (PM NOM "Jan"))
("<Carlsson>" <352> (PM NOM "Carlsson"))
("<.>" <353> (DL MAD "."))
("<Han>" <354> (PN UTR SIN DEF SUB "han"))
("<har>" <355> (VB PRS AKT "ha"))
("<}>" <356> (PP "}"))
***
("<andra>" <357> (JJ POS UTR/NEU SIN DEF NOM "annan"))
("<andra>" <357> (RO UTR/NEU SIN/PLU IND/DEF NOM "andra"))
("<sidan>" <358> (NN UTR SIN DEF NOM "sida"))
("<sitt>" <359> (PS NEU SIN DEF "sin"))
("<chefsrum>" <360> (NN NEU SIN IND NOM "chefsrum"))
("<en>" <361> (DT UTR SIN IND "en"))
("<bit>" <362> (NN UTR SIN IND NOM "bit"))
("<bort>" <363> (AB "bort"))
("<,>" <364> (DL MID ","))
***
("<s}>" <365> (AB "s}"))
("<s}>" <365> (KN "s}"))
("<eventuella>" <366> (JJ POS UTR/NEU PLU IND/DEF NOM "eventuell"))
("<Djurg}rdsflaggor>" <367> (NN UTR PLU IND NOM "Djurg}rdsflagga"))
("<torde>" <368> (VB PRT AKT "torde"))
("<h{nga>" <369> (VB INF AKT "h{nga"))
("<d{r>" <370> (AB "d{r"))
("<.>" <371> (DL MAD "."))
_________________________________
+++
("<Ledare>" <19087> (NN UTR PLU IND NOM "ledare"))
("<Ledare>" <19087> (NN UTR SIN IND NOM "ledare"))
("<:>" <19088> (DL MID ":"))
***
("<Ju>" <19089> (AB "ju"))
("<Ju>" <19089> (KN "ju"))
***
("<mer>" <19090> (JJ KOM UTR/NEU SIN/PLU IND/DEF NOM "mycken"))
("<mer>" <19090> (AB KOM "mycket"))
("<offentlighet>" <19091> (NN UTR SIN IND NOM "offentlighet"))
("<som>" <19092> (HP - - - "som"))
("<kr{vs>" <19093> (VB PRS SFO "kr{va"))
("<desto>" <19094> (AB "desto"))
("<rimligare>" <19095> (JJ KOM UTR/NEU SIN/PLU IND/DEF NOM "rimlig"))
("<blir>" <19096> (VB PRS AKT "bli"))
("<fallsk{rmsavtalen>" <19097> (NN NEU PLU DEF NOM "fallsk{rmsavtal"))
("<.>" <19098> (DL MAD "."))
("<P}>" <19101> (PP "p}"))
("<Volvo>" <19102> (PM NOM "Volvo"))
("<inledde>" <19103> (VB PRT AKT "inleda"))
("<16>" <19104> (RG NOM "16"))
("<arbetare>" <19105> (NN UTR PLU IND NOM "arbetare"))
("<p}>" <19106> (PP "p}"))
("<fredagen>" <19107> (NN UTR SIN DEF NOM "fredag"))
("<en>" <19108> (DT UTR SIN IND "en"))
("<vild>" <19109> (JJ POS UTR SIN IND NOM "vild"))
("<strejk>" <19110> (NN UTR SIN IND NOM "strejk"))
("<mot>" <19111> (PP "mot"))
("<bl_a>" <19112> (AB AN "bland_annat"))
("<chefernas>" <19113> (NN UTR PLU DEF GEN "chef"))
("<fallsk{rmsavtal>" <19114> (NN NEU PLU IND NOM "fallsk{rmsavtal"))
("<.>" <19115> (DL MAD "."))
("<I>" <19118> (PP "i"))
("<TV>" <19119> (NN UTR SIN IND NOM "tv"))
("<har>" <19120> (VB PRS AKT "ha"))
("<Volvos>" <19121> (PM GEN "Volvos"))
("<styrelseordf|rande>" <19122> (NN UTR SIN IND NOM "styrelseordf|rande"))
("<Pehr>" <19123> (PM NOM "Pehr"))
("<G>" <19124> (PM NOM "G"))
("<Gyllenhammar>" <19125> (PM NOM "Gyllenhammar"))
("<talat>" <19126> (VB SUP AKT "tala"))
***
("<om>" <19127> (PP "om"))
("<om>" <19127> (PL "om"))
***
("<att>" <19128> (IE "att"))
("<att>" <19128> (SN "att"))
("<{ven>" <19129> (AB "{ven"))
("<kunderna>" <19130> (NN UTR PLU DEF NOM "kund"))
("<har>" <19131> (VB PRS AKT "ha"))
("<b|rjat>" <19132> (VB SUP AKT "b|rja"))
("<reagera>" <19133> (VB INF AKT "reagera"))
("<.>" <19134> (DL MAD "."))
("<Aktiespararna>" <19137> (NN UTR PLU DEF NOM "Aktiesparare"))
("<kr{ver>" <19138> (VB PRS AKT "kr{va"))
("<att>" <19139> (SN "att"))
("<alla>" <19140> (DT UTR/NEU PLU IND/DEF "all"))
("<fallsk{rmsavtal>" <19141> (NN NEU PLU IND NOM "fallsk{rmsavtal"))
("<i>" <19142> (PP "i"))
("<b|rsbolagen>" <19143> (NN NEU PLU DEF NOM "b|rsbolag"))
("<offentligg|rs>" <19144> (VB PRS SFO "offentligg|ra"))
("<i>" <19145> (PP "i"))
("<}rsredovisningarna>" <19146> (NN UTR PLU DEF NOM "}rsredovisning"))
("<.>" <19147> (DL MAD "."))
("<TILL>" <19150> (PP "till"))
("<OCH>" <19151> (KN "och"))
("<MED>" <19152> (PP "med"))
("<SAFs>" <19153> (PM GEN "SAF"))
("<ordf|rande>" <19154> (NN UTR SIN IND NOM "ordf|rande"))
("<Ulf>" <19155> (PM NOM "Ulf"))
("<Laurin>" <19156> (PM NOM "Laurin"))
("<skriver>" <19157> (VB PRS AKT "skriva"))
("<i>" <19158> (PP "i"))
("<en>" <19159> (DT UTR SIN IND "en"))
("<debattartikel>" <19160> (NN UTR SIN IND NOM "debattartikel"))
("<i>" <19161> (PP "i"))
***
("<Dagens>" <19162> (NN UTR SIN DEF GEN "dag"))
("<Dagens>" <19162> (PM NOM "Dagens"))
***
("<Nyheter>" <19163> (NN UTR PLU IND NOM "nyhet"))
("<Nyheter>" <19163> (PM NOM "Nyheter"))
("<att>" <19164> (SN "att"))
("<det>" <19165> (PN NEU SIN DEF SUB/OBJ "det"))
("<i>" <19166> (PP "i"))
("<80-talets>" <19167> (NN NEU SIN DEF GEN "80-tal"))
("<lyckorus>" <19168> (NN NEU SIN IND NOM "lyckorus"))
("<f|rekom>" <19169> (VB PRT AKT "f|re_komma"))
***
("<mycket>" <19170> (AB POS "mycket"))
("<mycket>" <19170> (PN NEU SIN IND SUB/OBJ "mycket"))
***
("<omoraliskt>" <19171> (AB POS "omoraliskt"))
("<omoraliskt>" <19171> (JJ NEU SIN IND NOM "omoralisk"))
("<och>" <19172> (KN "och"))
("<oanst{ndigt>" <19173> (JJ POS NEU SIN IND NOM "oanst{ndig"))
("<inom>" <19174> (PP "inom"))
("<n{ringslivet>" <19175> (NN NEU SIN DEF NOM "n{ringsliv"))
("<och>" <19176> (KN "och"))
("<s{rskilt>" <19177> (AB "s{rskilt"))
("<inom>" <19178> (PP "inom"))
("<finanssektorn>" <19179> (NN UTR SIN DEF NOM "finansssektor"))
("<.>" <19180> (DL MAD "."))
("<Han>" <19181> (PN UTR SIN DEF SUB "han"))
("<pekar>" <19182> (VB PRS AKT "peka"))
("<bl_a>" <19183> (AB AN "bland_annat"))
("<p}>" <19184> (PP "p}"))
("<fallsk{rmarna>" <19185> (NN UTR PLU DEF NOM "fallsk{rmare"))
***
("<som>" <19186> (KN "som"))
("<som>" <19186> (HP - - - "som"))
+++
("<kritiseras>" <19187> (VB INF SFO "kritisera"))
("<kritiseras>" <19187> (VB PRS SFO "kritisera"))
("<av>" <19188> (PP "av"))
("<alla>" <19189> (PN UTR/NEU PLU IND SUB/OBJ "alla"))
("<som>" <19190> (HP - - - "som"))
("<nu>" <19191> (AB "nu"))
("<st}r>" <19192> (VB PRS AKT "st}"))
("<utan>" <19193> (PP "utan"))
("<jobb>" <19194> (NN NEU SIN IND NOM "jobb"))
("<och>" <19195> (KN "och"))
("<inte>" <19196> (AB "inte"))
("<f}tt>" <19197> (VB SUP AKT "f}"))
("<m|jlighet>" <19198> (NN UTR SIN IND NOM "m|jlighet"))
("<till>" <19199> (PP "till"))
***
("<motsvarande>" <19200> (NN NEU SIN IND NOM "motsvarande"))
("<motsvarande>" <19200> (PC PRS UTR/NEU SIN/PLU IND/DEF NOM "motsvarande"))
("<mjuklandning>" <19201> (NN UTR SIN IND NOM "mjuklandning"))
("<.>" <19202> (DL MAD "."))
("<Ulf>" <19203> (PM NOM "Ulf"))
("<Laurin>" <19204> (PM NOM "Laurin"))
("<menar>" <19205> (VB PRS AKT "mena"))
("<att>" <19206> (SN "att"))
***
("<allt>" <19207> (PN NEU SIN IND SUB/OBJ "allt"))
("<allt>" <19207> (DT NEU SIN IND/DEF "all"))
("<detta>" <19208> (PN NEU SIN DEF SUB/OBJ "detta"))
***
("<lett>" <19209> (AB "lett"))
("<lett>" <19209> (VB SUP AKT "le"))
("<till>" <19210> (PP "till"))
("<rej{lt>" <19211> (AB POS "rej{lt"))
("<f|rs{mrad>" <19212> (PC PRF UTR SIN IND NOM "f|rs{mrad"))
("<legitimitet>" <19213> (NN UTR SIN IND NOM "legitimitet"))
("<f|r>" <19214> (PP "f|r"))
("<arbetsgivarna>" <19215> (NN UTR PLU DEF NOM "arbetsgivare"))
("<.>" <19216> (DL MAD "."))
("<DET>" <19219> (PN NEU SIN DEF SUB/OBJ "det"))
("<[R>" <19220> (VB PRS AKT "vara"))
***
("<L[TT>" <19221> (AB POS "l{tt"))
("<L[TT>" <19221> (JJ POS NEU SIN IND NOM "l{tt"))
("<ATT>" <19222> (IE "att"))
("<F\RST]>" <19223> (VB INF AKT "f|rst}"))
("<kritiken>" <19224> (NN UTR SIN DEF NOM "kritik"))
("<mot>" <19225> (PP "mot"))
("<fallsk{rmsavtalen>" <19226> (NN NEU PLU DEF NOM "fallsk{rmsavtal"))
("<->" <19227> (DL MID "-"))
("<inte>" <19228> (AB "inte"))
("<minst>" <19229> (AB SUV "minst"))
("<n{r>" <19230> (HA "n{r"))
("<det>" <19231> (PN NEU SIN DEF SUB/OBJ "det"))
("<b|rjar>" <19232> (VB PRS AKT "b|rja"))
("<handla>" <19233> (VB INF AKT "handla"))
("<om>" <19234> (PP "om"))
("<m}ngmiljonbelopp>" <19235> (NN NEU PLU IND NOM "m}ngmiljonbelopp"))
("<.>" <19236> (DL MAD "."))
("<Men>" <19237> (KN "men"))
("<det>" <19238> (PN NEU SIN DEF SUB/OBJ "det"))
("<{r>" <19239> (VB PRS AKT "vara"))
("<viktigt>" <19240> (JJ POS NEU SIN IND NOM "viktig"))
("<att>" <19241> (IE "att"))
("<rikta>" <19242> (VB INF AKT "rikta"))
("<kritiken>" <19243> (NN UTR SIN DEF NOM "kritik"))
("<}t>" <19244> (PP "}t"))
+++
("<r{tt>" <19245> (JJ POS NEU SIN IND NOM "r{t"))
("<r{tt>" <19245> (JJ POS UTR/NEU SIN/PLU IND NOM "r{tt"))
("<h}ll>" <19246> (NN NEU SIN IND NOM "h}ll"))
("<.>" <19247> (DL MAD "."))
("<Det>" <19248> (PN NEU SIN DEF SUB/OBJ "det"))
("<{r>" <19249> (VB PRS AKT "vara"))
("<inte>" <19250> (AB "inte"))
("<de>" <19251> (DT UTR/NEU PLU DEF "den"))
("<personer>" <19252> (NN UTR PLU IND NOM "person"))
("<som>" <19253> (HP - - - "som"))
("<idag>" <19254> (AB "idag"))
("<sitter>" <19255> (VB PRS AKT "sitta"))
("<med>" <19256> (PP "med"))
("<stora>" <19257> (JJ POS UTR/NEU PLU IND/DEF NOM "stor"))
("<utl|sta>" <19258> (JJ POS UTR/NEU PLU IND/DEF NOM "utl|st"))
("<fallsk{rmsavtal>" <19259> (NN NEU PLU IND NOM "fallsk{rmsavtal"))
("<som>" <19260> (HP - - - "som"))
("<ska>" <19261> (VB PRS AKT "ska"))
("<vara>" <19262> (VB INF AKT "vara"))
("<m}ltavla>" <19263> (NN UTR SIN IND NOM "m}ltavla"))
("<.>" <19264> (DL MAD "."))
("<Kritiken>" <19265> (NN UTR SIN DEF NOM "kritik"))
("<ska>" <19266> (VB PRS AKT "ska"))
("<riktas>" <19267> (VB INF SFO "rikta"))
("<mot>" <19268> (PP "mot"))
("<dem>" <19269> (PN UTR/NEU PLU DEF OBJ "de"))
("<som>" <19270> (HP - - - "som"))
("<beviljar>" <19271> (VB PRS AKT "bevilja"))
("<orimligt>" <19272> (AB POS "orimligt"))
("<gener|sa>" <19273> (JJ POS UTR/NEU PLU IND/DEF NOM "gener|s"))
("<avtal>" <19274> (NN NEU PLU IND NOM "avtal"))
("<.>" <19275> (DL MAD "."))
***
("<Det>" <19278> (DT NEU SIN DEF "den"))
("<Det>" <19278> (PN NEU SIN DEF SUB/OBJ "det"))
("<viktigt>" <19279> (JJ POS NEU SIN IND NOM "viktig"))
("<att>" <19280> (IE "att"))
("<skilja>" <19281> (VB INF AKT "skilja"))
("<p}>" <19282> (PP "p}"))
("<begreppen>" <19283> (NN NEU PLU DEF NOM "begrepp"))
("<.>" <19284> (DL MAD "."))
("<Ett>" <19285> (DT NEU SIN IND "en"))
("<visst>" <19286> (JJ POS NEU SIN IND NOM "viss"))
("<m}tt>" <19287> (NN NEU SIN IND NOM "m}tt"))
("<av>" <19288> (PP "av"))
("<fallsk{rmsavtal>" <19289> (NN NEU PLU IND NOM "fallsk{rmsavtal"))
("<m}ste>" <19290> (VB PRS AKT "m}ste"))
("<finnas>" <19291> (VB INF SFO "finna"))
("<.>" <19292> (DL MAD "."))
("<Vanliga>" <19293> (JJ POS UTR/NEU PLU IND/DEF NOM "vanlig"))
("<anst{llda>" <19294> (NN UTR PLU IND NOM "anst{lld"))
("<omfattas>" <19295> (VB PRS SFO "omfatta"))
("<av>" <19296> (PP "av"))
("<lagen>" <19297> (NN UTR SIN DEF NOM "lag"))
("<om>" <19298> (PP "om"))
("<anst{llningstrygghet>" <19299> (NN UTR SIN IND NOM "anst{llningstrygghet"))
("<med>" <19300> (PP "med"))
("<r{tt>" <19301> (NN UTR SIN IND NOM "r{tt"))
("<till>" <19302> (PP "till"))
("<betald>" <19303> (PC PRF UTR SIN IND NOM "betald"))
("<upps{gningstid>" <19304> (NN UTR SIN IND NOM "upps{gningstid"))
("<p}>" <19305> (PP "p}"))
("<upp>" <19306> (AB "upp"))
("<till>" <19307> (PP "till"))
("<sex>" <19308> (RG NOM "sex"))
("<m}nader>" <19309> (NN UTR PLU IND NOM "m}nad"))
("<.>" <19310> (DL MAD "."))
("<En>" <19311> (DT UTR SIN IND "en"))
("<f|retagsledare>" <19312> (NN UTR SIN IND NOM "f|retagsledare"))
("<eller>" <19313> (KN "eller"))
("<politiker>" <19314> (NN UTR SIN IND NOM "politiker"))
("<skyddas>" <19315> (VB PRS SFO "skydda"))
("<inte>" <19316> (AB "inte"))
("<av>" <19317> (PP "av"))
("<lagen>" <19318> (NN UTR SIN DEF NOM "lag"))
("<.>" <19319> (DL MAD "."))
("<Han>" <19320> (PN UTR SIN DEF SUB "han"))
("<eller>" <19321> (KN "eller"))
("<hon>" <19322> (PN UTR SIN DEF SUB "hon"))
("<kan>" <19323> (VB PRS AKT "kunna"))
("<f}>" <19324> (VB INF AKT "f}"))
("<g}>" <19325> (VB INF AKT "g}"))
("<fr}n>" <19326> (PP "fr}n"))
("<ena>" <19327> (RG UTR/NEU SIN DEF NOM "ena"))
("<dagen>" <19328> (NN UTR SIN DEF NOM "dag"))
("<till>" <19329> (PP "till"))
("<den>" <19330> (DT UTR SIN DEF "den"))
***
("<andra>" <19331> (JJ POS UTR/NEU SIN DEF NOM "annan"))
("<andra>" <19331> (RO UTR/NEU SIN/PLU IND/DEF NOM "andra"))
("<utan>" <19332> (PP "utan"))
("<att>" <19333> (IE "att"))
("<sj{lv>" <19334> (JJ POS UTR SIN IND NOM "sj{lv"))
("<ha>" <19335> (VB INF AKT "ha"))
("<missk|tt>" <19336> (VB SUP AKT "missk|ta"))
("<sitt>" <19337> (PS NEU SIN DEF "sin"))
("<jobb>" <19338> (NN NEU SIN IND NOM "jobb"))
("<.>" <19339> (DL MAD "."))
("<ETT>" <19342> (DT NEU SIN IND "en"))
("<RIMLIGT>" <19343> (JJ POS NEU SIN IND NOM "rimlig"))
("<M]TT>" <19344> (NN NEU SIN IND NOM "m}tt"))
("<av>" <19345> (PP "av"))
("<inkomstskydd>" <19346> (NN NEU SIN IND NOM "inkomstskydd"))
("<m}ste>" <19347> (VB PRS AKT "m}ste"))
("<{ven>" <19348> (AB "{ven"))
("<direkt|rer>" <19349> (NN UTR PLU IND NOM "direkt|r"))
("<och>" <19350> (KN "och"))
("<politiker>" <19351> (NN UTR PLU IND NOM "politiker"))
("<ha>" <19352> (VB INF AKT "ha"))
("<r{tt>" <19353> (NN UTR SIN IND NOM "r{tt"))
("<till>" <19354> (PP "till"))
("<.>" <19355> (DL MAD "."))
("<Men>" <19358> (KN "men"))
("<det>" <19359> (PN NEU SIN DEF SUB/OBJ "det"))
("<{r>" <19360> (VB PRS AKT "vara"))
("<viktigt>" <19361> (JJ POS NEU SIN IND NOM "viktig"))
("<att>" <19362> (SN "att"))
("<det>" <19363> (PN NEU SIN DEF SUB/OBJ "det"))
("<{r>" <19364> (VB PRS AKT "vara"))
("<just>" <19365> (AB "just"))
("<rimligt>" <19366> (JJ POS NEU SIN IND NOM "rimlig"))
("<->" <19367> (DL MID "-"))
("<till>" <19368> (PP "till"))
("<sin>" <19369> (PS UTR SIN DEF "sin"))
("<konstruktion>" <19370> (NN UTR SIN IND NOM "konstruktion"))
("<och>" <19371> (KN "och"))
("<till>" <19372> (PP "till"))
("<sin>" <19373> (PS UTR SIN DEF "sin"))
("<storlek>" <19374> (NN UTR SIN IND NOM "storlek"))
("<.>" <19375> (DL MAD "."))
("<Annars>" <19378> (AB "annars"))
("<{r>" <19379> (VB PRS AKT "vara"))
("<risken>" <19380> (NN UTR SIN DEF NOM "risk"))
("<stor>" <19381> (JJ POS UTR SIN IND NOM "stor"))
("<att>" <19382> (SN "att"))
("<b}de>" <19383> (KN "b}de"))
("<personal>" <19384> (NN UTR SIN IND NOM "personal"))
("<och>" <19385> (KN "och"))
("<kunder>" <19386> (NN UTR PLU IND NOM "kund"))
("<reagerar>" <19387> (VB PRS AKT "reagera"))
("<.>" <19388> (DL MAD "."))
("<I>" <19389> (PP "i"))
("<s}dana>" <19390> (JJ POS UTR/NEU PLU IND NOM "s}dan"))
("<fall>" <19391> (NN NEU PLU IND NOM "fall"))
("<blir>" <19392> (VB PRS AKT "bli"))
("<fallsk{rmsavtalen>" <19393> (NN NEU PLU DEF NOM "fallsk{rmsavtal"))
("<dyra>" <19394> (JJ POS UTR/NEU PLU IND/DEF NOM "dyr"))
("<p}>" <19395> (PP "p}"))
("<mer>" <19396> (AB KOM "mycket"))
("<{n>" <19397> (KN "{n"))
***
("<ett>" <19398> (DT NEU SIN IND "en"))
("<ett>" <19398> (RG NEU SIN IND NOM "ett"))
("<s{tt>" <19399> (NN NEU SIN IND NOM "s{tt"))
("<.>" <19400> (DL MAD "."))
("<EN>" <19403> (DT UTR SIN IND "en"))
("<GOD>" <19404> (JJ POS UTR SIN IND NOM "god"))
("<PRINCIP>" <19405> (NN UTR SIN IND NOM "princip"))
("<{r>" <19406> (VB PRS AKT "vara"))
("<att>" <19407> (SN "att"))
("<utsatta>" <19408> (PC PRF UTR/NEU PLU IND/DEF NOM "utsatt"))
("<jobb>" <19409> (NN NEU PLU IND NOM "jobb"))
("<ska>" <19410> (VB PRS AKT "ska"))
("<ge>" <19411> (VB INF AKT "ge"))
("<bra>" <19412> (AB POS "bra"))
("<betalt>" <19413> (PC PRF NEU SIN IND NOM "betald"))
("<medan>" <19414> (SN "medan"))
("<de>" <19415> (PN UTR/NEU PLU DEF SUB "de"))
("<utf|rs>" <19416> (VB PRS SFO "utf|ra"))
("<->" <19417> (DL MID "-"))
("<inte>" <19418> (AB "inte"))
("<vecklas>" <19419> (VB PRS SFO "veckla"))
("<ut>" <19420> (PL "ut"))
("<till>" <19421> (PP "till"))
("<en>" <19422> (DT UTR SIN IND "en"))
("<j{ttefallsk{rm>" <19423> (NN UTR SIN IND NOM "j{ttefallsk{rm"))
("<efter}t>" <19424> (AB "efter}t"))
("<.>" <19425> (DL MAD "."))
("<Statsr}det>" <19428> (NN NEU SIN DEF NOM "statsr}d"))
("<Bo>" <19429> (PM NOM "Bo"))
("<Lundgren>" <19430> (PM NOM "Lundgren"))
("<har>" <19431> (VB PRS AKT "ha"))
("<f|rordat>" <19432> (VB SUP AKT "f|rorda"))
("<en>" <19433> (DT UTR SIN IND "en"))
("<gr{ns>" <19434> (NN UTR SIN IND NOM "gr{ns"))
("<p}>" <19435> (PP "p}"))
("<24>" <19436> (RG NOM "24"))
("<m}nader>" <19437> (NN UTR PLU IND NOM "m}nad"))
("<f|r>" <19438> (PP "f|r"))
+++
("<avg}ngsvederlag>" <19439> (NN UTR SIN IND NOM "avg}ngsvederlag"))
("<avg}ngsvederlag>" <19439> (NN NEU SIN IND NOM "avg}ngsvederlag"))
("<i>" <19440> (PP "i"))
("<de>" <19441> (DT UTR/NEU PLU DEF "den"))
("<statligt>" <19442> (AB POS "statligt"))
("<{gda>" <19443> (PC PRF UTR/NEU PLU IND/DEF NOM "{gd"))
("<kreditinstituten>" <19444> (NN NEU PLU DEF NOM "kreditinstitut"))
("<.>" <19445> (DL MAD "."))
("<Han>" <19446> (PN UTR SIN DEF SUB "han"))
("<vill>" <19447> (VB PRS AKT "vilja"))
("<ocks}>" <19448> (AB "ocks}"))
("<att>" <19449> (SN "att"))
("<pengarna>" <19450> (NN UTR PLU DEF NOM "peng"))
("<betalas>" <19451> (VB PRS SFO "betala"))
("<ut>" <19452> (PL "ut"))
("<m}nadsvis>" <19453> (AB "m}nadsvis"))
("<och>" <19454> (KN "och"))
("<kan>" <19455> (VB PRS AKT "kunna"))
("<h}llas>" <19456> (VB INF SFO "h}lla"))
("<inne>" <19457> (AB "inne"))
***
("<om>" <19458> (PP "om"))
("<om>" <19458> (SN "om"))
("<oegentligheter>" <19459> (NN UTR PLU IND NOM "oegentlighet"))
("<uppt{cks>" <19460> (VB PRS SFO "uppt{cka"))
("<och>" <19461> (KN "och"))
("<utreds>" <19462> (VB PRS SFO "utreda"))
("<.>" <19463> (DL MAD "."))
("<Det>" <19466> (PN NEU SIN DEF SUB/OBJ "det"))
("<{r>" <19467> (VB PRS AKT "vara"))
("<ett>" <19468> (DT NEU SIN IND "en"))
("<f|rs|k>" <19469> (NN NEU SIN IND NOM "f|rs|k"))
("<att>" <19470> (IE "att"))
("<strama>" <19471> (VB INF AKT "strama"))
("<upp>" <19472> (PL "upp"))
("<floran>" <19473> (NN UTR SIN DEF NOM "flora"))
("<av>" <19474> (PP "av"))
("<f|rm}ner>" <19475> (NN UTR PLU IND NOM "f|rm}n"))
("<,>" <19476> (DL MID ","))
("<som>" <19477> (HP - - - "som"))
("<borde>" <19478> (VB PRT AKT "b|ra"))
("<kunna>" <19479> (VB INF AKT "kunna"))
("<tj{na>" <19480> (VB INF AKT "tj{na"))
("<som>" <19481> (KN "som"))
("<f|rebild>" <19482> (NN UTR SIN IND NOM "f|rebild"))
("<{ven>" <19483> (AB "{ven"))
("<f|r>" <19484> (PP "f|r"))
("<andra>" <19485> (JJ POS UTR/NEU PLU IND/DEF NOM "annan"))
("<f|retag>" <19486> (NN NEU PLU IND NOM "f|retag"))
("<.>" <19487> (DL MAD "."))
("<F\RETAGSLEDNINGARNA>" <19490> (NN UTR PLU DEF NOM "f|retagsledning"))
("<[R>" <19491> (VB PRS AKT "vara"))
("<STILBILDARE>" <19492> (NN UTR PLU IND NOM "stilbildare"))
("<f|r>" <19493> (PP "f|r"))
("<de>" <19494> (DT UTR/NEU PLU DEF "den"))
("<anst{llda>" <19495> (NN UTR PLU IND NOM "anst{lld"))
("<i>" <19496> (PP "i"))
("<f|retagen>" <19497> (NN NEU PLU DEF NOM "f|retag"))
("<.>" <19498> (DL MAD "."))
("<En>" <19499> (DT UTR SIN IND "en"))
("<viss>" <19500> (JJ POS UTR SIN IND NOM "viss"))
("<}terh}llsamhet>" <19501> (NN UTR SIN IND NOM "}terh}llsamhet"))
("<i>" <19502> (PP "i"))
("<k{rva>" <19503> (JJ POS UTR/NEU PLU IND/DEF NOM "k{rv"))
("<tider>" <19504> (NN UTR PLU IND NOM "tid"))
("<{r>" <19505> (VB PRS AKT "vara"))
("<inte>" <19506> (AB "inte"))
("<bara>" <19507> (AB "bara"))
("<kl{dsam>" <19508> (JJ POS UTR SIN IND NOM "kl{dsam"))
***
("<utan>" <19509> (KN "utan"))
("<utan>" <19509> (PP "utan"))
("<kanske>" <19510> (AB "kanske"))
("<helt>" <19511> (AB POS "helt"))
("<n|dv{ndig>" <19512> (JJ POS UTR SIN IND NOM "n|dv{ndig"))
("<f|r>" <19513> (PP "f|r"))
("<att>" <19514> (IE "att"))
("<}terskapa>" <19515> (VB INF AKT "}terskapa"))
("<den>" <19516> (DT UTR SIN DEF "den"))
("<legitimitet>" <19517> (NN UTR SIN IND NOM "legitimitet"))
("<som>" <19518> (HP - - - "som"))
("<Ulf>" <19519> (PM NOM "Ulf"))
("<Laurin>" <19522> (PM NOM "Laurin"))
("<talar>" <19523> (VB PRS AKT "tala"))
("<om>" <19524> (PP "om"))
("<.>" <19525> (DL MAD "."))
("<Kraven>" <19528> (NN NEU PLU DEF NOM "krav"))
("<|kar>" <19529> (VB PRS AKT "|ka"))
("<nu>" <19530> (AB "nu"))
("<p}>" <19531> (PP "p}"))
("<att>" <19532> (SN "att"))
("<b|rsbolagen>" <19533> (NN NEU PLU DEF NOM "b|rsbolag"))
("<ska>" <19534> (VB PRS AKT "ska"))
("<offentligg|ra>" <19535> (VB INF AKT "offentligg|ra"))
("<sina>" <19536> (PS UTR/NEU PLU DEF "sin"))
("<f|rpliktelser>" <19537> (NN UTR PLU IND NOM "f|rpliktelse"))
("<av>" <19538> (PP "av"))
("<det>" <19539> (DT NEU SIN DEF "den"))
("<h{r>" <19540> (AB "h{r"))
("<slaget>" <19541> (NN NEU SIN DEF NOM "slag"))
("<.>" <19542> (DL MAD "."))
("<Det>" <19543> (PN NEU SIN DEF SUB/OBJ "det"))
("<{r>" <19544> (VB PRS AKT "vara"))
("<ett>" <19545> (DT NEU SIN IND "en"))
("<rimligt>" <19546> (JJ POS NEU SIN IND NOM "rimlig"))
("<|nskem}l>" <19547> (NN NEU SIN IND NOM "|nskem}l"))
("<fr}n>" <19548> (PP "fr}n"))
("<aktie{garna>" <19549> (NN UTR PLU DEF NOM "aktie{gare"))
("<att>" <19550> (IE "att"))
("<veta>" <19551> (VB INF AKT "veta"))
("<vilka>" <19552> (HD UTR/NEU PLU IND "vilken"))
("<potentiella>" <19553> (JJ POS UTR/NEU PLU IND/DEF NOM "potentiell"))
("<utgifter>" <19554> (NN UTR PLU IND NOM "utgift"))
("<som>" <19555> (HP - - - "som"))
("<ligger>" <19556> (VB PRS AKT "ligga"))
("<dolda>" <19557> (PC PRF UTR/NEU PLU IND/DEF NOM "dold"))
("<i>" <19558> (PP "i"))
("<dessa>" <19559> (DT UTR/NEU PLU DEF "denna"))
("<avtal>" <19560> (NN NEU PLU IND NOM "avtal"))
("<.>" <19561> (DL MAD "."))
("<Att>" <19564> (SN "att"))
("<avtalen>" <19565> (NN NEU PLU DEF NOM "avtal"))
("<varit>" <19566> (VB SUP AKT "vara"))
("<hemliga>" <19567> (JJ POS UTR/NEU PLU IND/DEF NOM "hemlig"))
("<har>" <19568> (VB PRS AKT "ha"))
("<s{kert>" <19569> (AB POS "s{kert"))
("<varit>" <19570> (VB SUP AKT "vara"))
("<en>" <19571> (DT UTR SIN IND "en"))
("<f|ruts{ttning>" <19572> (NN UTR SIN IND NOM "f|ruts{ttning"))
("<f|r>" <19573> (PP "f|r"))
("<flera>" <19574> (PN UTR/NEU PLU IND SUB/OBJ "flera"))
("<av>" <19575> (PP "av"))
("<de>" <19576> (DT UTR/NEU PLU DEF "den"))
("<stora>" <19577> (JJ POS UTR/NEU PLU IND/DEF NOM "stor"))
("<avtal>" <19578> (NN NEU PLU IND NOM "avtal"))
("<som>" <19579> (HP - - - "som"))
("<nu>" <19580> (AB "nu"))
("<diskuteras>" <19581> (VB PRS SFO "diskutera"))
("<.>" <19582> (DL MAD "."))
("<Ju>" <19583> (KN "ju"))
("<st|rre>" <19584> (JJ KOM UTR/NEU SIN/PLU IND/DEF NOM "stor"))
("<kravet>" <19585> (NN NEU SIN DEF NOM "krav"))
("<p}>" <19586> (PP "p}"))
("<offentlighet>" <19587> (NN UTR SIN IND NOM "offentlighet"))
("<blir>" <19588> (VB PRS AKT "bli"))
("<,>" <19589> (DL MID ","))
("<desto>" <19590> (AB "desto"))
***
("<troligare>" <19591> (AB KOM "troligt"))
("<troligare>" <19591> (JJ KOM UTR/NEU SIN/PLU IND/DEF NOM "trolig"))
("<{r>" <19592> (VB PRS AKT "vara"))
("<det>" <19593> (PN NEU SIN DEF SUB/OBJ "det"))
("<ocks}>" <19594> (AB "ocks}"))
("<att>" <19595> (SN "att"))
("<de>" <19596> (DT UTR/NEU PLU DEF "den"))
("<avtal>" <19597> (NN NEU PLU IND NOM "avtal"))
("<som>" <19598> (HP - - - "som"))
("<sluts>" <19599> (VB PRS SFO "sluta"))
("<h}ller>" <19600> (VB PRS AKT "h}lla"))
("<sig>" <19601> (PN UTR/NEU SIN/PLU DEF OBJ "sig"))
("<inom>" <19602> (PP "inom"))
("<acceptabla>" <19603> (JJ POS UTR/NEU PLU IND/DEF NOM "acceptabel"))
("<gr{nser>" <19604> (NN UTR PLU IND NOM "gr{ns"))
("<.>" <19605> (DL MAD "."))
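
The records in the sample above follow a regular shape: a token in angle brackets, a numeric position, and a parenthesised reading consisting of tags followed by a quoted base form. The following is a minimal Python sketch of how such lines could be read; it is not part of the MULTEXT tools. The function names, the regular expression, and the character table are assumptions made for illustration. In particular, the table assumes the 7-bit Swedish convention that the sample appears to use ({ } | for the lower-case letters ä å ö, and [ ] \ for the upper-case ones). Lines consisting only of *** or +++ precede tokens that are listed with more than one reading in the sample; since their exact meaning is not stated here, the sketch simply skips them and detects multiple readings by repeated position numbers instead.

```python
import re

# Assumed 7-bit Swedish character convention (illustrative, inferred from the sample).
SWEDISH_7BIT = str.maketrans(
    {"{": "ä", "}": "å", "|": "ö", "[": "Ä", "]": "Å", "\\": "Ö"}
)

# One record: ("<token>" <position> (TAG TAG ... "lemma"))
RECORD = re.compile(
    r'^\("<(?P<token>.*)>" <(?P<pos>\d+)> \((?P<tags>.*) "(?P<lemma>[^"]*)"\)\)$'
)


def parse_record(line):
    """Return a dict for one record line, or None for marker or other lines."""
    match = RECORD.match(line.strip())
    if match is None:
        return None
    return {
        "token": match.group("token").translate(SWEDISH_7BIT),
        "pos": int(match.group("pos")),
        "tags": match.group("tags").split(),
        "lemma": match.group("lemma").translate(SWEDISH_7BIT),
    }


def group_readings(lines):
    """Collect all readings per token position, skipping non-record lines."""
    readings = {}
    for line in lines:
        record = parse_record(line)
        if record is not None:
            readings.setdefault(record["pos"], []).append(record)
    return readings


def ambiguous_positions(readings):
    """Return only those positions that carry more than one reading."""
    return {pos: recs for pos, recs in readings.items() if len(recs) > 1}
```

Applied to the lines of the sample, group_readings would yield one entry per token position, and ambiguous_positions would pick out tokens such as "om" at position 19458, which is listed both as a preposition (PP) and as a subjunction (SN).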
You are invited to send comments and feedback to multext@lpl.univ-aix.fr.