Corpus info

General stats

| Statistic | Value |
|---|---|
| Tokens | 210138 |
| Words | 201787 |
| Types | 14369 |
| Lemmas | 5012 |
| Hapax legomena | 8213 |
| Dis legomena | 2607 |
| POS tags | 546 |

Documents

| Statistic | Value |
|---|---|
| Number of documents | 401 |
| Average (tokens per document) | 524 |
| Median (tokens per document) | 350 |
| Longest document (tokens) | 9105 |
| Shortest document (tokens) | 30 |
| Oldest document (year) | |
| Most recent document (year) | |

Group by part of speech

| Main POS tag | N | % |
|---|---|---|
| untagged | 104842 | 49.89 |
| common noun | 19383 | 9.22 |
| preposition | 17854 | 8.50 |
| verb | 16503 | 7.85 |
| pronoun | 13130 | 6.25 |
| determiner | 11383 | 5.42 |
| conjunction | 8888 | 4.23 |
| punctuation | 8351 | 3.97 |
| adverb | 4704 | 2.24 |
| adjective | 2914 | 1.39 |
| proper noun | 1750 | 0.83 |
| interjection | 436 | 0.21 |
| numeral | 0 | 0.00 |
| foreign word | 0 | 0.00 |
| Total | 210138 | 100.00 |
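
The percentage column in the part-of-speech table can be re-derived from the raw counts. The sketch below is only an illustration: the dictionary name `pos_counts` and the rounding to two decimals are assumptions, not something the page documents. It also checks that Tokens minus punctuation equals the reported Words figure (210138 − 8351 = 201787).

```python
# Counts copied from the "Group by part of speech" table.
# The variable names are hypothetical; only the numbers come from the page.
pos_counts = {
    "untagged": 104842,
    "common noun": 19383,
    "preposition": 17854,
    "verb": 16503,
    "pronoun": 13130,
    "determiner": 11383,
    "conjunction": 8888,
    "punctuation": 8351,
    "adverb": 4704,
    "adjective": 2914,
    "proper noun": 1750,
    "interjection": 436,
    "numeral": 0,
    "foreign word": 0,
}

# Total number of tokens is the sum over all tags.
total_tokens = sum(pos_counts.values())

# Share of each tag as a percentage of all tokens (assumed two-decimal rounding).
pos_share = {tag: round(100 * n / total_tokens, 2) for tag, n in pos_counts.items()}

# Tokens that are not punctuation; this matches the "Words" row in General stats.
words = total_tokens - pos_counts["punctuation"]

if __name__ == "__main__":
    for tag, share in pos_share.items():
        print(f"{tag:15s} {pos_counts[tag]:7d} {share:6.2f}")
    print("total tokens:", total_tokens, "words:", words)
```

Nearly half of the tokens are untagged, so the shares of the remaining tags are best read relative to the tagged portion only when comparing against fully tagged corpora.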

Group by project

Group by text type

Group by century

Group by province

| Province | N | % |
|---|---|---|
| _ | 98571 | 46.91 |
| Guipúzcoa | 20865 | 9.93 |
| Valladolid | 20256 | 9.64 |
| Nápoles | 17001 | 8.09 |
| Barcelona | 7857 | 3.74 |
| Zaragoza | 5765 | 2.74 |
| s.l. | 4304 | 2.05 |
| ¿? | 3666 | 1.74 |
| Huesca | 2814 | 1.34 |
| Madrid | 2660 | 1.27 |
| Valencia | 2496 | 1.19 |
| Vizcaya | 2408 | 1.15 |
| Nueva Aquitania | 2390 | 1.14 |
| Caller | 1751 | 0.83 |
| Barcerlona | 1688 | 0.80 |
| Roma | 1622 | 0.77 |
| Lacio | 1312 | 0.62 |
| Piacenza | 1122 | 0.53 |
| Viena | 1121 | 0.53 |
| Álava | 1105 | 0.53 |
| Isla de Francia | 950 | 0.45 |
| Toledo | 792 | 0.38 |
| Tirol | 693 | 0.33 |
| Figueras | 602 | 0.29 |
| Perpiñán | 572 | 0.27 |
| Mühldorf | 567 | 0.27 |
| Salamanca | 550 | 0.26 |
| Amberes | 510 | 0.24 |
| Castellón | 449 | 0.21 |
| Sevilla | 431 | 0.21 |
| Navarra | 406 | 0.19 |
| París | 380 | 0.18 |
| Castilnuovo de Nápoles | 352 | 0.17 |
| Murcia | 330 | 0.16 |
| Liguria | 317 | 0.15 |
| Guipúzcoa? | 290 | 0.14 |
| Bruselas-Capital | 260 | 0.12 |
| Aragón | 250 | 0.12 |
| Jaén | 248 | 0.12 |
| Badajoz | 212 | 0.10 |
| Lisboa | 203 | 0.10 |
| Total | 210138 | 100.00 |

Group by institution

| Institution | N | % |
|---|---|---|
| Total | 0 | 100.00 |

Group by century and province (absolute frequencies)

| Province | XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
|---|---|---|---|---|---|---|---|
| Almería | | | | | | 0 | 0 |
| Granada | | | | | | 0 | |
| Jaén | | | | | | 0 | |
| Málaga | | | | | | 0 | 0 |
| Córdoba | | | | | | 0 | |
| Cádiz | | | | | | 0 | 0 |
| Sevilla | | | | | | 0 | |
| Huelva | | | | | | 0 | |
| Madrid | | | | | | 0 | 0 |
| Burgos | | | | | | 0 | |
| others | | | | | | 0 | 0 |
| Total (century) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |

Group by century and province (relative frequencies)

| Province | XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
|---|---|---|---|---|---|---|---|
| Almería | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Granada | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Jaén | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Málaga | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Córdoba | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Cádiz | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Sevilla | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Huelva | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Madrid | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Burgos | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| others | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Total (century) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100.00 | 100.00 |

Measures of lexical diversity

| Measure | Description | Formula | Result |
|---|---|---|---|
| TTR | type-token ratio | | 0.071 |
| RTTR | Guiraud's root type-token ratio | | 31.987 |
| CTTR | Carroll's corrected type-token ratio | | 22.619 |
| C | Herdan's C index | | 0.784 |
| S | Somers' S index | | 0.903 |
| M | Maas' index | | 0.029 |
| H | Honoré's index | | 2851.151 |
| K | Yule's K index | | 110.745 |
| D | Simpson's D index | | 0.011 |
| HTR | Hapax-token ratio | | 0.572 |
| DTR | Dis-token ratio | | 0.181 |
| VGR | Vocabulary growth rate | | 0.041 |
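
The Formula column of the lexical diversity table is empty, so the page's exact definitions cannot be confirmed from the text. The sketch below assumes the common textbook definitions with natural logarithms, and takes N to be the Words count (201787) rather than Tokens. That choice of N is an assumption, made because it yields values close to the reported results. Under the same assumption, the reported HTR and DTR correspond to hapax and dis legomena divided by the number of types, despite the "token" in their names. Yule's K and Simpson's D are left out because they need the full frequency spectrum, which the page does not report. Maas' M is left out because its exact variant could not be confirmed from the page.

```python
import math

# Counts copied from the "General stats" table.
# Using Words (not Tokens) as N is an assumption, not documented by the page.
N = 201_787   # words
V = 14_369    # types
V1 = 8_213    # hapax legomena (types occurring once)
V2 = 2_607    # dis legomena (types occurring twice)

# Assumed standard definitions; natural logarithms throughout.
measures = {
    "TTR": V / N,                                         # type-token ratio
    "RTTR": V / math.sqrt(N),                             # Guiraud's root TTR
    "CTTR": V / math.sqrt(2 * N),                         # Carroll's corrected TTR
    "C": math.log(V) / math.log(N),                       # Herdan's C
    "S": math.log(math.log(V)) / math.log(math.log(N)),   # Somers' S
    "H": 100 * math.log(N) / (1 - V1 / V),                # Honoré's H
    "HTR": V1 / V,                                        # hapax share of types
    "DTR": V2 / V,                                        # dis legomena share of types
    "VGR": V1 / N,                                        # vocabulary growth rate
}

if __name__ == "__main__":
    for name, value in measures.items():
        print(f"{name:5s} {value:.3f}")
```

Because every quantity here depends only on N, V, V1, and V2, the same dictionary can be recomputed for any subcorpus for which those four counts are available.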