Corpus info
General stats
Tokens |
Words |
Types | 14369 |
Lemmas | 5012 |
Hapax legomena | 8213 |
Dis legomena | 2607 |
POS tags | 546 |
Documents
Number of documents |
Average (tokens per document) | 524 |
Median (tokens per document) | 350 |
Longest document (tokens) | 9105 |
Shortest document (tokens) | 30 |
Oldest document (year) | |
Most recent document (year) | |
Group by part of speech
Main POS tag | N | % |
untagged | 104842 | 49.89 |
common noun | 19383 | 9.22 |
preposition | 17854 | 8.50 |
verb | 16503 | 7.85 |
pronoun | 13130 | 6.25 |
determiner | 11383 | 5.42 |
conjunction | 8888 | 4.23 |
punctuation | 8351 | 3.97 |
adverb | 4704 | 2.24 |
adjective | 2914 | 1.39 |
proper noun | 1750 | 0.83 |
interjection | 436 | 0.21 |
numeral | 0 | 0.00 |
foreign word | 0 | 0.00 |
Total | 210138 | 100.00 |
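The percentage column above can be re-derived from the raw counts. The following is a minimal sketch, assuming each percentage is simply the tag count divided by the table total and rounded to two decimals; the dictionary name `pos_counts` is illustrative and not part of the original page.

```python
# Token counts per main POS tag, copied from the table above.
pos_counts = {
    "untagged": 104842,
    "common noun": 19383,
    "preposition": 17854,
    "verb": 16503,
    "pronoun": 13130,
    "determiner": 11383,
    "conjunction": 8888,
    "punctuation": 8351,
    "adverb": 4704,
    "adjective": 2914,
    "proper noun": 1750,
    "interjection": 436,
    "numeral": 0,
    "foreign word": 0,
}

# The grand total should match the "Total" row of the table.
total = sum(pos_counts.values())

# Share of each tag as a percentage of all tokens (assumed rounding: 2 decimals).
shares = {tag: round(100 * n / total, 2) for tag, n in pos_counts.items()}

for tag, pct in shares.items():
    print(f"{tag:<14} {pos_counts[tag]:>7} {pct:>6.2f}")
print(f"{'Total':<14} {total:>7}")
```

Because almost half of the tokens are untagged, shares computed over the tagged tokens alone would be roughly twice as large as the figures in the table.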
Group by project
Group by text type
Group by century
Group by province
Province | N | % |
_ | 98571 | 46.91 |
Guipúzcoa | 20865 | 9.93 |
Valladolid | 20256 | 9.64 |
Nápoles | 17001 | 8.09 |
Barcelona | 7857 | 3.74 |
Zaragoza | 5765 | 2.74 |
s.l. | 4304 | 2.05 |
¿? | 3666 | 1.74 |
Huesca | 2814 | 1.34 |
Madrid | 2660 | 1.27 |
Valencia | 2496 | 1.19 |
Vizcaya | 2408 | 1.15 |
Nueva Aquitania | 2390 | 1.14 |
Caller | 1751 | 0.83 |
Barcerlona | 1688 | 0.80 |
Roma | 1622 | 0.77 |
Lacio | 1312 | 0.62 |
Piacenza | 1122 | 0.53 |
Viena | 1121 | 0.53 |
Álava | 1105 | 0.53 |
Isla de Francia | 950 | 0.45 |
Toledo | 792 | 0.38 |
Tirol | 693 | 0.33 |
Figueras | 602 | 0.29 |
Perpiñán | 572 | 0.27 |
Mühldorf | 567 | 0.27 |
Salamanca | 550 | 0.26 |
Amberes | 510 | 0.24 |
Castellón | 449 | 0.21 |
Sevilla | 431 | 0.21 |
Navarra | 406 | 0.19 |
París | 380 | 0.18 |
Castilnuovo de Nápoles | 352 | 0.17 |
Murcia | 330 | 0.16 |
Liguria | 317 | 0.15 |
Guipúzcoa? | 290 | 0.14 |
Bruselas-Capital | 260 | 0.12 |
Aragón | 250 | 0.12 |
Jaén | 248 | 0.12 |
Badajoz | 212 | 0.10 |
Lisboa | 203 | 0.10 |
Total | 210138 | 100.00 |
Group by institution
Institution | N | % |
Total | 0 | 100.00 |
Group by century and province (absolute frequencies)
Province | XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
Almería | | | | | | 0 | 0 |
Granada | | | | | | 0 |
Jaén | | | | | | 0 |
Málaga | | | | | | 0 | 0 |
Córdoba | | | | | | 0 |
Cádiz | | | | | | 0 | 0 |
Sevilla | | | | | | 0 |
Huelva | | | | | | 0 |
Madrid | | | | | | 0 | 0 |
Burgos | | | | | | 0 |
others | | | | | | 0 | 0 |
Total (century) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Group by century and province (relative frequencies)
Province | XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
Almería | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Granada | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Jaén | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Málaga | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Córdoba | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Cádiz | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Sevilla | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Huelva | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Madrid | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Burgos | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
others | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Total (century) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100.00 | 100.00 |
Measures of lexical diversity
Measure | Description | Formula | Result |
TTR | type-token ratio | | 0.071 |
RTTR | Guiraud's root type-token ratio | | 31.987 |
CTTR | Carroll's corrected type-token ratio | | 22.619 |
C | Herdan's C index | | 0.784 |
S | Somers' S index | | 0.903 |
M | Maas' index | | 0.029 |
H | Honoré's index | | 2851.151 |
K | Yule's K index | | 110.745 |
D | Simpson's D index | | 0.011 |
HTR | Hapax-type ratio | | 0.572 |
DTR | Dis-type ratio | | 0.181 |
VGR | Vocabulary growth rate | | 0.041 |
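The Formula column is empty in this table, so the sketch below shows how several of the results can be recomputed from the counts in the general stats. It uses the textbook definitions of each measure, which may differ in detail from what the corpus tool implements. The word count `N_TOKENS` is an assumption: it is taken as the POS-table total minus punctuation (210138 - 8351 = 201787), since the Tokens and Words cells are blank. Under that assumption the standard formulas agree with the reported results to within rounding, and HTR and DTR match when hapax and dis legomena are divided by the number of types. Yule's K and Simpson's D are omitted because they need the full frequency spectrum, and Maas' M is omitted because the variant used is not clear from the table. The function name `lexical_diversity` is illustrative.

```python
import math


def lexical_diversity(n_tokens, n_types, n_hapax, n_dis):
    """Compute a few lexical diversity measures from summary counts.

    n_tokens: number of word tokens (N)
    n_types:  number of distinct word types (V)
    n_hapax:  types occurring exactly once (V1)
    n_dis:    types occurring exactly twice (V2)
    """
    log_n = math.log(n_tokens)
    log_v = math.log(n_types)
    return {
        "TTR": n_types / n_tokens,                   # V / N
        "RTTR": n_types / math.sqrt(n_tokens),       # V / sqrt(N)
        "CTTR": n_types / math.sqrt(2 * n_tokens),   # V / sqrt(2N)
        "C": log_v / log_n,                          # log V / log N
        "S": math.log(log_v) / math.log(log_n),      # log log V / log log N
        "H": 100 * log_n / (1 - n_hapax / n_types),  # 100 log N / (1 - V1/V)
        "HTR": n_hapax / n_types,                    # V1 / V
        "DTR": n_dis / n_types,                      # V2 / V
        "VGR": n_hapax / n_tokens,                   # V1 / N
    }


# Assumption: word tokens = POS total minus punctuation tokens.
N_TOKENS = 210138 - 8351

metrics = lexical_diversity(N_TOKENS, n_types=14369, n_hapax=8213, n_dis=2607)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

C and S are ratios of logarithms, so the choice of logarithm base does not affect them; H is computed with the natural logarithm here.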