Corpus info

General stats

| Statistic | Value |
|---|---|
| Tokens | 80184 |
| Words | 80184 |
| Types | 8045 |
| Lemmas | 6 |
| Hapax legomenon | 4955 |
| Dis legomenon | 1263 |
| POS tags | 6 |

Documents

| Statistic | Value |
|---|---|
| Number of documents | 259 |
| Average (tokens per document) | 310 |
| Median (tokens per document) | 267 |
| Longest document (tokens) | 1071 |
| Shortest document (tokens) | 6 |
| Oldest document (year) | |
| Most recent document (year) | |

Group by part of speech

| Main POS tag | N | % |
|---|---|---|
| untagged | 80183 | 100.00 |
| common noun | 1 | 0.00 |
| proper noun | 0 | 0.00 |
| verb | 0 | 0.00 |
| adjective | 0 | 0.00 |
| adverb | 0 | 0.00 |
| determiner | 0 | 0.00 |
| pronoun | 0 | 0.00 |
| preposition | 0 | 0.00 |
| numeral | 0 | 0.00 |
| interjection | 0 | 0.00 |
| conjunction | 0 | 0.00 |
| foreign word | 0 | 0.00 |
| punctuation | 0 | 0.00 |
| Total | 80184 | 100.00 |

Group by project

Group by text type

Group by century

Group by province

| Province | N | % |
|---|---|---|
| Granada | 19284 | 24.05 |
| Madrid | 12935 | 16.13 |
| _ | 12286 | 15.32 |
| Córdoba | 9873 | 12.31 |
| Salamanca | 8862 | 11.05 |
| Huesca | 7344 | 9.16 |
| País Vasco | 2339 | 2.92 |
| Andalucía | 2099 | 2.62 |
| Navarra | 1124 | 1.40 |
| Sevilla | 656 | 0.82 |
| Cuenca | 568 | 0.71 |
| Madrid | 566 | 0.71 |
| Vizcaya | 490 | 0.61 |
| Hueca | 471 | 0.59 |
| huesca | 430 | 0.54 |
| Valladolid | 365 | 0.46 |
| Castilla-La Mancha | 254 | 0.32 |
| Castilla-la Mancha | 238 | 0.30 |
| Total | 80184 | 100.00 |
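
The % column of the province table can be reproduced from the N column by dividing each count by the corpus total of 80184 tokens and rounding to two decimals. The following is a minimal sketch in Python using three rows copied from the table; the names `counts`, `total` and `shares` are illustrative and do not come from the source.

```python
# Token counts copied from three rows of the "Group by province" table.
counts = {"Granada": 19284, "Madrid": 12935, "Córdoba": 9873}

# Corpus total, as given in the "Total" row.
total = 80184

# Percentage share of each province, rounded to two decimals as in the table.
shares = {name: round(100 * n / total, 2) for name, n in counts.items()}

for name, share in shares.items():
    print(f"{name}: {share:.2f}")
```

The same division applied to every row, including the `_` row and the variant spellings, accounts for the full 80184 tokens.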

Group by institution

| Institution | N | % |
|---|---|---|
| Total | 0 | 100.00 |

Group by century and province (absolute frequencies)

| Province | XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
|---|---|---|---|---|---|---|---|
| Almería | | | | | | 0 | 0 |
| Granada | | | | | | 0 | |
| Jaén | | | | | | 0 | |
| Málaga | | | | | | 0 | 0 |
| Córdoba | | | | | | 0 | |
| Cádiz | | | | | | 0 | 0 |
| Sevilla | | | | | | 0 | |
| Huelva | | | | | | 0 | |
| Madrid | | | | | | 0 | 0 |
| Burgos | | | | | | 0 | |
| others | | | | | | 0 | 0 |
| Total (century) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |

Group by century and province (relative frequencies)

| Province | XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
|---|---|---|---|---|---|---|---|
| Almería | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Granada | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Jaén | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Málaga | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Córdoba | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Cádiz | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Sevilla | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Huelva | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Madrid | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Burgos | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| others | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Total (century) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100.00 | 100.00 |

Measures of lexical diversity

| Measure | Description | Formula | Result |
|---|---|---|---|
| TTR | type-token ratio | | 0.100 |
| RTTR | Guiraud's root type-token ratio | | 28.411 |
| CTTR | Carroll's corrected type-token ratio | | 20.089 |
| C | Herdan's C index | | 0.796 |
| S | Somers' S index | | 0.906 |
| M | Maas' index | | 0.028 |
| H | Honoré's index | | 2939.960 |
| K | Yule's K index | | 122.965 |
| D | Simpson's D index | | 0.012 |
| HTR | Hapax-token ratio | | 0.616 |
| DTR | Dis-token ratio | | 0.157 |
| VGR | Vocabulary growth rate | | 0.062 |
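
The Formula column is empty on this page, so the sketch below recomputes several of the results from the general stats (80184 tokens, 8045 types, 4955 hapax legomena, 1263 dis legomena) using commonly cited textbook definitions. These definitions are an assumption, not something the page states: natural logarithms are assumed throughout, and HTR and DTR are divided by the number of types, since that is the denominator that reproduces the reported 0.616 and 0.157 despite the "token" in their labels. Maas' M, Yule's K and Simpson's D are left out because they depend on the full frequency spectrum or on a parameterization that is not given here.

```python
import math

# Counts copied from the "General stats" table.
N = 80184   # tokens
V = 8045    # types
V1 = 4955   # hapax legomena (types occurring once)
V2 = 1263   # dis legomena (types occurring twice)

# Assumed textbook definitions; the page itself gives no formulas.
measures = {
    "TTR": V / N,                                         # type-token ratio
    "RTTR": V / math.sqrt(N),                             # Guiraud's root TTR
    "CTTR": V / math.sqrt(2 * N),                         # Carroll's corrected TTR
    "C": math.log(V) / math.log(N),                       # Herdan's C
    "S": math.log(math.log(V)) / math.log(math.log(N)),   # Somers' S
    "H": 100 * math.log(N) / (1 - V1 / V),                # Honoré's H
    "HTR": V1 / V,                                        # hapax legomena over types
    "DTR": V2 / V,                                        # dis legomena over types
    "VGR": V1 / N,                                        # hapax legomena over tokens
}

for name, value in measures.items():
    print(f"{name}: {value:.3f}")
```

Under these assumptions the recomputed values agree with the Result column to the three decimals shown, which suggests, but does not prove, that the page uses the same definitions.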