material_in_the_databases
Differences
This shows you the differences between two versions of the page.
material_in_the_databases [2022/03/01 10:08] – pm | material_in_the_databases [2023/09/06 09:49] (current) – pm | ||
---|---|---|---|
Line 2: | Line 2: | ||
We give a description of the material included in the two sub-bases. | We give a description of the material included in the two sub-bases. | ||
- | ===== BCS ===== | + | ===== BCMS ===== |
- | Add description | + | The verb selection for BCMS was conducted using the corpora [[https:// |
- | + | The different forms that a verb takes in two or all of the varieties were entered as separate entries and annotated as variants of one verb. Typical examples of variants are ekavian and ijekavian versions (e.g. // | |
- | **Some general notes**\\ | + | borrowed verbs (e.g. // |
===== Slovenian | ===== Slovenian | ||
- | The list of the most common Slovenian verbs was made using [[https:// | + | The list of the 3000 most common Slovenian verbs was made using [[https:// |
+ | |||
+ | ===== Some general notes ===== | ||
- | **Some general notes**\\ | + | Items that got on the list due to mistakes in annotation in the corpus were excluded from our list and replaced by the next verb on the list of most common verbs. One such example from Slovenian is ‘// |
- | Items that got on the list due to mistakes in annotation in the corpus | + | The list of verbs includes several homophonous verbs. Since the corpus |
- | The list of verbs includes several homophonous verbs. Since the corpus is not annotated for meaning, homophonous verbs are counted as one verb. For example, the verb //brati// can mean ‘read’ or ‘gather, collect’. In such cases the annotators annotated the verb for the properties associated with what they took to be the more frequent use of the verb. The same goes for prefixed versions (// | ||
+ | ===== Annotation ===== | ||
+ | The following properties were annotated for each verb. If a property is only applicable to one sub-base, this is indicated. | ||
+ | * the verb’s regional variant, | ||
+ | * its variants regarding the realization of the phoneme yat (for SC), | ||
+ | * the base of the verb (the chunk preceding the theme vowel), | ||
+ | * the 3rd person singular present tense form, | ||
+ | * the theme vowel, | ||
+ | * frequency in the corpus (tokens per million words), | ||
+ | * availability of imperfective interpretation, | ||
+ | * prefix (the rightmost one), | ||
+ | * second prefix (second rightmost), | ||
+ | * third prefix, | ||
+ | * whether the verb can be intransitive, | ||
+ | * whether the verb can be intransitive with an external argument, | ||
+ | * whether the verb denotes a state, | ||
+ | * whether the verb can take an argument (each type of argument annotated as a separate property): in the accusative case, in the dative case, in the genitive case, in the instrumental case, a clausal argument, a PP argument, an obligatory reflexive accusative, | ||
+ | * whether there are two verbs, one without and another with the reflexive accusative, | ||
+ | * the verb’s aspectual pair, | ||
+ | * whether each of the following morphological operations applies to the verb to get to the aspectual pair (descriptively speaking; the application of each operation to derive the aspectual pair annotated as a separate property): adding a suffix, removing a suffix, adding a prefix, removing a prefix, apophony, theme vowel change, suppletion, | ||
+ | * whether the verb includes each of the following suffixes (availability of each suffix annotated as a separate property, the suffixes in bold only in SC): **ava/ | ||
+ | * whether the verb is simplex (root + theme vowel + inflection), | ||
+ | * whether the verb is derived from a word of another category, | ||
+ | * whether the verb involves root allomorphy and the list of root allomorphs, | ||
+ | * for each of the following positions it was marked whether it bears prosodic prominence, | ||
+ | * the passive participle, | ||
+ | * the -nje nominalization. | ||
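
The annotated properties listed above can be pictured as one record per verb. The following is a minimal sketch in Python, not the database's actual schema: the class name `VerbEntry`, all field names, and the helper `tokens_per_million` are hypothetical, only a subset of the properties is modelled, and the values in the example (including the token count and corpus size) are made-up placeholders rather than figures from the corpus.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class VerbEntry:
    """One annotated verb. Field names are illustrative assumptions."""
    lemma: str
    base: str                    # the chunk preceding the theme vowel
    theme_vowel: str
    present_3sg: str             # 3rd person singular present tense form
    freq_per_million: float      # tokens per million words in the corpus
    # Prefixes ordered from the rightmost one outwards (prefix, second, third).
    prefixes: List[str] = field(default_factory=list)
    can_be_intransitive: bool = False
    denotes_state: bool = False
    takes_accusative: bool = False
    takes_dative: bool = False
    aspectual_pair: Optional[str] = None
    root_allomorphs: List[str] = field(default_factory=list)

    @property
    def has_root_allomorphy(self) -> bool:
        # More than one listed allomorph means the root alternates.
        return len(self.root_allomorphs) > 1


def tokens_per_million(token_count: int, corpus_size: int) -> float:
    """Normalize a raw token count to tokens per million words."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    return token_count * 1_000_000 / corpus_size


# Placeholder example: the counts below are invented for illustration.
brati = VerbEntry(
    lemma="brati",
    base="br",
    theme_vowel="a",
    present_3sg="bere",
    freq_per_million=tokens_per_million(1200, 1_000_000_000),
    takes_accusative=True,
    root_allomorphs=["br", "ber"],
)
```

Keeping each argument type and each suffix as its own boolean field mirrors the note that every such property was annotated separately; a real export of the database may of course name and group the columns differently.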
material_in_the_databases.1646125701.txt.gz · Last modified: 2022/03/01 10:08 by pm