TODO: check upper level organization, remodel strength as categories (as in DCR), rename UsageAndFrequencyFeature to UsageFeature
Top categories of the OLiA Reference Model
2010/01/19 created
2010/04/08 removed NPFunction (=> SyntacticRole)
2010/04/13 added MorphologicalProcess, MorphologicalFeature, DiscourseFeature, AnimacyFeature,
ReferentTypeFeature, RegisterFeature, UsageAndFrequencyFeature
2010/04/14 validation, PossessiveFeature removed (see olia:hasOwnerNumber), moved olia:NarrativeType and olia:PolarityFeature here
2010/04/15 additions in accordance to the PTB Bracketing Guidelines: NullElement, SentenceTypeFeature (Santorini 1991, Bies et al. 1995)
2010/11/30 added TopologicalField in accordance to the TueDa-D/Z annotation guidelines (Telljohann et al. 2009)
2011/07/29 replace url by purl
2011/07/31 added ProximityFeature
2011/08/03 added SpecificityFeature
2011/08/04 SubordTypeFeature, CoordTypeFeature deprecated, added NumeralAgreementClass
2011/08/11 StrengthFeature recast as MorphologicalFeature rather than MorphosyntacticFeature
2011/08/15 EmphasisFeature added
2011/08/15 PhonologicalProcess added (for Elision and Apocope, formerly both classified as MorphologicalProcess)
2013/06/25 EvidentialityFeature, ClusivityFeature added (from ISOcat), intensity as new label to EmphasisFeature
LexicalRelation for labels for relations holding between lexemes
2013/06/27 AgreementFeature (from ISOcat, as superclass of NominalAgreementClass, Person, Gender, Number; not as a relation between words)
2013/06/28 EvaluativeFeature (for ISOcat PreferredEvaluative and PejorativeEvaluative), ModalityFeature (Modality and Mood distinction revised)
2016/04/18 fixed minor validity warnings
2016/08/29 added MovementFeature as cover term for Rising and Control (for compliance with lexinfo)
2020/02/06 fixed use of owl:deprecated
2020/02/24 description of Constituent
2022-05-23 rdfs:labels added (automatically)
Christian Chiarcos, chiarcos@uni-potsdam.de
introduced to account for DCR space, etc.
http://www.isocat.org/datcat/DC-1425
Property attached to a given inflected form that usually permits to distinguish this form from the generic lemmatised form of the word.
(http://www.isocat.org/datcat/DC-1425)
http://www.isocat.org/datcat/DC-1376 (referent type)
Type of concrete object or concept (the referent) that an expression represents (the reference).
(DFKI; http://www.isocat.org/datcat/DC-1376)
referent type
distinguishes for distal and proximal determiners, e.g., in Macedonian (MULTEXT-East, see http://purl.org/olia/mte/multext-east.owl#CliticDistalDeterminer and http://purl.org/olia/mte/multext-east.owl#CliticProximalDeterminer)
In many Indo-European languages, proximity is a relevant feature of pronominal systems (e.g., Macedonian proximal vs. distal determiners, cf. English this vs. that).
In several indigeneous languages of North America, proximity is represented by verbal agreement (then also known as obviation, e.g., Blackfoot third [proximal] and "fourth" [distal 3rd] person).
As defined here, proximity is considered a morphosyntactic feature, because it applies to morphosyntactic *markers* of proximity.
(Chiarcos)
proximity
distance
obviation
intensity
http://www.isocat.org/datcat/DC-2197
adopted from ILPOSTS, there a property of Adverb, Classifier, Demonstrative, Noun, Particle, Adposition, Quantifier, Verb, identified with http://www.isocat.org/datcat/DC-2197 "intensity"
Applies to grammatical (morphosyntactic) markers of emphasis for languages where emphatic expressions are distinguished from non-emphatic forms.
In Irish Gaelic, for example, the unmarked personal pronouns (e.g., sé, é ‘he, him’) is distinguished from the emphatic pronoun (e.g., seisean, eisean ‘he, him’).
Beyond pronouns, also nouns can be emphatically marked, e.g., by adding a clitic reflexive element to them. (Mulkern 2007).
(Ann E. Mulkern. Knowing who’s important: Relative discourse salience and Irish pronominal forms. In Nancy A. Hedberg and Ron Zacharski, editors, The Grammar-Pragmatics Interface: Essays in honor of Jeanette K. Gundel, pages 113–142. John Benjamins, Amsterdam and Philadelphia, 2007.)
Quality of being felt very strongly or having a strong effect (intensity, http://www.isocat.org/datcat/DC-2197)
emphasis
http://www.isocat.org/datcat/DC-3031
The category that encodes "whether the addressee (addressees) are included in or excluded from the set of referents which also contains the speaker".
clusivity
distinguishes specific and nonspecific determiners, e.g., in Klallam (Salish, see olia:SpecificArticle, olia:NonspecificArticle), or in Persian (http://purl.org/olia/mte/multext-east.owl#CliticSpecificDeterminer)
"By ʻspecificʼ and ʻnon-specificʼ I intend the difference between the
two readings of English indefinites like (3):
(3) Iʼm looking for a deer.
In the specific reading there is a particular deer, say Bambi, that I am
looking for. In the non-specific reading I will be happy to find any deer.
Von Heusinger (2002) likes the test in English of inserting ʻcertainʼ after the
ʻaʼ to fix the specific reading. In either reading of (3) a deer is being
introduced as a new discourse referent. This is opposed to ʻdefiniteʼ which
requires a previous pragmatic instantiation as in ʻIʼm looking for the deer.ʼ
In English both the readings of (3) are indefinite. In Klallam, the specific
demonstratives are neither definite nor indefinite."
(Montler, Timothy. 2007. Klallam demonstratives. Papers ICSNL XLVII. The 42nd International Conference on Salish and Neighbouring Language, pp. 409-425. University of British Columbia Working Papers in Linguistics, Volume 20; on specific vs. nonspecific determiners in Klallam, a Salish language, http://montler.net/papers/KlallamDemons.pdf)
specificity
genericity
http://www.isocat.org/datcat/DC-3185
isoCat ID: http://www.isocat.org/datcat/DC-3185
Evidentiality denotes the basis that the speaker
has for claiming that the event has occurred (or is going to take place).
Appear to show a greater number of distinctions in the realis mood
(especially in the past tense) than the irrealis mood. [Bhat 1999: 63-64,
70]
http://www.isocat.org/datcat/DC-1988
Classification indicating the relative level of language individually assigned to a lexeme or term or to a text type.
(ISO12620; http://www.isocat.org/datcat/DC-1988)
register
http://www.isocat.org/datcat/DC-1965 (frequency), http://www.isocat.org/datcat/DC-1959 (dating)
Frequency: The relative commonness with which a term occurs.
(ISO12620; http://www.isocat.org/datcat/DC-1965)
Dating: Indication specifying whether the usage is old or modern.
(http://www.isocat.org/datcat/DC-1959)
In EAGLES, Strength is a language-specific attribute for pronouns and determiners to characterize weak (reduced) and strong (full) forms (http://www.ilc.cnr.it/EAGLES96/annotate/node19.html#oav2av 20.11.06).
Here, it is extended to full and reduced adjective inflection in Slavic languages (cf. MULTEXT-East). May be renamed to ReductionFeature.
true
Deprecated: reimplemented within SubordinatingConjunction taxonomy
The SubordType is in Eagles an additional language-specific attribute, applying to subordinating conjunctions only. (http://www.ilc.cnr.it/EAGLES96/annotate/node19.html#oav2u 17.11.06)
verb form mood
mood
Modality is a facet of illocutionary point or general intent of a speaker,
or a speaker's degree of commitment to the expressed proposition's believability,
obligatoriness, desirability or reality.
(ISO12620; http://www.isocat.org/datcat/DC-1427)
The term „mood“ is used by some authors in the same sense as „modality“, while
others distinguish the two (...) using „mood“ to refer to the contrastive
grammatical expressions of different modalities, and thus reserving „modality“
to refer to the meanings so expressed. >A grammatical category is related to
a variety of factors affecting the nature of a predication, such as factors include
factivity, certainty (evidentials), attitudes, speaker’s knowledge/beliefs/desires,
agent’s ability/volitionality, etc.; a set of distinctive forms used to express
modality (as verbal inflections or a set of auxiliarys, each signaling a modality).
(http://www.uni-erfurt.de/sprachwissenschaft/proxy.php?port=8080&file=lido/servlet/Lido_Servlet Modus 14.05.07)
http://www.isocat.org/datcat/DC-1427
mood
modality
Modality is a facet of illocutionary point or general intent of a speaker,
or a speaker's degree of commitment to the expressed proposition's believability,
obligatoriness, desirability or reality.
(ISO12620; http://www.isocat.org/datcat/DC-1427)
The term „mood“ is used by some authors in the same sense as „modality“, while
others distinguish the two (...) using „mood“ to refer to the contrastive
grammatical expressions of different modalities, and thus reserving „modality“
to refer to the meanings so expressed. >A grammatical category is related to
a variety of factors affecting the nature of a predication, such as factors include
factivity, certainty (evidentials), attitudes, speaker’s knowledge/beliefs/desires,
agent’s ability/volitionality, etc.; a set of distinctive forms used to express
modality (as verbal inflections or a set of auxiliarys, each signaling a modality).
(http://www.uni-erfurt.de/sprachwissenschaft/proxy.php?port=8080&file=lido/servlet/Lido_Servlet Modus 14.05.07)
http://www.isocat.org/datcat/DC-1427
agreement
PVAGR
http://www.isocat.org/datcat/DC-2188
http://www.isocat.org/datcat/DC-4973
Agreement: Formal relationship whereby a word (or a sub-part of a word) requires a corresponding form of another word (or sub-part of a word) (http://www.isocat.org/datcat/DC-2188)
PVAGR: agreement as shown (stem + affix) in inflected forms of a finite verb (http://www.isocat.org/datcat/DC-4973)
http://purl.org/olia/mte/multext-east.owl#NumeralAgreementClass
In most Slavic languages, Numerals and Quantifiers involve specific agreement patterns, e.g., in Russian:<br/>
(a) SingularQuantifier (MTE v4: Numeral/Class="definite1"): requires noun in nominative singular, e.g., один год "one year"
(b) PaucalQuantifier (MTE v4: Numeral/Class="definite234"): requires noun in genitive singular, e.g., два/три/четыре года "two/three/four years"
(c) PluralQuantifier (MTE v4: Numeral/Class="definite"):requires noun in genitive plural, e.g., пять/много/сколько/столько лет "five/many/how many/that many years"<br/>
Bulgarian has done away with the distinction between 4 and 5, and generalised the 2-4 form to all numerals (and some other quantifiers), but the others generally keep it. Also Slovene has a living dual (both Sorbians likewise, but they haven't been MTEd).<br/>
Some Czech feminine and neuter body parts have preserved dual forms, and if the noun is dual, so are its attributes (adjectives, pronouns). So 2 differs formally from 3-4. The corresponding agreement pattern is a DualQuantifier (MTE v4: Numeral/Class="definite2").
(Ivan A. Derzhanski & Christian Chiarcos, http://purl.org/olia/mte/multext-east.owl#NumeralAgreementClass)
aspect
http://www.isocat.org/datcat/DC-1242
Category associated to verbs and referring to the way the grammar marks the duration or type of temporal activity. (http://www.isocat.org/datcat/DC-1242)
Aspect is a grammatical category associated with verbs that expresses a temporal view of the event or state expressed by the verb. (http://www.sil.org/linguistics/glossaryoflinguisticterms/WhatIsAspect.htm 17.11.06)
The in Eagles optional attribute Aspect is needed for Greek and Slavonic verbs. It corresponds also to the Past Simple/Imperfect distinction of Romance languages. (http://www.ilc.cnr.it/EAGLES96/annotate/node18.html#oav1av 17.11.06)
http://www.isocat.org/datcat/DC-1298
Grammatical category for the variation in form of nouns, pronouns, and any words agreeing
with them, depending on how many persons or things are referred to.
(www.wordreference.com/English/definition.asp?en=number 12; http://www.isocat.org/datcat/DC-1298)
A grammatical number is a morphological category characterized by the expression of quantity
through inflection or agreement. (http://en.wikipedia.org/wiki/Grammatical_number 17.11.06)
generalization over PreferredEvaluative and PejorativeEvaluative in ISOcat
Represents the difference between countable and uncountable nouns, e.g., in English
EAGLES
http://www.isocat.org/datcat/DC-1902, extended to cover http://purl.org/olia/mte/multext-east.owl#Humanness
The characteristic of a word indicating that in a given discourse community, its referent is considered to be alive or to possess a quality of volition or consciousness.
(ISO12620; http://www.isocat.org/datcat/DC-1902)
animacy
Syntactic valency pertains to the number of syntactic arguments a verb requires.
Semantic valency pertains to the number of arguments of a semantic predicate.
If syntactic valency is greater than the semantic valency, an expletive pronoun may be used,
cf. van Valin and LaPolla (1997) on the distinction between syntactic valency and semantic valency.
EAGLES
In generative grammar, syntactic alternations of semantically equivalent utterances are explained by movement operations.
Even though alternative analyses have been proposed, we keep this conventional term to account for syntactic phenomena such as Raising and Control.
Introduced as part of the lexinfo extension as a cover term for Raising and Control properties of (lexical entries of) syntactic verbs.
http://www.isocat.org/datcat/DC-2240 (reduplication type, no values given)
type of reduplication
(http://www.isocat.org/datcat/DC-2240)
reduplication type
The optional Eagles attribute Separability is relevant for German compound verbs ("fängt ...an", "anfangen") and also to phrasal verbs in Danish and English.
(http://www.ilc.cnr.it/EAGLES96/annotate/node18.html#oav1c 15.11.06)
The grammatical person is deictic reference to the participant role of a referent, such as the speaker, the addressee, and others. Grammatical person typically defines a language's set of personal pronouns. It also frequently affects verbs, sometimes nouns, and possessive relationships as well. (http://en.wikipedia.org/wiki/Grammatical_person 17.11.06)
Indication of grammatical person (1st, 2nd, 3rd, etc.) associated with a given inflected form.
(ISO12620; http://www.isocat.org/datcat/DC-1328)
morphosyntactic category
part of speech
http://www.isocat.org/datcat/DC-1345
partOfSpeech: Term used to describe how a particular word is used in a sentence. (http://www.isocat.org/datcat/DC-1345)
"extended" EAGLES
- saubere Definitionen statt form-basierter Zusammenlegung von Kategorien
- VerbalNoun in der Schnittmenge von NonfiniteVerb und CommonNoun
- Classifier zunächst als top-level-konzept
- AuxiliaryVerb aufgespalten in StrictAuxiliarVerb, ModalVerb, Copula
- neues Konzept Quantifier, da jedoch eher semantisch als syntaktisch definiert, mit Numeral zu einem neuen top-level-node zusammengefasst (entsprechend die Definbition bei Bußmann: Quantor enthält ausdrücklich Numerale, diverse Determinierer und Pronomen)
=> sfb:PRONQUANT -> eagles:Quantifier & eagles:PronounOrDeterminer & !eagles:Numeral
sfb:PRONQUANT_AT -> eagles:Quantifier & eagles:IndefiniteDeterminer & !eagles:Numeral
sfb:PRONQUANT_SU -> eagles:Quantifier & eagles:IndefinitePronoun & !eagles:Numeral
=> Numeral als Subkonzept von Quantifier
TODO: saubere definition statt form-baseierten kategorien
http://www.isocat.org/datcat/DC-1415
Terminal nodes of syntactic annotations, termed "Word" here, are the same structural entities that are subject to morphosyntactic (Part of Speech) annotations.
cf. http://www.isocat.org/datcat/DC-1415 "word": Linguistic unit composed of at least a part of speech and a lemma.
SyntacticWord is the class of syntactic units occupying the lowest position in
a syntactic construction. They are the largest units resistant to insertion of
new constituents within their boundaries; or they are the smallest constituents
that can be moved within a sentence without making the sentence ungrammatical.
(http://www.linguistics-ontology.org/gold/2008/SyntacticWord)
Linguistic unit composed of at least a part of speech and a lemma.
(Gil Francopoulo; http://www.isocat.org/datcat/DC-1415)
http://www.isocat.org/datcat/DC-1415, cf. GOLD SyntacticWord
word
In syntactic theory, a constituent is a unit of words that constitute a unit of multiple independent words. Formally, this can be illustrated by substitution (and other tests):
"[The dog] bit Peter.": valid, "the dog" can be replaced by "lions" (etc.).
"*The [dog bit] Peter.": non-valid, there is no valid one-word replacement for "dog bit".
"The dog [bit Peter].": valid, "bit Peter" can be replaced by "slept" (etc.)
Note that OLiA takes a theory-independent stance on "constituent". Constituent thus does not necessarily refer to a syntactic constituent, but to a node in syntax annotation that contains other nodes. This includes elements whose constituency has been debated (verb phrases), but also elements that are (potentially incomplete) phrase candidates (chunks).
(Christian Chiarcos)
Modelled like tokens, Santorini (1991, §4.2), Bies (1995, §2.3, §2.5)
added in accordance with PTB bracketing guidelines ,Santorini (1991, §4.2), Bies (1995, §2.3, §2.5)
verb tense
tense
http://www.isocat.org/datcat/DC-4964
verb tense: property referring to the way the grammar marks (via affixes and/or suppletion) the time at which the action denoted by the verb took place. (http://www.isocat.org/datcat/DC-4964)
Tense is a grammatical category, typically marked on the verb, that deictically refers to the time of the event or state denoted by the verb in relation to some other temporal reference point. (http://www.sil.org/linguistics/glossaryoflinguisticterms/WhatIsTense.htm 17.11.06)
Morphosyntactic and morphological features.
The voice of a verb describes the relationship between the action (or state) that the verb expresses and the participants identified by its arguments (subject, object, etc.). (http://en.wikipedia.org/wiki/Grammatical_voice 17.11.06)
The term gender refers to various forms of expressing biological or sociological gender by inflecting words. Nouns, pronouns, articles and the adjectives denote the gender of their referent. (http://en.wikipedia.org/wiki/Grammatical_gender 17.11.06)
Category based on (depending on languages) the natural distinction between sex and formal criteria.
(http://www.isocat.org/datcat/DC-1297)
InflectionType is in Eagles an optional attribute for adjectives. Weak and Strong are values for adjectival inflection in the Germanic languages German, Dutch and Danish. (http://www.ilc.cnr.it/EAGLES96/annotate/node18.html#oav1av 14.11.06)
true
Deprecated: reimplemented as subhierarchy of CoordinatingConjunction
The CoordType attribute subclassifies coordinating conjunctions. (http://www.ilc.cnr.it/EAGLES96/annotate/node18.html#oav1av 17.11.06)
http://www.isocat.org/datcat/DC-1926
definiteness
In grammatical theory, definiteness is a feature of noun phrases, distinguishing between entities which are specific and identifiable in a given context (definite noun phrases) and entities which are not (indefinite noun phrases).
(http://en.wikipedia.org/wiki/Definiteness 02.05.07)
Property about the possiblity to identify an entity. (http://www.isocat.org/datcat/DC-1926)
An in Eagles additional language-specific attribute for nouns/ noun phrases is Definiteness.
(http://www.ilc.cnr.it/EAGLES96/annotate/node19.html#oav2 16.11.06)
Note that olia_system:LinguisticAnnotation, olia_system:UnitOfAnnotation and olia_system:Feature pertain to the usage of type labels and annotated features according to a particular annotation scheme, whereas, here, category and feature mean linguistic concepts. Thus, an olia:MorphosyntacticCategory (etc.) is not necessarily represented by a olia_system:UnitOfAnnotation.
The OLiA ontology specifies linguistic concepts on a theoretical basis, as for concepts used in annotations, see system.owl.
There is a great extent of overlap between LinguisticCategories/LinguisticFeatures and Categories/Features as defined in system.owl.
degree
http://www.isocat.org/datcat/DC-1419
Property concerning comparison. (http://www.isocat.org/datcat/DC-1419)
The Eagles-recommended attribute Degree applies only to inflectional comparatives and superlatives. In some languages, e.g. Spanish, the number of such adjectives is very small. (http://www.ilc.cnr.it/EAGLES96/annotate/node17.html#recn)
The optional attribute Reflexivity is applied to main verbs in French, German, Dutch, etc., and determines the selection of "avoir" or "être", etc., as auxiliary for the Perfect.
(http://www.ilc.cnr.it/EAGLES96/annotate/node18.html#oav1c 15.11.06)
Case is a grammatical category determined by the syntactic or semantic function of a noun or pronoun. The term case has traditionally been restricted to apply to only those languages which indicate certain functions by the inflection of nouns, pronouns, or noun phrase constituents, such as adjectives and numerals. (http://www.sil.org/linguistics/glossaryoflinguisticterms/WhatIsCase.htm 17.11.06)
In linguistics, a theta role or θ-role is the semantic role a noun
phrase plays in a sentence. The term Thematic role denotes the same
concept. As such it is a semantic rather than a syntactic feature, in
contrast to such notions as the subject of a sentence or a
prepositional object.
(http://en.wikipedia.org/wiki/Subject_(linguistics).
Originally, semantic roles were referred to as "case roles", leading to
an unfortunate name clash with grammatical case (e.g. in GOLD CaseValue)
classification of sentences according to their function, e.g., illocutionary act
Bies et al 1995
Relation to be used when the syntactic function of a constituent is different from its morphosyntactic type,
cf. FormFunctionDiscrepancy in the PTB bracketing guidelines, Bies et al. (1995, §2.2.1)
introduced for relation labels such as homonym. These label relations holding between two lexemes.
Traditionally, "grammatical relations" or "grammatical roles" are specifically those between the verb (clause) and its arguments/adjunct/complementizer.
In modern corpus research, however, a broad variety of relations between nominal heads, their arguments, modifiers, etc. are distinguished and the scope of "Grammatical Relation" has extended here to cover these as well.
(http://purl.org/linguistics/gold/syntacticRole)
Syntactic constructions that involve multiple constituents, or that are independent of the concept of constituent, e.g., word order phenomena, non-canonical sentences , ...
agreement feature
animacy feature
aspect feature
case feature
clusivity feature
constituent
coord type feature
countability feature
definiteness feature
degree feature
dependency relation
discourse entity
discourse feature
dominance relation
emphasis feature
evaluative feature
evidentiality feature
gender feature
inflection type feature
lexical relation
linguistic concept
modality feature
mood feature
morphological category
morphological feature
morphological process
morphosyntactic category
morphosyntactic feature
movement feature
narrative type
null element
number feature
numeral agreement class
orthographic entity
person feature
phonological process
polarity feature
proximity feature
reduplication type feature
referent type feature
reflexivity feature
register feature
semantic feature
semantic role
semantic unit
sentence type feature
separability feature
specificity feature
strength feature
subord type feature
syntactic category
syntactic construction
syntactic feature
syntactic function
syntactic relation
syntactic role
tense feature
topological field
usage and frequency feature
valency feature
voice feature
word