Measuring productivity: the significance of neologisms
Author: Andrew Carstairs-McCarthy
Source: An Introduction To English Morphology
Part and page: 95-8
2024-02-05
So far, I have discussed various aspects of the productivity of derivational processes, such as -ity and -ness suffixation, without the help of any objectively quantifiable measurements. I have not introduced any scale from 0 to 1, say, in terms of which -ness might score 0.9 and -ity 0.5. This may look like a serious defect. Any conclusions about formal regularity and generality, in particular, must be more or less subjective unless they are based on figures, one may think. What we need is a comparison between the actual frequency of a process and its potential frequency, appropriately defined. The more closely the ‘actual’ figure approaches the ‘potential’ figure, the more productive the process is, in some sense.
In practice, however, devising such a measure has turned out to be extremely tricky. For example, I suggested earlier that -ity is formally regular when applied to adjectives with certain suffixes such as -ous, -ive and -able, but otherwise irregular, for example, when it is attached to suffixless adjectives. This has the advantage that a non-existent noun such as ‘richity’ can be classed as formally irregular, but the disadvantage that it entails that actual nouns such as purity, sanity, oddity and severity must be irregular too. If we dislike this outcome, we must extend the range of adjectives that count as potential bases for -ity suffixation so as to include at least pure, sane, odd and severe. But how far should this extension go? If the new potential bases are taken to be just those un-suffixed adjectives for which corresponding nouns in -ity exist, then the ratio of actual to potential -ity nouns will remain high – but only because we have contrived that it should be so. If, at the other extreme, we let pure, sane, odd and severe persuade us that any adjective whatever can be a potential base, then the actual-to-potential ratio dwindles to almost zero, and our measure fails to capture the difference in ‘feel’ between ‘gloriosity’ (plausible, but blocked by glory) and *richity (highly implausible). What is the appropriate intermediate position between these extremes, then? It is hard to answer that question except subjectively. Thus the objectivity that a numerical ratio would supply turns out to be frustratingly elusive.
Since about 1990, however, a new set of numerical measures have been devised that avoid subjective bias and yet seem to correspond well to what we feel we mean when we talk about ‘productivity’. These measures exploit the extremely large corpora, or bodies of linguistic material, that have been assembled on computer by linguists and dictionary-makers for the purpose of studying both the frequency with which words (lexemes and word forms) occur and the contexts in which they occur. For a process to be productive, in one sense, it should be a process that can be used to form brand new lexemes, or neologisms. So can we identify neologisms in one of these large corpora? Unfortunately, we cannot do so directly; all we can tell for certain is that a lexeme with an earlier dated occurrence in the corpus is not brand new. However, we can rely on the fact that most neologisms within the corpus will be rare. In fact, all will be rare except those that quickly become fashionable. So, even if we cannot directly identify neologisms, an alternative that is both appropriate and feasible is to identify words that are extremely rare, especially those that appear only once in the whole corpus: so-called hapax legomena (singular hapax legomenon), a Greek expression borrowed from classical studies, meaning ‘said (only) once’. We can now focus on the morphological processes that are used in hapax legomena (and other very rare words), and compare them with processes that are used in more frequently occurring words.
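The idea of picking out hapax legomena can be illustrated with a minimal Python sketch. The function name `hapax_legomena` and the toy token list are assumptions made up for this illustration; they are not taken from any corpus or from the studies mentioned above, and a real study would also need proper tokenisation and lemmatisation.

```python
from collections import Counter


def hapax_legomena(tokens):
    """Return the set of word forms that occur exactly once in `tokens`.

    Hypothetical helper: word forms are compared case-insensitively,
    and no attempt is made to group word forms into lexemes.
    """
    counts = Counter(token.lower() for token in tokens)
    return {word for word, n in counts.items() if n == 1}


# Toy "corpus", invented purely for illustration.
tokens = "the sadness of the city and the oddness of the purity of the city".split()

print(sorted(hapax_legomena(tokens)))
# prints ['and', 'oddness', 'purity', 'sadness']
```

As the toy output shows, a hapax is not necessarily a neologism ("and" is rare here only because the sample is tiny), which is why the approach is informative only for very large corpora.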
Such studies shed interesting new light on the relationship between the familiar pair -ity and -ness. In the Cobuild corpus of about eighteen million English word-tokens (based at Birmingham University and used by the dictionary publisher Collins), the number of word-types exhibiting -ity (roughly 400) is not greatly less than the number exhibiting -ness (roughly 500). However, most of the -ity words are of common occurrence (more technically, their token-frequency is high), while many more of the types exhibiting -ness have low token frequency, including hapax legomena. That is, although by one measure -ness seems to be not much more productive than -ity is, it is far more likely than -ity to be used in the creation of neologisms.
The suffix -ness rates high both in the number of words that contain it (words as types, that is, not tokens), and in its availability for neologisms. The suffix -ity ranks high by the first measure but low by the second. Could an affix rank low by the first measure and high by the second? The answer is yes. The Cobuild corpus contains relatively few word-types with the suffix -ian (as in Canadian, Wagnerian), yet a very high proportion of these are of low token-frequency. Rather surprisingly, therefore, for an affix to be suitable for use in a brand new word, it does not have to appear in a large number of existing words.
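The two measures contrasted above (how many word-types contain a suffix, and how many of those types are very rare) can be sketched as follows. This is only a rough illustration: the function `suffix_profile`, the hapax share it returns, and the token list are all hypothetical, and matching on word endings is a crude stand-in for real morphological analysis (it would wrongly count "city" as an -ity word, for instance).

```python
from collections import Counter


def suffix_profile(tokens, suffix):
    """Return (type_count, hapax_count, hapax_share) for words ending in `suffix`.

    Assumption: a word "contains" the suffix if its spelling ends with it.
    hapax_share is the fraction of those types that occur exactly once.
    """
    counts = Counter(
        token.lower() for token in tokens if token.lower().endswith(suffix)
    )
    type_count = len(counts)
    hapax_count = sum(1 for n in counts.values() if n == 1)
    hapax_share = hapax_count / type_count if type_count else 0.0
    return type_count, hapax_count, hapax_share


# Invented token list: -ity words recur, most -ness words occur once.
tokens = (
    ["purity"] * 3
    + ["sanity"] * 2
    + ["oddness", "richness", "sadness", "sadness"]
)

ity_types, ity_hapaxes, ity_share = suffix_profile(tokens, "ity")
ness_types, ness_hapaxes, ness_share = suffix_profile(tokens, "ness")

print(ity_types, ity_hapaxes)    # prints 2 0
print(ness_types, ness_hapaxes)  # prints 3 2
```

In this invented sample the two suffixes have similar type counts, but only -ness shows up among the hapaxes, mirroring in miniature the pattern reported for the Cobuild corpus.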
There is far more that could be said about the ways in which studies of very large corpora can shed light on word formation in English. To understand it in greater depth presupposes some knowledge of statistical techniques, however. For present purposes, it is enough to be aware that such statistical studies are being carried out, and that they go a considerable way towards firming up the notions of generality and formal regularity which I defined in an unquantified fashion.