Previous: Embodiment and Symbol Grounding Up: Related Work Next: Neurophysiology of Color Vision
The notion of basic color terms originates in the anthropological and linguistic work of Brent Berlin and Paul Kay in the late sixties [Berlin \& Kay 1969]. They studied color naming behavior with native speakers of a variety of languages, and the existing literature on the subject, and recorded two main findings:
Basic color terms are defined as having the following characteristics:
A set of 329 color chips mounted on a piece of cardboard was used as stimulus material. The color chips were selected using the Munsell color system (Section ), and represented 40 equally spaced hues 8 degrees of brightness, plus 9 neutral hues ranging from white through grey to black (Figure ).
A constant light source was used to illuminate the chips. After the basic color terms of the language in question had been elicited, subjects were instructed to indicate the focal point (best example) of each basic color category and its outer boundary in the set of color chips.
Berlin and Kay found that the foci of basic color terms are similar across totally unrelated languages, and that they cluster into discrete, contiguous areas in the color space. The boundaries between color categories, on the other hand, were found to be variable across languages and even for repeated trials with the same informants. They speculate that unless the effect is due to the experimental procedure,
it is possible that the brain's primary storage procedure for the physical reference of color categories is concerned with points (or very small volumes) of the color solid rather than extended volumes. Secondary processes, of lower salience and inter-subjective homogeneity, would then account for the extensions of reference to points in the color solid not equivalent to (or included in) the focus. (p. 13)
This suggestion is taken up almost 20 years later by [Lakoff 1987], who considers color to be an example of a ``generative cognitive category'', one that consists of a focus (or foci) combined with a ``complex cognitive mechanism'' that generates other members of the category from the focus in a consistent but not a priori predictable manner.
Berlin and Kay also found that although different languages encode different numbers of basic color categories in their vocabularies, a total universal inventory of exactly eleven basic color categories exists, from which the eleven or fewer basic color terms of any given language are always drawn. The English terms corresponding to the eleven categories are
white, black, red, green, yellow, blue, brown, purple, pink, orange, grey.In addition, they found that if a language encodes fewer than eleven basic color categories, then there exist strict limitations on which categories it will encode. The distributional restrictions across languages can be summarized as a sequence of ``evolutionary stages'':
Berlin and Kay interpret this sequence as reflecting both distributional facts and a chronological order of lexical encoding of basic color terms in each language. This chronological order is in turn interpreted as a sequence of evolutionary stages. More recent work has revised the evolutionary sequence somewhat, e.g. [Kay et al. 1991]. Although Berlin and Kay speculate that there is a correlation between the general cultural and technological complexity of societies and the complexity of the color vocabulary (Figure ), they also admit that
... the particular order in which color foci universally became encoded in individual lexicons ... is a difficult problem which is only vaguely understood at this time. (p. 17)
One may wonder why English, of all languages, turns out to have the most (11) basic color categories, and thus sits at the top of the complexity hierarchy with respect to color vocabulary. Other than reflecting the ``general cultural and technological complexity of society'', there might be an unintended bias in the experiment or the interpretation of the data. For example, it might be difficult to recognize a basic color term in another language for which there is no corresponding basic color term in English. Nothing in the data seems to indicate that the 11 terms of English is a theoretical maximum. Indeed, Cairo [Cairo 1977] predicts some additional potential basic color terms which have not been lexicalized in English (yet?), e.g. something akin to ``khaki''.
Berlin and Kay seem to be concerned with issues of symbol grounding and embodiment too, although they don't use those terms of course (their work predates the symbol grounding and embodiment work considerably):
The study of the biological foundations of the most peculiarly and exclusively human set of behavioral abilities - language - is just beginning ..., but sufficient evidence has already accumulated to show that such connections must exist for the linguistic realms of syntax and phonology. The findings reported here concerning the universality and evolution of basic color lexicon sic suggest that such connections are also to be found in the realm of semantics. (p. 110)The research presented in this dissertation can be seen as a formal investigation into such connections.
Berlin and Kay's findings have been corroborated in numerous other studies since (see the bibliography in [Berlin \& Kay 1969]). Boynton, for example, reports that other research has shown that basic color terms are listed first and used more reliably, with greater consensus and shorter response times than any other color terms [Boynton 1990][p. 240]. They also translate easily between languages and are commonly learned by the age of five, at which time very few non-basic color terms are used. None of these special attributes apply to non-basic color terms. Although Berlin and Kay discount the possibility that the boundary effect is due to their experimental procedure, Kay and McDaniel [Kay \& McDaniel 1978] seem to be less convinced, as I will discuss below. Cairo [Cairo 1977] also has some doubts about their experimental procedure.