Previous: The Normalized Gaussian Category Model Up: From Color Space to Color Names Next: Fitting the Model to the Data
Before we can apply the category model described in the previous section to Berlin and Kay's color naming data, we need to quantify the stimulus set they used for their experiments. As described in Section , they used a set of 329 Munsell color chips consisting of 40 equally spaced hues at 8 equally spaced brightness (Value) levels each, all at maximum saturation (Chroma), and a gray scale consisting of nine equally spaced brightness (Value) steps. They asked subjects to point out both the extent and the foci of the basic color categories of their native language on the array of color chips, viewed under a light source approximating the CIE standard source A (Figure ).
In the following sections I will always use the CIE XYZ space, the CIE L*a*b* space, and the NPP color space for comparative purposes. The XYZ space is in a sense ``the mother of all RGB spaces'', since the various RGB spaces are simple linear transforms of it. It is generally accepted as an approximation to the spectral sensitivities of the human cone photoreceptors, and thus a ``primary'' representation, as close to the sensor as we can hope to get. The L*a*b* space is defined by the CIE to be perceptually equidistant across (most of) the color gamut, and is often used as a reference in color work. It is a non-linear transform of the XYZ space. It also performs very well for our purpose, as shown below. The NPP space is of course the one that we derived from neurophysiological measurements in Chapter , and is also a non-linear transform of the XYZ space. We use this space to attempt to link the category model to the underlying neurophysiology.
The conversion from Munsell coordinates, in which the stimulus set is defined, to CIE XYZ coordinates, which is the basis for the color spaces we are interested in, is non-trivial, and there is no simple mathematical conversion possible. Fortunately, the Munsell set of standard color reference chips, from which the Berlin and Kay set is chosen, has been measured spectrophotometrically and converted to CIE xyY coordinates in the past [Newhall et al. 1943]. After conversion from CIE xyY, we obtain unnormalized CIE XYZ coordinates for each of the stimuli contained in the Berlin and Kay set. To normalize the coordinates to the unit cube, with the gray axis going from to , I used Von Kries adaptation:
where is the vector representing the unnormalized stimulus values, and is the vector representing the unnormalized white reference stimulus values. Although Berlin and Kay's gray axis only runs from Munsell Value 1 to 9, I used the coordinates of Munsell Value 10 as white reference, since that is the maximum Munsell Value defined, i.e. the ``whitest white'' available. Although Von Kries adaptation cannot theoretically be shown to exactly undo all the effects of a non-flat spectrum light source, it works well enough in practice to be allowable, especially with a light source that is as close to a flat spectrum as the CIE C source used in these measurements [Wyszecki \& Stiles 1982]. The obtained stimulus set is shown in Figure , represented in CIE XYZ, CIE L*a*b*, and NPP coordinates.
Some interesting things to note about this figure are:
In particular, note the irregularities in the lower blue region. I have added Munsell Values 0 and 10 to the gray axis, for a total of 11 stimuli.
Combining the information from Figure with the derived color space coordinates of the stimuli, we can now describe the boundary of a Basic Color category as a polygon passing through the coordinates of each of the boundary stimuli, and the focus of a Basic Color category as the center of mass of the points indicated as focal points. Figure shows the boundaries and foci obtained in this way.
Note that the shape and size of the polygons is different in different color spaces, and that in general a straight line on the Berlin and Kay chart does not necessarily translate into a straight line in the color space, since the stimuli are lying on or near a curved surface.