Symbol discriminability models
for improved flight displays
Albert J. Ahumadaa, Maite
Trujillo San-Martinb and Jennifer Gillec
a NASA Ames Research Center. MS 262-2, Moffett Field, CA
94035-1000
b National Research Council, NASA Ames Research Center, CA
c University of California, Santa
Cruz, NASA Ames Research Center, CA
Aviation display system
designers and evaluators need to know how discriminable displayed symbols will
be over a wide range of conditions to assess the adequacy and effectiveness of
flight display systems. If
flight display symbols are to be safely recognized by pilots, it is necessary
that they can be easily discriminated from each other. Sometimes psychophysical
measurements can answer this question, but computational modeling may be
required to assess the numerous conditions and even help design the empirical
experiments that may be needed. Here we present an image discrimination model that
includes position compensation. The model takes as input the luminance values
for the pixels of two symbol images, the effective viewing distance, and gives
as output the discriminability in just-noticeable-differences (d’) and the x
and y offset in pixels needed to minimize the discriminability. The model
predictions are shown to be a useful upper bound for human symbol
identification performance.
Keywords: Human vision models, cockpit display, symbology, image
discrimination.
1. INTRODUCTION
Display technology advances afford the capacity
to display traffic information in the cockpit using traffic symbols varying in
shape and color to provide crucial information during flight operations. In this paper we present measurements of the
ability of observers to identify such symbols and present a discriminability
metric for the evaluation of the discriminability of symbol pairs. The goal of the metric is to provide
objective measures of discriminability in support of design and manufacturing
requirements for improved cockpit and air traffic management. The metric is based on our earlier work on
image discrimination models for detection and discrimination1-5 and some previous work
on symbol discrimination.6, 7
Discriminability is only one component of what
Yey and Chandra8
call distinctiveness, the degree to which the symbol can be identified by
itself. Symbol identification involves
many cognitive processes, such as feature learning, feature extraction,
attention, and memory effects. Bruner9 gives a good overview
of these higher-level processes that can affect symbol categorization. Here we are only trying to develop a simple
symbol discriminability metric that only depends mainly on low-level visual
processes. If such a metric predicts
that symbols will be confused, they should indeed be confused, but if it says
that they are discriminable, they may still be classified as the same
symbol. We present a tool that could be
used to measure the discriminability of pairs of stimuli. All pairs in a set of potential symbols
would need to be compared to ensure discriminability, but discriminability
would not ensure accurate categorization.
For example, in the color domain, it is well known that many colors are
discriminable from each other, but that relatively few color categories can be
accurately recognized by naïve observers. 10, 11
Our goal here is to extend an image discrimination
model so that it can serve as a symbol discriminability metric. Image
discrimination models take two images as input and predict their
discriminability in just-noticeable-differences (JNDs) for a particular viewing
condition. The luminance-only (achromatic), single-channel versions of these
models have been successful at predicting the detectability of simple and
complex targets.1-5 Watson
has also extended them to predict letter identification.6, 7 Two modifications were needed. One
modification in these applications is the inclusion of image translation. A letter can be recognized independent of
its position in the image. The other
modification is the addition of a response prediction mechanism. Watson added noise in the image domain and
the response was selected by finding the best fitting response template. Although a response model may be necessary
to simulate a pilot performance in an overall system evaluation, this part of
the model is computationally intensive and not necessary if the goal is to only
to ensure that all the symbols are discriminable from each other. The metric we propose is an image
discrimination model that takes two symbols and computes the smallest JND
between them over a range of symbol positions and symbol sizes. Because some of the symbols are rotated to
indicate heading, the model should, in general, take rotation into account
also. Here we have only compared the
zero-heading-angle version of the symbols.
2. EXPERIMENTAL MEASUREMENT OF
SYMBOL IDENTIFICATION
Two sets of symbol
identification data will be presented. The first set was collected for the FAA
by Zuschlag,12 the second set is a partial replication of that
study. The methods for the first study
will be described first, followed by a description of the methodological
differences in the replication.
2.1 Experimental Methods
2.1.1 Stimuli
The 19 symbol images
appear in Figure 2.1.1. Their colors
are described in Table 2.1.1.
19
18
5
6
7
8
9
14
13
1
2
3
4
.
15
16
17
10
11
12
Figure 2.1.1:
The Volpe experiment symbols.
|
Color |
Symbol # |
R |
G |
B |
|
red |
9 |
254 |
0 |
0 |
|
pink |
14, 19 |
252 |
204 |
155 |
|
cyan |
1, 2, 5, 6 |
0 |
254 |
252 |
|
green |
10, 11, 15, 16 |
0 |
255 |
0 |
|
yellow |
3, 4, 7, 8, 12, 13, 17, 18 |
255 |
255 |
0 |
|
black |
Background |
0 |
0 |
0 |
Table 2.1.1: Symbol set
(color, stimuli index, RGB values for the color at its peak level)
Nine of the symbols
(1-4, 10-14) are directional. The
directional symbols were presented in one of six orientations on the flight
display, but the orientation on the response display was always vertical.
2.2.2 Observers
Ten pilot observers were
recruited at an airport. All had normal
color vision and adequate visual acuity.
2.2.3 Procedure
On each trial, the
observer was presented with one of the nineteen symbols in isolation on an Avidyne
flight situation display (5RR-MFC-Series). The displayed image had a size of 4
in by 3 in, a resolution of 320 by 234 pixels, and thus a display resolution of
80 pixels per in. Four different
viewing distances were used: 22, 44, 66, and 88 inches. At the 22 inch viewing distance the screen
thus displayed 30.7 pixels per degree.
The image was presented for 0.25 sec, preceded by a 1.5 sec fixation
cross (horizontal and vertical white (255, 255, 255) 400 by 2 pixel lines) and
followed by a crossed-hatch pattern.
The observer used a mouse to respond by clicking on one of the 19
symbols presented continuously on a separate laptop display i.e.: Panasonic
Toughbook (CF-37). The symbols were displayed on the laptop in the arrangement
of Figure 2.1.1. Each observer responded to six replications of each stimulus,
(6 x 19 x 4 = 456 trials). The oriented
symbols were presented on the flight display in one of six orientations (45,
90, 135, 180, 225, or 315 degrees) selected at random, but the orientation on the
response display was always vertical. Table 2.2.3 shows the sequence of the 4
distance conditions for each of the 10 observers.
Distance\ Observer .
(in) 1
2 3 4 5 6
7 8 9 10
22 1
1 3 4 4 1
2 3 3 4
44 2
2 4 1 3 4
1 2 1 2
66 3
3 1 2 2 3
4 1 4 1
88 4
4 2 3 1 2
3 4 2 3
Table 2.2.3: Sequence of
distance conditions for each observer.
2.2.4 Replication method
variations
One observer responded
to 10 repetitions of each symbol on 7 different days at a distance of 110
inches, which gave the same pixel resolution in pixels per degree as the 88
inch distance in the original experiment. The primary display was a Sony
Trinitron (GDM-FW900, RGB line pitch = 0.23 mm). The display controller resolution was set to 1024 by 768 pixels
and the pixel size was set to 64 pixels per inch in both directions. The observer’s mouse controlled a cursor on
another Sony Trinitron (CPD-200SX, Dot pitch = 0.25 mm). The symbol exposure duration was 0.5 sec,
instead of 0.25 sec. Symbols were
chosen randomly with replacement from the 190 stimuli presented on each day.
2.3 Results
Table 2.3.1 shows the
number of valid trials per observer for each of the distances in the original
experiment.
Distance\ Observer .
..(in) 1 2 3
4 5 6 7 8
9 10
22 114 114 114 114 114 114 113 114 114 114
...44 114 114 114 114 113 114 114 114 114 106
...66 114 114 114 114 114 114 114 114 113 114
...88 114 114 112 114 113 114 114 114 114 112
Table 2.3.1: Number of valid trials per observer for each
distance.
Table 2.3.2 shows the
number of errors made by each observer in each of the distance conditions. The
last line shows the number of color errors at the 88 in distance.
Distance\ Observer .
(in) 1 2
3 4 5 6 7
8 9 10
22 4
1 6 11 7 1
3 0 0 3
44 4
3 3 21 10 2
8 0 0 2
66 19
11 28 31 58 45
4 25 6 27
88 42
33 31 56 86 70
46 47 33 51
Col88 8 1 0
6 36 8 7 4
2 6
Table 2.3.2: Number of errors made by each observer at
each distance
In this data set, only
the 88 in distance consistently resulted in errors for all observers. The 66
inch distance errors correlated strongly with the temporal position of that
condition in the experiment (n=10, Spearman’s r = -0.56). Observer 5 was the only one to have the 88
in distance condition first and made a much higher number of symbol and color
confusions in that condition than any other observer. We now restrict our attention to the confusion matrix for the 88
in distance and the best 9 observers (Table 2.3.3). Rows represent symbols presented; columns represent responses
made.
Red| Pink| Cyan |
Green | Yellow |
9 14 19 1 2
5 6 10 11 15 16 4 8
12 17 18 7 13 3
9
59 0
0 0 0 0 0
0 0 0 0 0 0 0
0 0 0 0 0
14 0 39 9 2
0 3 0 1 0
0 0 0 0 0
0 0 2 1 0
19 0 31 19 0 0
4 1 0 0 0
0 0 0 0 0 1 0
0 0
1 0
0 0 22 13 1
0 6 3 1 2
0 0 0 0 0
0 0 0
2 0
0 0 5 35 0 5
2 8 0 1 0 0 0
0 0 0 0 0
5 0
0 0 2 0 31 10 0
0 4 6 0 0
0 0 0 0 0 0
6 0
0 0 0 8 2 30
1 0 0 4 0
0 0 0 0 0
0 0
10 0 0 0
0 1 0 0 7 39
0 0 0 0 0
0 0 0 0 0
11 0 0 0
0 0 0 0 6 41
1 0 0 0 0
0 0 0 0 0
15 0 0 0
0 0 0 1 1 3
13 39 0 0 0 0
0 0 0 0
16 0 0 0
0 0 0 0 2
3 6 35 0 0 1 1 0
0 0 0
4 0
0 0 0 0 0
0 0 0 0 0 54
2 0 0 0 0
0 0
8 0
0 0 0 0 0
0 0 0 0 0 1
46 0
0 0 0 0 0
12 0 0 0
0 0 0 0 1
0 0 0 0 0 38
0 1 0 2 11
17 0 0 0
0 0 0 0 0
0 0 0 0 0 10 29
1 6 11 3
18 0 0 0
0 0 0 0 0
0 0 0 0 0
1 1 14 5 33 4
7 0
0 0 0 0 0
0 0 0 0 0 0 0
0 2 2 28 13 12
13 0 1 0
0 0 0 0 0
0 0 1 0 0
4 2 2 12 34 13
3 0
0 0 0 0 0
0 0 0 0 0 0 0
4 0 0 1 4 39
Table 2.3.3: Confusion
matrix for the 88 in distance and the best 9 observers. Each row has, for a symbol presented, the
total number of times each of the 19 possible responses was given.
The information transmitted10, 11, 13
in this confusion matrix is 2.2 bits per symbol, the information transmission
rate for 7 errorless symbols (log2 7). The information in a discrete probability or frequency
distribution with probabilities pi, i = 1, n, is given by Eq. 2.3.1.
H(p) = – Si log2 pi
(2.3.1)
Associated with a
confusion table are three distributions, the proportion of times each symbol
was presented, the proportion of times each response was made, and the
proportion of times each symbol presentation and response pair occurred. If we call the information in these
distributions H(s), H(r), and H(s,r), then the information transmitted in the
confusion matrix is given by Eq. 2.3.2.
T(s,
r) = H(s) + H( r) – H(s, r) (2.3.2)
The first row and column
show that the red square (9) was not confused with any other symbol. The large yellow circle (8) was confused 3
times with the same circle with the orientation line (4), but only rarely and
the two were never confused with any other symbol. This 2 by 2 confusion matrix can be converted to a d’ value (Eq.
2.3.3).
d’ = z(Pr(8|8)) – z(Pr(8|4) =
z(46/47) – z(2/56) = 2.03 – (–1.80) = 3.83 (2.3.3)
Although this 2 by 2
matrix is roughly symmetric, many of the confusions were not. The only two pink
symbols, the plane (14) and the cross (19) were usually confused with each
other, but both were more frequently reported to be the plane symbol. The same tendency was shown when they were
changed to yellow and surrounded by a circle (13 and 18). Another consistent response bias is that
when an open figure and its filled version are confused, the response tends to
be ‘filled’. This is illustrated by the
confusions between the diamonds (5 and 6), the triangles (1 and 2), the
outlined diamonds (15 and 16), and the outlined triangles (10 and 11). The confusions that suggest the need for
size scaling are those in which a shape was given the response of its outlined
version. Even though the color of the
outlined versions is green rather than cyan, the triangles and diamonds alone
were sometimes responded to as their outlined version.
Table 2.3.4 shows the
confusion matrix for the second data set.
For this matrix the information transmitted per symbol is 3.7 bits,
corresponding to 13 errorless symbols.
The data for the second set are also organized by color, and in this
data set there were no confusions between different colors.
Red| Pink | Cyan |
Green | Yellow |
9 14 19 1
2 5 6 10 11 15 16 4 8 12 17 18
7 13 3
9
70 0
0 0 0 0 0
0 0 0 0 0 0 0
0 0 0 0 0
14 0 69 1 0
0 0 0 0 0
0 0 0 0 0
0 0 0 0 0
19 0 18 52 0 0
0 0 0 0 0
0 0 0 0 0
0 0 0 0
1 0
0 0 64 6 0 0
0 0 0 0 0 0 0
0 0 0 0 0
2 0
0 0 1 69 0 0
0 0 0 0 0 0 0
0 0 0 0 0
5 0 0 0
5 0 57 8 0 0
0 0 0 0 0 0 0
0 0 0
6 0
0 0 0 11 0 59 0
0 0 0 0 0
0 0 0 0 0 0
10 0 0
0 0 0 0 0 52 18
0 0 0 0 0
0 0 0 0 0
11 0 0 0
0 0 0 0 13 57 0
0 0 0 0 0
0 0 0 0
15 0 0 0
0 0 0 0 0 0
45 25 0 0 0 0
0 0 0 0
16 0 0 0
0 0 0 0 0
0 9 61 0 0 0
0 0 0 0 0
4 0
0 0 0 0 0
0 0 0 0 0 69
1 0 0 0 0
0 0
8 0
0 0 0 0 0
0 0 0 0 0 0
70 0
0 0 0 0 0
12 0 0 0
0 0 0 0 0
0 0 0 0 0 70
0 0 0 0 0
17 0 0 0
0 0 0 0 0
0 0 0 0 0 0
70 0
0 0 0
18 0 0 0
0 0 0 0 0
0 0 0 0 0
0 1 45 19 3 2
7 0
0 0 0 0 0
0 0 0 0 0 0 0
0 0 1 66 0 3
13 0 0 0
0 0 0 0 0
0 0 0 0 0
0 0 1 8 19 42
3 0
0 0 0 0 0
0 0 0 0 0 0 0
0 0 0 0 2 68
Table 2.3.4: Replication
Data Confusion matrix
3. DISCRIMINATION MODELS
Here we describe in detail the computations performed by the
modeling program and the parameter values.
The model is available at the first author’s NASA website14.
The model program takes as input a pair of images. Both images are converted from color to luminance. Next they are converted to visible contrast
images, taking into account the contrast sensitivity of the visual system and
contrast-gain masking. The translation
that minimizes the Euclidean distance between the visible contrast images is
then found, and that minimum distance is scaled into a number of JNDs (d’) by
the assumption that 1 JND is 10-6 deg2 sec of contrast
energy (0 dBB). 3, 5
3.1 Color to luminance
conversion
The conversion from color RGB values (0-255) to luminance
(cd/m2) was done using the simple color gamma function shown in Eq.
3.1.1.
L = L0 + LR
(R/255)g
+ LG (G/255) g + LB (R/255) g) (3.1.1)
Table 3.1.1 gives the values of these parameters for the two
primary displays.
Display L0 LR LG LB g
Avidyne
7.1 26.7 85.9 14.0 2.71
Sony 2.8 15.6 54.0 6.8 2.53
Table 3.1.1: Gamma function parameters
for the two primary displays.
After conversion to luminance, the images were
extended from their original sizes of 25 by 25 or 32 by 32 pixels to 128 by 128
pixels by imbedding them in a 128 by 128 matrix of L0 values. This
was done to minimize wraparound effects from the FFT-based filtering. Next the
images were pixel-replicated by a factor of 4 so that the position optimization
would occur at quarter-pixel resolution.
3.2 Visible contrast
calculation
The luminance image was first blurred by a
Gaussian with a standard deviation of 1.414/60 deg. Next a background image B was computed by mixing the image L with
a proportion pL = 0.8 of the background,
B = pL
L + (1–pL) L0 (3.2.1)
Next the background image is blurred by a
Gaussian with a standard deviation of 16 times 1.414/60 deg. Then a contrast image C (Eq. 3.2.2) is
formed by point-by-point division of these two images.
C
= L / B – 1
(3.2.2)
A contrast-energy image E is then formed by
point-by-point squaring of C, as shown in Eq. 3.2.3.
E = C2
(3.2.3)
This image is then blurred with the same
Gaussian used to blur B. And finally the
blurred contrast image E is used to adjust the contrast by the contrast gain
function applied point-by-point, giving the visible contrast image V (Eq.
3.2.4).
V = C
/ (1 + k E) 0.5 (3.2.4)
The value of k was set to 400 corresponding to a
masking RMS threshold contrast of 5%.
3.3 JND calculation
The minimum distance between two images can be found quickly
by finding the maximum of the cross correlation between the two images in the
Fourier domain. The minimum distance between two visible contrast images is
then converted to JNDs in d’ units as follows (Eq. 3.3.1).
d’
= S |V1 – V2|
(3.3.1)
Where the contrast sensitivity S = (t / 10-6 deg2
sec) 0.5. The variable t is
the duration of the symbols, 0.25 sec in
the original experiment and 0.5 sec in the replication, giving values of S of
500 and 707, respectively.
3.4 Model predictions
compared with results
Since the current model does not include color differences,
the observer data to be predicted will be limited to the confusions among pairs
of the same color. There are only a few same-color pairs in which the responses
to the pair are only limited to the pair, allowing a d’ value to be computed
using the formula above. For all the pairs, it is possible to compute the
information transmitted from that pair of symbols to all responses. Table 3.4.1
shows the information transmitted for all the same color pairs for the original
experiment. Table 3.4.2 shows the same table for the replication.
pink
19
14 0.10
cyan
2 5 6
1 0.25 0.74 0.65
2 0.78 0.43
5 0.45
green
11 15 16
10 0.22 0.81 0.76
11 0.76 0.72
15 0.05
yellow
8 12 17
18 7 13
3
4 0.81
1.00 1.00 1.00
1.00 1.00 1.00
8 1.00
1.00 1.00 1.00
1.00 1.00
12 0.51
0.68 0.69 0.52
0.38
17 0.41 0.39
0.30 0.58
18 0.27
0.13 0.57
7 0.14 0.43
13 0.35
Table 3.4.1: Information
transmitted in bits per symbol for all same-color pairs in the original
experiment.
pink
19
14 0.49
cyan
2 5 6
1 0.73 0.82 0.89
2 0.97 0.67
5 0.75
green
11 15 16
10 0.24 1.00 1.00
11 1.00 1.00
15 0.22
yellow
8 12 17
18 7 13
3
4 0.95
1.00 1.00 1.00
1.00 1.00 1.00
8 1.00
1.00 1.00 1.00
1.00 1.00
12 1.00
1.00 1.00 1.00
1.00
17 0.95 1.00
1.00 1.00
18 0.45 0.61