Presented
to the UCI/OSA Color/Vision Meeting, October 2001, but still work in progress
Classification
Weights and Internal Noise Level Estimation
Albert
J. Ahumada, Jr.
Abstract
For
the case of linear discrimination of
two stimuli in white Gaussian noise in the presence of internal noise, a
method is described for estimating linear classification weights from the sum
of noise images segregated by stimulus and response. The recommended method for
combining the two response image for the same stimulus is to difference the
average images. Weights are derived for combining images over stimuli and
observers. Methods for estimating the level of internal noise are described,
especially for the case of repeated presentations of the same noise sample. Simple tests for particular hypotheses about
the weights are shown based on observer agreement with a noiseless version of
the hypothesis.
List
of symbols in order of appearance
m the number of image components
S s s = 0 to 1, 1 by m signal vector
p s
s = 0 to 1, probability of signal S s
N 1 by m noise vector with components N i
, i = 1, m
T 1 by m trial stimulus vector with
components T i , i = 1, m
E[.] averaging or expectation operator
Var[.]
variance computing operator
s2 variance of N i
L 1 by m classification vector
b bias of linear classifier
R r r = 0 to 1, the event that the observer
made response r
||.|| vector length, ||L|| = (LL T)
½
Pr{} probability of enclosed event
psr
probability of response R r
given signal S s
F Z(.) cumulative standard normal distribution
function
d
0' sensitivity of linear
classifier
b 0
shifted bias of linear
classifier, b – L S 0 T
Z(.) functional inverse of the cumulative
standard normal distribution function, F Z -1(.)
L I
classification vector L of the
ideal observer
d
I' sensitivity of the ideal
observer
r the sampling efficiency of L, L L I T
b 0, H random shifted bias of the
human observer model
g2
variance of b H
a2
proportion of external noise in the
classification variable, 1/ (1+ g2 )
d H'
sensitivity of the human observer
model
b H
performance bias of the human
observer model
f Z(.) standard normal distribution density
function
A SR the averages of the noises N for the
trials segregated by signal S S and detection response R R
N SR a random noise N conditional on signal S S
and detection response R R
v SR
the expected value of N SR
when m = 1.
U an orthonormal m by m
transformation
I the m by m identity transformation
z i
a standard normal variable
M r r = 0 to 1, the event that the error free
model made response r
pMS0 the probability of event M 0 given that the signal was S s
b Ms the
signal dependent error free model criterion,
b 0 if S 0
, or b
0 - d 0' if S 1
Linear
classification of two signals in additive white Gaussian noise
The
signals and noise
S 0
and S 1 are 1 by m signal vectors, presented with probabilities
p 0 and p 1 = (1 - p 0) for
n trials. On each trial a random noise sample vector N is added to the
signal, so the trial stimulus
T =
S 0 + N
or
T =
S 1 + N.
N
is a 1 by m vector of independent samples of identically distributed Gaussian
variables N i with
E[N i]
= 0
and
Var[N i]
= E[(N i - E[N i])2] = s2,
where
E[.] is the averaging or expectation operator and Var[.] computes the
variance. Without loss of generality we
can assume that the noise has been normalized by its standard deviation so that
s2
= s = 1.
The
linear observer model
The
linear observer classifying the noisy signals as R 0 or R 1
would use a vector L
and
respond R 1 if and only if
L T
T > b,
where
b is a
response criterion and T indicates the matrix transpose operator, so
that
m
L T T = å L i T i
.
i
Also
without lack of generality we will assume that L has unit length (L and b have already been divided
by the length of L) so that
||L||
= (LL T) ½ = 1.
The
performance of an observer is characterized by the error rates
p01
= Pr{R 1 | S 0},
the
probability of signal S 0 being followed by response R 1
and
p 10
= Pr{R 0 | S 1},
the
probability of signal S 1 being followed by response R 0.
For
the linear classifier with vector L and criterion b,
L N
T
is
Gaussian with mean zero and unit variance.
Hence
p 01
= Pr{L (S 0 + N)
T > b } = 1- F Z( b – L S 0 T )
and
p 10
= Pr{L (S 1 + N) T < b } = F Z( b – L S 1
T ),
where
F Z(.) is the cumulative standard Gaussian distribution function.
If
we define sensitivity and bias parameters
d
0' = L (S 1 - S 0) T
and
b 0
= b – L S 0 T,
then
the error rates are
p 01
= 1 - F Z (b 0)
and
p 10
= F Z (b 0 –
d 0').
These
parameters can be found from the error rates as
b 0
= Z(1 - p 01)
and
d
0' = b 0
- Z(p 10),
where
Z(p)
=F Z -1(p)
is the functional inverse of the cumulative
standard normal distribution function F Z(.).
The
ideal observer
The
ideal observer classifying the noisy signals as R 0 or R 1
would use the linear classifier
L I
= (S 1 - S 0) / ||S 1 - S 0||.
For
the ideal observer,
d
I' = L I (S 1 - S 0) T = ||S 1 - S 0||
.
The efficiency of a non-ideal linear classifier is
given by
(d
0'/d I') 2 = (L L I T )
2 = r 2,
the
square of the correlation between the actual and the ideal classifier
coefficients, sometimes called the sampling efficiency.
A
noisy human observer model
Human
observers classify the same images different ways on different
presentations. This is modeled here by
assuming that the observer's criterion b 0, H (corresponding to b 0) is a normally distributed
random variable with
E[b 0, H] = b 0
and
Var[b 0, H] = g2
independent
of the noise N. It does not matter
whether the variability is added to the criterion or the classification
function value. Since the noiseless
criterion b 0 was defined as the criterion for a variable
with unit variance, the parameter 1+ g2 can be interpreted as the
total variance of the classification variable and
a2
= 1/ (1+ g2
)
as
the proportion of variance in the classification variable that is from the
external noise N. The error
probabilities are now
p 01 = Pr{L N T > b 0, H }
= Pr{(L N T - (b 0, H - b 0)) / (1+ g2) 1/2 > a b 0 }
= 1- F Z(a b 0)
and
p 10
= F Z(a (b 0 -
d 0')).
If
we define the observer's sensitivity and biases as
d H'
= a d
0'
and
b H
= a b
0,
then
we can compute these parameters from the human model observer error rates as
b H
= Z(1 - p 01)
and
d H'
= b
H - Z(p 10).
The
efficiency of the human observer model is
(d H'/d I')
2 = a2
r
2.
Since
r
2 £ 1, a lower bound for alpha is given by
a ³ d H'/d I',
and
an upper bound for g2 is given by
g2
£ (d I'/d H')
2 - 1.
These
bounds are reached when L is L I and the inefficiency is only
the result of the internal or criterion noise.
The
classification images
The
classification image components are the four average noises A SR,
the averages of the noises N for the trials segregated by signal S S and
detection response R R . We would like to find the mean and the
variance of the pixels of A SR as a function of the parameters
(S 1, S 0, L, b 0, and g or a).
The
single pixel case
In
the single pixel (m=1) case we are trying to find the mean of a single Gaussian
variable N that has been truncated by a random criterion b H. Let N SR be the truncated variable when s was the
stimulus and r was the response and
v SR = E[N SR].
Then
since ||L|| = 1, L = 1 so
L N
T < b 0, H
if
and only if
N
< b
0, H.
So
in the case that s = r = 0,
v 00
= E[N 00] = E[N | N < b H]
and
for the other cases
v 01
= E[N 01] = E[N | N > b H]
v 10
= E[N 10] = E[N | N < b H - d 0']
v 11
= E[N 11] = E[N | N > b H - d 0'] .
Single
pixel, no noise
Consider
now the single pixel case when there is no noise in the criterion (b 0, H = b 0).
b 0
E[N 00]
= E[N | N < b 0]
= ( ò z f Z(z) dz )/ F
Z(b 0)
= - f Z(b 0)/ F Z(b 0),
-¥
where
the integration of z exp(-z 2 /2) is enabled by the variable
substitution
t =
-z 2/2.
If b 0 is zero,
E[N 00]
= -(2/p) 1/2 = -0.7979,
the
expected value of the absolute value of a standard Gaussian.
Similarly,
¥
E[N 01]
= E[N | N > b 0]
= ( ò z f Z(z) dz ) / (1-F Z (b 0)) = f Z (b 0)/ (1-F Z (b 0)).
b 0
Single
pixel, noisy criterion
The
Gaussian criterion case can be reduced to the fixed criterion case by a change
of variables. Let z 0
be the standard Gaussian used to form the criterion b H, so that
b 0, H = g z 0 + b 0 .
Then
v 01
= E[N 01] = E[N | N > b] = E[ N | N < g z 0 + b 0].
If
we let
z 1
= a (N – g z 0)
and
z2
= a (g N + z0)
The
new variables z 1 and z 2 are independent (E[z 1
z 2] = 0), standard (E[z 1] = E[z 2]
= 0, Var[z 1] = Var[z 2] = 1) Gaussian
variables, such that
N =
a (z 1
+ g z 2)
and
N
< g z 0
+ b
0
if
and only if
z1
< ab
0.
So
v 00
= E[N 00] = E[N | N < g z 0 + b 0] = E[a (z 1 + g z 2) | z 1
< a b
0]
= a E[ z 1 | z 1 < ab 0]
= - a f Z (a b 0) / F Z (a b 0)
= - a f Z (b H) / F Z (b H)
= - a f Z (Z(p 00))/p 00 .
Expressed
in terms of p 00 the
effect of the criterion noise on the mean v 00 is just to
reduce v 00 by the
factor a.
Similarly,
v 01
= E[N 01] = E[N | N > b 0, H] = E[a (z 1 + g z 2) | z 1
> a b
0]
= a E[ z 1 | z 1 > a b 0]
= a f Z (a b 0) / (1 - F Z (a b 0))
= a f Z (b H) / (1 - F Z (b H))
= a f Z (Z(p 00)) / (1-p 00)
= a f Z(Z(p 01)) / p 01,
since
p 01 = 1 - p 00 and
Z(p) = -Z(1 - p).
If
false alarms are less frequent than correct rejections (p 01 <
p 00 ), then
|v 01|
> |v 00|,
a
larger absolute expected value on a false alarm than a correct rejection
trial.
The
signal case is the same with the criterion changed to b 0, H – d 0' so that
v 10 = E[N 10] = E[N
| N < b 0, H – d 0'] = -a f Z (Z(p 10))/p 10,
and
v 11
= E[N 11] = E[N | N > b 0, H – d 0'] = a f Z (Z(p 11))/p 11
.
Again,
if misses are less frequent than hits (p 10 < p 11
), then
|v 10|
> |v 11|,
a
larger absolute expected value on a miss than a hit trial. Regardless of the signal, the expected value
only depends on the response proportion and the criterion variability.
The
multiple pixel case
Another
independent variable transformation allows the single pixel case result to
solve the multiple pixel case. Let us first examine the means and variances of
the pixels of N 00. For
any vector L of unit length, it is possible to construct an orthonormal
transformation U whose first row is L, that is
U =
(L T L 1 T … L m-1 T)
T ,
such
that
U
T U = I,
where
I is the identity transformation (the transpose of U is its inverse).
When
this transformation is applied to N T we get a new noise U N T
whose distribution is the same as that of N T, but whose first pixel
is L N T. On an S0
trial, a noise vector N will be classified as N 00 if and only
if the first pixel of U N T,
z 1
= L N T < b 0, H.
The
rest of the pixels (z 2, ..., z m) of U N
T are independent standard Gaussian variables (with mean zero and
variance 1).
E[N 00
T] = E[ N T | L N T < b 0, H]
=
E[U T U N T | L N
T < b 0, H]
=
E[U T (z 1, z 2, ... , z m
)T | z 1 < b 0, H]
= U
T E[(z 1, z 2, ... , z m
)T | z 1 < b 0, H]
= U
T (v 00, 0, ... , 0 )T
= v 00 L
T.
Using
similar argument for the other cases, leads to the general result that
E[N SR
] = v SR L .
The
mean of a classified noise is proportional to the classifying vector L. The variance of individual elements of N SR
,
Var[N SR, i
] = L i2 Var[z 1 |
S S "S R "]+ (1-L i2)
,
which
is bounded by Var[z 1 | S S "S R
"] and one. Truncation of a
Gaussian can only decrease the variance, so
Var[N SR, i
] < 1.
Since
||L|| = 1, if there are very many significant weights in L they will have to be
small, so that
Var[N SR, i
] ~ 1.
Let
A SR be the average value of a number n SR of N SR. Any combination of the form
L est
= w 01 A 01 – w00 A 00
+ w 11 A 11 – w 10 A 10,
with
positive weights w SR will be an estimate of L times a positive
constant. To maximize the signal to
noise ratio of an estimate of L from a
sample with n SR approximately independent samples of each
type, the individual estimates N SR /v SR
should be weighted inversely by the square of their variances, which are
approximately 1/v SR2, so we should weight each N SR by v SR .
A
good un-normalized estimate of the classifier L is thus given by the v SR
weighted sums n SR A SR .
L G
= v 01 n 01 A 01 - v00
n00 A 00 + v 11 n 11
A 11 - v 10 n 10 A 10
.
If
we replace p SR in v SR by p SR
= n SR / n S, taking advantage of the fact that
f
Z (Z(p))= f Z (Z(1-p)),
we
obtain
L G
= a ( n 0
f Z (Z( p 01)) (A 01 - A 00 ) + n 1 f Z (Z(
p 10)) (A 11 - A 10) ).
The
more frequent stimulus should be given more weight and the more error prone
stimulus should be given more weight. If both stimuli are equally frequent and
the error rates are equal, the formula is proportional to
L ave
= A 01 – A 00
+ A 11 – A 10
,
the
combination rule originally used by Ahumada and Beard (). Another way of expressing the good weighting
scheme for combining average classification images A SR, O over
responses, stimuli, and observers, in that order that to combine over
responses, just take the difference,
L G,
S, O = A S1, O - A S0, O.
When
combining images for different stimuli, they need to be weighted by factors
involving the relative frequencies of the stimuli and the extremeness of the
error proportions for the stimuli.
L G,
O = w 1, O L G, 1, O + w 0, O
L G, 0, O.
where
w S,
O = n S, O exp( - Z( p SS, O)2/2)
.
If
estimates are to be combined over M observers, they need to be weighted by the
observer's proportion of variance due to the external noise variance, a2 = 1/(1+ g2), and the number of trials
run by the observer (which is included here in the n S, O).
L G
= a
12 L G, 1 + a 22 L G, 2 +
... + a M 2
L G, M .
Measuring
the internal noise
Response
agreement with the same external noise sample
Ahumada and Beard (1998 ARVO) presented a method for estimating the amount of internal noise relative to the external noise (alpha or gamma) using the agreement between observer responses on pairs of trials where the same external noise and the same stimulus were presented. They did not specify a linear observer model, but they effectively assume that the decision component based on the noise corresponding to our
z 1 = L N T
is a standard Gaussian and the rest of their assumptions correspond to our model with criterion variability.
The subscripts i and j are added to indicate two separate trials. The agreement probability for a particular signal and response condition with the same noise is denoted by
p ASR = Pr{R R, i , R R, j | S S, i , S S, j , N i = N j }.
To obtain this probability we will compute it conditional on the value of z 1 and then average over the possible values of z 1.
Conditional on the value of z 1, for S = 0, the probability of a response R = 0 is given by
Pr{R 0, i | S 0, i , z 1 } = Pr{ z 1 < b 0, H | z 1}
= Pr{ z 1 < g z 0 + b 0 | z 1}
= Pr{ (z 1 - b 0) / g < z 0 | z 1}
= F Z ((b 0 - z 1)/ g).
For another response to the same signal and the same noise the criterion variability would be independent and the probability of two responses of R = 0 would be given by
Pr{R 0, i , R 0, j | S 0, i , z 1 } = Pr{ z 1 < b 0, H | z 1}2
= F Z((b 0 - z 1)/ g) 2.
= F Z((b H / a - z 1) / g) 2.
= F Z((b H (1 + g2 ) 0.5 - z 1) / g) 2.
The probability of two correct R 0, i responses to the same noise is then
+¥
pA00 = ò F Z((b H (1 + g2 ) 0.5 - z 1) / g) 2 f Z(z 1) dz 1,
-¥
and
+¥
pA01 = ò (1- F Z((b H (1 + g2 )) 0.5 - z 1) / g) 2 f Z(z 1) dz 1.
-¥
The equations for arbitrary S can be written
+¥
pAS0 = ò F Z((Z(p S0)(1 + g2 ) 0.5 - z 1)
/ g) 2 f Z (z 1)
dz 1,
-¥
and
+¥
pAS1 = ò (1 - F Z((Z(p S0)(1 + g2 ) 0.5 - z 1) / g)) 2 f Z (z 1) dz 1.
-¥
Ahumada and Beard (1998) used a least squares search method to solve for g as a function of the probability of agreement for both responses ( pAS0 + pAS1 ) and p S0, but it is sufficient to solve for the probability of agreement for either one of them.
Estimating observer noise using a model classifier
Ahumada and Beard (1998) also derived an estimate for g based on the assumption that z 1 comes from a known model. This allows falsification of the model if the estimates of g are not consistent.
One estimate is based on the ratio of the performance of the model, d 0', to that of the observer, d H' . From above we have
d
H' = a d 0' ,
so
a = d H' / d 0' .
and
g = ((d 0' / d H')) 2
- 1) 0.5.
Another
estimate comes from the trial by trial agreement between the observer and the
model.
For a given value of z1 , the event M 0 that the model will respond that the signal was S 0 when it actually was S S will occur if z 1 is less than the criterion for the model,
b MS = Z(pMS0),
which is b 0 if s = 0 or b 0 - d 0' if s = 1. The probability that the observer and the model agree on this response will be
b MS
Pr{R 0, M 0} = ò F Z((b H (1 + g2 ) 0.5 - z 1) / g) 2 f Z (z 1) dz 1.
-¥
Note that the other possibilities can be computed from this and simpler ones, from the relationships
Pr{R 0, M 0}+ Pr{R 0, M 1} = Pr{R 0} ,
Pr{R 0, M 0}+ Pr{R 1, M 0} = Pr{M 0} ,
and
Pr{R 1, M 1}+ Pr{R 0, M 1} = Pr{M 1} .
Examples of Model Testing based on Noise Estimates
Now we show two tests of the hypothesis that the observer is a noisy version of a particular model. The particular model we will test is a linear classifier, whose classification image is similar to that of the observers. For the first test we compare predictions of a 2 from observer self agreement with predictions of a 2 from detection performance, where the proportion of observer variance from external noise a 2 is then given by the square of the ratio of the observer d H' to the model d 0',
a 2 = (d H' / d 0' ) 2 .

Error bars for the self-agreement estimates were computed by generating 95% confidence limits for the proportion of self-agreement and then computing the corresponding noise proportion. These proportions were based on small samples compared to those used to compute the d's. The results for the observers with the highest d's, PW and DF indicate that if they were noisy versions of the model, they should agree more with themselves considering how well they do, so they must be using some other measure. Observer BLB performance and agreement are consistent with her following the model.
Observer-Model Agreement Example
For a given level of internal noise, the formulas above predict the probability of agreement of the noisy model (observer) and the noise free model. The same model as above was hypothesized and the level of the internal noise was based on the ratio of the human observer and the model's d's. An unbiased criterion was used for the model's responses,
b 0 = 0.5 d 0'.
The figure plots the ratio of the amount the observed agreement exceeds the amount of agreement expected for independent proportions to the amount the predicted agreement exceeds the independence prediction.
(

That is, the observer-model agreement proportions are converted to the fraction of the distance from the independence prediction to the model prediction. The error bars are 95% confidence limits. The intervals for PW and DF are bounded away from 1, so they are not consistent with the model by this measure either, while BLB remains consistent.
References
C.K. Abbey, M.P. Eckstein, F.O. Bochud (1999) Estimation of human-observer templates for 2 alternative forced choice tasks, Proc. SPIE (E.A. Krupinski, Ed.) 3663:284-295.
C.K. Abbey, M.P. Eckstein (2000) Estimates of human-observer templates for simple detection tasks in correlated noise, Proc. SPIE (E.A. Krupinski, Ed.) 3981, 70-77.
C.K. Abbey, M.P. Eckstein (2001a) Maximum-likelihood and maximum a-posteriori estimates of human-observer templates, Proc. SPIE (E.A. Krupinski and D.P. Chakraborty, Eds.) 4325.
C.K. Abbey, M.P. Eckstein (2001b) Theory for estimating human-observer templates in two-alternative forced-choice experiments, Proc. 17th International Conference on Inferential Processes in Medical Imaging (M.F. Insana and R. Leahy, Eds.) Springer-Verlag, Berlin.
A.J. Ahumada, Jr. (1996) Perceptual classification images from vernier acuity masked by noise, Perception 26 (ECVP Suppl.), 18 (abstract). HTML
A.J. Ahumada, Jr., B.L. Beard (1998) Response classification images in vernier acuity, Investigative Ophthalmology and Visual Science 39 (4, ARVO Suppl.) S1109 (abstract). HTML
A.J. Ahumada, Jr., B.L. Beard (1999) Response Correlation Functions from Multiple Sensor Models, Psychonomics Society Annual Meeting, Los Angeles, CA. HTML
Albert J. Ahumada Jr., Bettina L. Beard (1999) Classification Images for Detection, Investigative Ophthalmology and Visual Science 40 (4, ARVO Suppl.), S572 (abstract). HTML
A.J. Ahumada, Jr., J. Lovell (1971) Stimulus features in signal detection, Journal of the Acoustical Society of America 49, 1751-1756.
A.J. Ahumada, Jr., R. Marken, A. Sandusky (1975) Time and frequency analyses of auditory signal detection, Journal of the Acoustical Society of America 57, 385-390.
E. Barth, B.L. Beard, A.J. Ahumada, Jr. (1999) Nonlinear features in vernier acuity, in Human Vision and Electronic Imaging III, ed. B.E. Rogowitz and T.N. Pappas, Proc. Vol. 3644, pp.88-96, SPIE, San Jose, CA. HTML
B.L. Beard, A.J. Ahumada, Jr. (1997) Relevant image features for vernier acuity, Perception 26, 38 (abstract).
B.L. Beard, A.J. Ahumada, Jr. (1998) Technique to extract relevant image features for visual tasks, in Human Vision and Electronic Imaging III, ed. B.E. Rogowitz and T.N. Pappas, Proc. Vol., 3299, pp.79-85, SPIE, San Jose, CA. HTML
B.L. Beard, A.J. Ahumada (2000) Response Classification Images for Parafoveal Vernier Acuity, Investigative Ophthalmology and Visual Science 41 (4, ARVO Suppl.), S804 (abstract). HTML
D.C. Edwards, M.A. Kupinski, R.M. Nishikawa, C.E. Metz (2000) Estimation of linear observer templates in the presence of multi-peaked gaussian noise through 2AFC experiments, Proc. SPIE (E.A. Krupinski, Ed.) 2981, 86-96.
J. Gold, P.J. Bennett, and A.B. Sekuler (1999) Identification of band-pass filtered letters and faces by human and ideal observers, Vision Research 39(21), 3537-3560.
J.M. Gold, R.F. Murray, P.J. Bennett, and A.B. Sekuler (2000) Deriving behavioral receptive fields for visually completed contours, Current Biology 10(11), 663-666.
J.M. Gold, R.F. Murray, A.B. Sekuler, P.J. Bennett (2000) Snapshots of visual completion, Investigative Ophthalmology and Visual Science 41(4), S317.
K. Knoblauch, J.P. Thomas, M. D'Zmura (1999) Feedback temporal frequency and stimulus classification, Investigative Ophthalmology and Visual Science 40(4), 4171.
K. Knoblauch, R. Yssaad-Fesselier (2000) Stimulus features used during learning of an unknown visual signal, Investigative Ophthalmology and Visual Science 41(4), S46.
D.G. Pelli, B. Farell (1999) Why use noise? Journal of the Optical Society of America A 16(3), 647-653.
D.L. Ringach (1998) Tuning of orientation detectors in human vision, Vision Research 38(7), 963-972.
J.A.
Solomon (2000) A picture of orientation discrimination, Investigative
Ophthalmology and Visual Science 41(4 ARVO Suppl.), 4241 B188 S1 (abstract).
J.A.
Solomon (2000) Channel selection with non-white-noise masks, Journal of the
Optical Society of America A, 17(6), 986-993.
J.A.
Solomon, M.J. Morgan (1999) Reverse correlation reveals psychophysical
receptive fields, Investigative Ophthalmology and Visual Science 40(4 ARVO
Suppl.), S572 (abstract).
A.B. Watson, R. Rosenholtz (1997) A Rorschach test for visual classification strategies, Investigative Ophthalmology and Visual Science 38(4 ARVO Suppl.), S1 (abstract).