LETTER

Communicated by Laurence T. Maloney

A Model for Perceptual Averaging and Stochastic Bistable Behavior and the Role of Voluntary Control Ansgar R. Koene [email protected] Department of Psychology, University College London, London, WC1H 0AP, U.K.

We combine population coding, winner-take-all competition, and differentiated inhibitory feedback to model the process by which information from different, continuously variable signals is integrated for perceptual awareness. We focus on “slant rivalry,” where binocular disparity is in conflict with monocular perspective in specifying surface slant. Using a robust single parameter set, our model successfully replicates three key experimental results: (1) transition from signal averaging to bistability with increasing signal conflict, (2) change in perceptual reversal rates as a function of signal conflict, and (3) a shift in the distribution of percept durations through voluntary control exertion. Voluntary control is implemented through the use of a single top-down bias input. The transition from signal averaging to bistability arises as a natural consequence of combining population coding and wide receptive fields, common to higher cortical areas. The model architecture does not contain any assumption that would limit it to this particular example of stimulus rivalry. An emergent physiological interpretation is that differentiated inhibitory feedback may play an important role for increasing percept stability without reducing sensitivity to large stimulus changes, which for bistable conditions leads to increased alternation rate as a function of signal conflict. 1 Introduction Many perceptual aspects of our environment present themselves to the observer through multiple sensory channels. The slant in depth of a surface, for instance, is provided through binocular disparity, but also through perspective cues like foreshortening. To construct a coherent internal representation of a stimulus, the brain must somehow integrate the different sensory signals. One way to investigate this integration process is to subject the visual system to conflicting signals from different sensory channels. This can instigate bistability in perception; an example is binocular rivalry, which results when the signals from the two eyes provide incompatible information, leading to a breakdown of binocular signal integration. Quite a few models Neural Computation 18, 3069–3096 (2006)

 C 2006 Massachusetts Institute of Technology

3070

A. Koene

have been put forward to explain bistability in perception on the basis of competition between either the sensory signals or the alternative percepts (Vickers, 1972; Sugie, 1982; Matsuoka, 1984; Kawamoto & Anderson, 1985; Lehky, 1988; Blake, 1989; Mueller & Blake, 1989; Mueller, 1990; Ditzinger & Haken, 1989; Lehky & Blake, 1991; Lumer, 1998; Dayan, 1998; Kalarickal & Marshall, 2000; Laing & Chow, 2002; Merk & Schnakenberg, 2002; Stollenwerk & Bode, 2003; Wilson, 2003; Zhou, Gao, White, & Yao, 2004). How sensory cues are successfully combined in the case of stable perception, however, has not been addressed in these models. The combination of sensory signals into a unified stable percept has generally been studied as a separate issue (e.g., binocular slant perception, as opposed to binocular slant rivalry), resulting in models that do not consider the properties of ambiguous perception. Key information for our understanding of perception that has therefore been missing is an understanding of the transition from signal combination to signal rivalry. Perceived surface slant is a measure that is particularly suitable to provide such information because it consists of both a regime with slant integration and a regime with slant rivalry. Both regimes have recently been experimentally investigated by van Ee and colleagues. To study signal integration, they measured how much depth is perceived when subjects view a slanted plane in which binocular disparity and monocular perspective provide different slant information for slant about a vertical axis (van Ee, van Dam, & Erkelens, 2002) about a horizontal axis (van Dam & van Ee, 2005) and for real planes slanted in depth (van Ee, Krumina, Pont, & van der Ven, 2005). Figure 1 illustrates the stimulus and the percepts. Using a metrical experimental paradigm, it was found that for small cue conflict, perceived slant was a weighted average of the perspective and disparity-specified slants. When the cue conflict was large, however, observers experienced bistable slant rivalry. Slant-rivalry appeared to have dynamics and stochastic bistable properties that are similar to other rivalry stimuli (van Ee, 2005). In a subsequent fMRI study, Brouwer, Tong, Schwarzbach, and van Ee (2004) revealed both systematic increases in activity in intraparietal sulcus and lateral occipital complex, as well as increasing alternation rates at higher incongruencies. Eye movements, including microsaccades, were shown to be not essential for the perceptual alternation process, suggesting that slant rivalry is a central process (van Dam & van Ee, 2005). Extant models of perceptual bistability commonly assume a bottom-up binary process in which the percept results from a competition between two discrete alternatives. The most common architecture is depicted in Figure 2. In the slant rivalry experiments, however, the percept can assume any of a complete range of possible slants. This sensory signal conflict-dependent transition from averaging to bistability found in slant rivalry has not been explicitly addressed by traditional competition-based bistability models. In addition, this transition in combination with voluntary control has not been taken into account.

Perceptual Averaging and Bistability Model

3071

Right side far

Left eye

Right eye Left side far

Figure 1: The slant rivalry stimulus. In the anaglyph, stereogram monocular perspective and binocular disparity specify conflicting surface slants about the vertical axis. When the left eye views the green (shown in light gray) image and the right eye views the red (shown in dark gray) image, two competing percepts can be experienced. In the perspective-dominated percept, the grid recedes in depth with its right side farther away (it is perceived as a slanted rectangle). In the disparity-dominated percept, the left side of the grid is farther away (it is perceived as a trapezoid with the near edge shorter than the far edge). Each of the two percepts can be selected and maintained at will in a relatively controlled fashion. (More demonstrations of slant rivalry can be found online at www.phys.uu.nl/∼vanee/.)

Voluntary control exertion by the subject affects the dynamics of perceptual alternations in a variety of perceptual situations (e.g., Lack, 1978; Goryo, Robinson, & Wilson, 1984; Tsal, 1984; Schulman, 1992; Gomez, Argandona, Solier, Angulo, & Vazquez, 1995; Hol, Koene, & van Ee, 2003; Toppino, 2003; Meng & Tong, 2004; Chong, Tadin, & Blake, 2005). Van Ee, van Dam, & Brouwer (2005) examined to what degree the perceptual reversal frequency in slant rivalry is under voluntary control. They found that slant rivalry is systematically influenced by voluntary control, which makes it interesting for this study. In their work, they examined four voluntary control exertion tasks: natural, hold perspective, hold disparity, and speed-up. For the natural task, the subject passively viewed the stimulus and indicated (through key presses) which side of the slanted plane he or she perceived to be closer. For the hold perspective and hold disparity tasks, subjects were instructed to attempt to perceive the right or left side closer (corresponding to the perspective-specified slant) or to perceive the other

3072

A. Koene

Competing input signals

Output

Excite Inhibit Figure 2: Classical models for bistable perception. The neural network architecture of classical models for both binocular rivalry and perceptual rivalry is essentially bottom-up using reciprocal inhibition. The competing interpretations constitute the input signals. Random signal noise generates slight signal strength differences over time, even for identical input signals. The signal that is slightly stronger suppresses the weaker signal through reciprocal inhibition. Some form of internal dynamics, such as temporal integration with gain control, is used to make the activity of the neurons at time T depend on their activity at T − δt, producing the inherent bias toward maintaining the previous percept. The strength of this bias decays in time to produce the experimentally found percept duration distributions.

side closer (disparity-specified slant) for as long as possible. For the speedup task, the subject was instructed to alternate the two percepts as rapidly as possible. For each task, the probability density distribution histograms of percept durations showed a skewed, asymmetric (gamma) distribution (Brascamp, van Ee, Pestman, & van den Berg, 2005), similar to the distributions found for binocular rivalry (e.g., Levelt, 1966). The effect of voluntary control resulted in a shift in both the peak of the distribution and the mean percept duration. In this article, we propose a neural network that uses a combination of population coding (for the averaging) and winner-take-all competition (for the bistability). The effect of voluntary control is incorporated in the model as a top-down process that primes the neurons corresponding to the instructed shift in attention such that they have an elevated baseline response. Using a single parameter set, this network successfully replicates the three key results of the slant rivalry studies: (1) transition from cue averaging to bistability as a function of stimulus incongruence, (2) increased alternation rates as a function of increasing cue conflict, and (3) a clear shift in the distribution of percept durations as a result of voluntary control exertion.

Perceptual Averaging and Bistability Model

3073

Top-down bia s

Perspective signal Input Disparity signal Noise

Winner-take-all decision network

Layer 1

Output

Layer 3

Figure 3: General structure of our network model for slant rivalry. The disparity and perspective input signals are combined, on the one hand, with top-down bias signals from the control exertion instruction that was given to the subjects, and, on the other hand, with an inhibitory feedback signal from the output of the network. The feedback path biases the output toward the current percept. The internal network noise, which may in fact have its origin at different stages, is also added at the input stage. Based on these inputs, a winner-take-all network selects the current slant, resulting in a perceived slant that is being forwarded for subsequent processing. The layers refer to Figure 4.

2 Method (Model Design) Our model uses population coding that is similar to the coding found in the visual cortex (Hubel & Wiesel, 1959, 1979; van Essen, Anderson, & Fellman, 1992) with relatively broadly tuned receptive fields to generate percept averaging for small cue conflicts and winner-take-all competition (Marr & Poggio, 1977; McClelland & Rumelhart, 1981) to generate bistability for large cue conflicts. A weak top-down bias input is used to replicate the effect of attention. 2.1 General Architecture of the Model. The basic structure of our model is portrayed in Figure 3. The forward path combines the input signals (e.g., the perspective and disparity defined slants), resolves this information into a single output, and sends this output to the higher visual processing areas. The feedback path feeds the output (which determines the current percept) back to the input of the decision network, biasing it toward the current percept. The internal network noise, which may in fact have its true origin at different stages in the perceptual system, is also added at the input stage since this is where the noise influences the behavior of the network. The noise input is ultimately the cause of the stochastically alternating percepts in bistability. The top-down bias input provides the voluntary control that enables the subject to bias perception. When present, this bias input

3074

A. Koene

makes the network more sensitive to a specific subset of bottom-up input signals. The integration of different stimulus cues related to the same perceptual property (such as slant in depth) must, by the nature of its input signals, occur in higher cortical networks. Accordingly, the receptive fields of the input layer in our model were chosen to be relatively broadly tuned. One of the properties of broadly tuned receptive fields, which are exploited by our model, is that they naturally provide a means of averaging between similar input signals. The winner-take-all process, which selects among the potential percepts and generates the bistability behavior for large-cue conflicts, could result from lateral inhibitory connections that occur in many cortical networks (Lund, Angelucci, & Bressloff, 2003; Budd & Kisvarday, 2001; Crook, Kisvarday, & Eysel, 1998). For clarity, the details of the forward and feedback signal paths are discussed separately. The assumption of broadly tuned receptive fields in our network is not unreasonable considering the increasing receptive field size that has been reported in the visual cortex for some other perceptual properties (van Essen et al., 1992). 2.2 Model Forward Path. The forward path is shown in Figure 4. The neurons in layers 1, 2, and 3 form place-coded maps in which each neuron responds preferentially to a specific percept (e.g., slant). The averaging/bistability behavior of the model results from a combination of broad gaussian receptive fields in layer 1 and an iterative winnertake-all process that selects the location of peak activity in layer 1. The proposed population activation in layer 1 is similar to the response distribution found in direction-selective cells in visual brain area MT when stimulated with transparent moving random dot stimuli (Treue, Hol, & Rauber, 2000). It should be noted, however, that this does not mean that layer 1 corresponds to MT. The receptive field size increase from one cortical layer to the next (van Essen et al., 1992; Wilson, 2003) and winner-take-all type network behavior, resulting from inhibitory lateral connections, are both common features in cortical networks (Moldakarimov, Rollenhagen, Olson, & Chow, 2005). Using gaussian receptive fields, the level of activation of each layer 1 neuron depends on the degree to which the slant coded by it matches the slants coded by the input signals, that is, the use of receptive fields provides a natural implementation of gain factors that depend on the similarity between the input signal and the value coded by a particular layer 1 neuron. Figures 5A to 5C illustrate how differences between the (disparity and perspective defined) inputs affect the resulting bottom-up activation of layer 1 (black solid line). Figure 5D shows the effect of adding signal noise to the inputs used in Figure 5C. The resulting activation of the neurons in layer 1 goes through a winner-take-all competition process (Marr & Poggio, 1977; McClelland & Rumelhart, 1981). Layer 2 functions as a gate between layers 1 and 3. As long as there is more than one active neuron in layer 1, the strong inhibitory connections

Perceptual Averaging and Bistability Model

3075

Winner take all process

P Perspective & disparity signal Inputs

D

Output

Inhibit Excite

Layer 1

Layer 2

Layer 3

Figure 4: The feedforward path of the slant rivalry model. The input units provide the perspective-related (P) and disparity-related (D) signals. Each node in layers 1, 2, and 3 specifies one particular slant. In each layer, the nodes at the same position specify the same slant. For clarity, the figure focuses on the connections, which are active when the slant coded by the second node from the top is being processed. The connectivity paths for the other slants have the same structure. The slant is being selected by the winner-take-all process in layer 1. Layer 2 functions as a gate to stop layer 1 signals from reaching layer 3 before the winner-take-all process has focused the activity in layer 1 down to a single node. Layer 3 signals the output of the slant selection network to areas that determine the perceived slant. The recursive self-excitation in layer 3 maintains the layer 3 output during the time that layer 2 is stopping other inputs from reaching layer 3, that is, during the periods in which layer 1 is in the process of determining the winner.

keep all layer 2 neurons inhibited. Once the winner-take-all competition has silenced all but one layer 1 neuron, the excitatory connection, from this layer 1 neuron to layer 2, results in activation of its layer 2 counterpart. The activated layer 2 neuron subsequently inhibits all layer 3 neurons except the neuron coding the corresponding percept, which it excites. The layer 3 neurons are the output neurons of the network and also the source of the internal feedback signal back to layer 1. The recursive selfexcitation of the layer 3 neurons serves to maintain network output during the period that the layer 2 gate blocks bottom-up inputs from reaching layer 3. 2.3 Model Feedback Path. Inhibitory feedback connections from layer 3 to layer 1 (see Figure 6) bias the network toward maintaining the current

3076

A. Koene 1

1

Relative activation level [%]

A

Perspective Disparity Resulting

0.5

0.5

0

-50

0

1

50

0

-50

0

50

0

50

1

C

D

0.5

0

B

0.5

-50

0

0 50 -50 Slant coded by neuron [deg]

Figure 5: Activation of layer 1 neurons of the slant rivalry model. The dotted and dashed lines indicate the activation of the perspective and disparity inputs, respectively, for the coded slant angles. The solid line indicates the resulting activation. (A–C) The effect of increasing cue conflict on the resulting activation in the absence of internal noise. The activation shifts from having a single peak centered between the perspective and disparity specified slants (i.e., averaging) to having two peaks of equal height located close to the perspectiveand disparity-specified slants (i.e., bistability). The gaussian activation profiles reflect the width of the receptive fields of the layer 1 neurons, determining the minimum level of cue conflict where bistability arises. For intermediate cue conflicts, the resulting activation of layer 1 neurons has a single peak with an ill-defined location. For this case, there is slant averaging with a large intertrial standard deviation in perceived slant. (D) Internal signal noise produces one peak that is slightly higher than the other, causing the percept to shift.

percept. Increasing the activation of the neurons coding the currently perceived percept, through excitatory feedback or by modeling the neurons as leaky integrators (Matsuoka, 1984; Wilson, 2003; Laing & Chow, 2002; Mueller & Blake, 1989; Lehky, 1988), results in a uniform decrease in the relative probability of going to any of the other possible percepts. The use of distributed inhibitory feedback has the advantage that the strength of the inhibitory connection can be a function of the similarity between the values (e.g., depth slant) coded by the layer 1 and layer 3 neurons. This allows the network to suppress some percept changes more than others, which is not possible when using excitatory feedback. It should be noted that this difference between using excitatory versus inhibitory feedback is apparent only

Perceptual Averaging and Bistability Model

3077

P Perspective & disparity signal Inputs

Output

D Inhibit Excite Layer 1

Layer 3

Figure 6: Feedback path of the slant rivalry model. The inhibitory feedback projects the current percept, as coded by layer 3, back to layer 1. Inhibitory feedback is one of the main features of our slant rivalry model. The use of inhibitory feedback has the advantage that it allows strong inhibition of layer 1 neurons that code a similar slant as the active layer 3 neuron, with only weakly inhibiting layer 1 neurons that code a very different slant. Decreasing the strength of the inhibitory feedback as a function of increasing difference between the slant coded by the origin (layer 3 neuron) and the destination (layer 1 neuron) reduces noise-induced percept modifications while maintaining sensitivity to large stimulus changes. Layer 2 is omitted since it is not involved in the feedback.

if there are more than two stable states that the network can be in (as is the case in the depth slant perception but not in binocular rivalry). If applied to a network involving only two competing units (e.g., Matsuoka, 1984, and Mueller, 1990), the inhibitory feedback proposed here would result in the same network dynamics as the use of excitatory feedback. The dashed line in Figure 7 illustrates the relative strength of the inhibitory feedback signals to layer 1 when the activity in layer 3 corresponds to the percept indicated by the black arrows. The strong inhibition of the direct neighbors of the current percept, and the decreasing inhibition strength with distance from the current percept, serves to reduce noise-induced percept changes while maintaining sensitivity to large stimulus differences. Figures 7A to 7C illustrate the effect of the inhibitory feedback on layer 1 activation for different cue conflicts (same as in Figure 5). The dotted lines indicate the excitation by the bottom-up inputs. The solid line indicates the resulting activation (i.e., forward minus feedback activation). Figure 7D illustrates the effect of signal noise for the same cue conflict as in Figure 7C. When the feedforward activation generates a single peak (see Figures 7A and 7B), the inhibitory feedback sharpens this peak, greatly reducing the

3078

A. Koene

1

1

Relative activation level [%]

A

Feedforward Feedback Resulting

0.5

0.5

0

-50

0

1

50

0

-50

0

50

0

50

1

C

D

0.5

0

B

0.5

-50

0

0 50 -50 Slant coded by neuron [deg]

Figure 7: Effect of inhibitory feedback on the activation of layer 1 neurons of the slant rivalry model. The dashed and dotted lines indicate the inhibition and excitation by the feedback and forward paths, respectively, for the coded slant angles. The solid line indicates the resulting activation (i.e., forward minus feedback activation). The profile of the feedback inhibition reflects both the strong inhibition of the direct neighbors of the current percept (indicated by black arrows) and the decreasing inhibition strength with distance from the current percept. (A–C) The effect of the inhibitory feedback for the different cue conflicts (see Figure 5). (D) The activation for the same cue conflict as in C but now with the addition of internal signal noise. When the disparity-specified and perspective-specified slants are similar (i.e., the feedforward activation generates a single peak), the inhibitory feedback has the effect of sharpening peaks, reducing the probability that the noise might cause fluctuations. (C) When the cue conflict is large, the inhibitory feedback has the effect of both sharpening the peak of the currently perceived slant and reducing the height of the competing peak, thereby reducing the probability of a percept change.

probability of noise-induced percept fluctuations. When the cue conflict is large (see Figure 7C), the inhibitory feedback sharpens the peak of the perceived slant and reduces the height of the competing peak, thereby reducing the probability of a percept change. 2.4 Gain Control. Experimental investigations of bistable perception have consistently found that the percept duration distributions are asymmetrically skewed (gamma-function-like) (e.g., Levelt, 1966; Fox &

Perceptual Averaging and Bistability Model

3079

Herrman, 1967; Borsellino, de Marco, Allazetta, Rinesi, & Bartolini, 1972; Sugie, 1982; Ditzinger & Haken, 1989; Gomez et al., 1995; Lehky, 1995; Kalarickal & Marshall, 2000; Merk & Schnakenberg, 2002; Zhou et al., 2004; Brascamp et al., 2005). This indicates that the instantaneous probability for a percept change increases with percept duration (if the percept change probability were constant, the percept duration distribution would be described by an exponential decay). A common way to model the increase in instantaneous percept change probability is through gain control of the neurons whose activity maintains the percept (Matsuoka, 1984; Lehky, 1988; Blake, 1989; Mueller, 1990; Dayan, 1998; Kalarickal & Marshall, 2000; Wilson, Blake, & Lee, 2001; Laing & Chow, 2002; Merk & Schnakenberg, 2002; Stollenwerk & Bode, 2003; Wilson, 2003). The gain control simulates a gradual reduction in neural activity (firing rate), sometimes referred to as fatigue (Palmer, 1999), which may be due to slow after-hyperpolarizing potentials (Lehky, 1988; Wilson, 2003; Lee, 2004), synaptic depression (Bear & Malenka, 1994), or self-inhibition (Stollenwerk & Bode, 2003). There are two places in our model where gain control would result in a gamma-like distribution of percept durations. One possibility is a gain reduction of the “winning” layer 1 neuron that “survived” the winnertake-all competition. In this case, the feedforward activation of layer 1 that sustains the current percept is reduced, increasing the instantaneous probability of a percept change. The other possibility is a gain reduction of the layer 3 neuron associated with the current percept, resulting in a reduction in the strength of the inhibitory feedback signal. In both cases, the gain reduction may be related to synaptic depression or slow afterhyperpolarizing potentials that accumulated over the prolonged period of activation of these neurons. Any combination of these possible sites of gain control is equally suited for reproducing the experimentally found percept duration distributions. Here we assume that the gain control is primarily in the “winning” layer 1 neuron. 2.5 Implementation of Voluntary Control. Top-down voluntary control or attention-driven percept biasing has been examined in a number of psychophysical studies on bistable perception (e.g., Lack, 1978; Gomez et al., 1995; Hol et al., 2003; Toppino, 2003; Meng & Tong, 2004; van Ee, Krumina et al., 2005; van Ee, Van Dam, et al., 2005; Chong et al., 2005; see also an early network model by Vickers, 1972). Attention or voluntary control is implemented in our model as a bias input to layer 1 (see Figure 8). This top-down input provides an excitatory subthreshold input to the layer 1 neurons that correspond to the attended percept. The bias in itself is a weak signal in order to avoid “hallucinatory” percept generation in the absence of any corresponding bottom-up inputs. For large-cue conflicts, where there are two activation peaks in layer 1, the top-down bias lifts the peak on the attended side slightly above the competing peak, increasing the

3080

A. Koene

‘Hold control exertion condition’

P Inputs

D

Bias input

Figure 8: Effect of voluntary control exertion in the slant rivalry model. Voluntary control on the slant selection is modeled through a top-down controlled bias. If the task is to hold one slant over the other, all layer 1 neurons coding such a slant receive an excitatory bias input. For speed-up control exertion, the bias is always sent to the side, coding the slant with opposite sign. For large-cue conflicts, where there are two activation peaks in the layer 1 population, this bias lifts the peak on the biased side slightly above the competing peak, thereby increasing the probability that the corresponding percept is perceived. To account for the experimental finding that spontaneous percept reversals cannot be completely suppressed, the strength of the top-down bias must be less than the peak network noise.

probability that the corresponding percept is being perceived. To account for the experimental finding that spontaneous percept reversals cannot be completely suppressed, the strength of the top-down bias must be less than the peak network noise. 2.6 Model Parameters. The key parameters in our model are: (1) receptive field tuning size of the layer 1 neurons; (2) the width of the (gaussian) inhibitory projective field in the feedback from layer 3 to layer 1; (3) the strength of the bottom-up input signals into layer 1; (4) amplitude and distribution of the signal noise; (5) initial strength of the inhibitory feedback signal; (6) the type of gain decrease in the “winning” layer 1 neuron as a function of percept duration; and (7) the strength of top-down activation bias of layer 1 related to voluntary control. 1. The width of the receptive fields in layer 1 determines how broad an area gets activated by the bottom-up inputs. This determines the level of cue conflict at which the network behaviour switches from

Perceptual Averaging and Bistability Model

3081

averaging to bistability. For slant perception using disparity and perspective cues, the receptive field tuning size can therefore in principle be determined from the data concerning the difference between perspective and disparity-specified slant at which bistability first occurs (van Ee et al., 2002; van Ee, Adams, & Mamassian, 2003; van Ee, Kromina et al., 2005). For simplicity, our current model implementation assumed that the receptive field size is the same for all layer 1 cells. 2. The width of the (gaussian) inhibitory projective field in the feedback from layer 3 to layer 1 determines how the mean rate of percept changes increases with increased cue conflict. The width of the inhibition field has to be greater than the receptive field size of the layer 1 units in order for it to affect percept stability for large-cue conflicts (i.e., during bistable perception). This parameter was set to twice the receptive field tuning width of the layer 1 units. Due to the coarseness of the available data concerning changes in mean percept alternation rate as a function of degree of cue conflict, more precise parameter fitting was not considered useful at this time. 3. The relative strength of the bottom-up inputs determines the probability that the corresponding percept is perceived. If an individual exhibits a strong bias toward perception based on one of the bottomup inputs, this is modeled by increasing the relative strength of this input. 4. The signal-to-noise ratio (SNR) in layer 1 regulates the variance in the location of peak activation in the population-coded layer 1. If the SNR is very large, the percept is fully determined by the bottom-up input signals. If the SNR is very small, the peak activation location will randomly alternate between neurons corresponding to a wide range of percepts. If the noise in layer 1 is due to stochastic firing properties of the individual layer 1 neurons, then the noise at each neuron is independent of its neighbor (i.e., uniformly distributed noise). If the noise is linked to the activation level of the neurons (i.e., more strongly activated neurons generate more noise), the noise distribution corresponds to the product of a uniform noise distribution and the noise-free activation levels of the neurons. This is similar to assuming that the primary source of noise in the system is in the strength of the input signals. Our model is able to simulate published experimental results with either noise model (the results shown here used uniform distributed, activity-independent noise). 5. With the properties of the noise and the bottom-up input signals fixed, the initial strength of the inhibitory feedback signal determines the probability for an immediate (i.e., within 1 second) percept change. Since the inhibitory feedback reduces the activity of the competing

3082

A. Koene

layer 1 neurons (see Figure 7), the probability for a percept change is negatively correlated to the strength of the feedback signal. The initial feedback strength does not fully determine the probability of a percept change beyond the immediate onset of the percept since the gain control of the “winning” layer 1 neuron (see parameter 6) gradually increases the probability of a percept change. 6. Gain control of the “winning” layer 1 neuron, which reduces the activation of this layer 1 neuron as a function of percept duration, determines how the probability of a percept switch changes with time. When the rate of neural activity reduction decreases exponentially with time, the percept durations predicted by the model fit a skewed asymmetrical (gamma-like) distribution. 7. When present (i.e., in the control-exertion conditions where the subject exerts voluntary control), the strength of the bias input is modeled as a constant fixed value. All control exertion conditions were modeled using the same bias input strength. The instruction “hold the left [or right] side in front” is modeled by a bias input to the layer 1 neurons that code for slants with the left (right) side in front (see Figure 8). For the speed-up control exertion case, the bias is always sent to the side coding a slant with opposite sign to the current percept. A mathematical description of the model and its parameters is given in the appendix. 3 Results To test the validity of our network we simulated the slant rivalry experiments by van Ee et al. (van Ee et al., 2002; van Ee, Krumina et al., 2005; van Ee, van Dam et al., 2005) and Brouwer et al. (2004) using 60,000 iterations per simulated experimental condition and control exertion task. As we will now show, all three key results have been successfully replicated using a single parameter set. 3.1 Signal Integration as a Function of Cue Conflict: Averaging vs. Bistability. Figure 9 shows that our slant rivalry network replicates the transition from averaging to bistable behavior as a function of cue conflict between perspective- and disparity-specified slant (van Ee et al., 2002; van Ee, Krumina et al., 2005; van Ee, van Dam et al., 2005). When the cue conflict is small, the signal integration results in averaging (+ symbols). For larger cue conflicts, the percept is bistable, alternating between the percepts indicated by the X and circle symbols. Far from the borders to the averaging regime, the perceived slant in the bistable regime closely corresponds to the slant signaled by either the perspective or the disparity cue. Close to the averaging regime, however, the perceived slant becomes a weighted average

Perceptual Averaging and Bistability Model

50 Persp.-70

50 Persp. -50

50 Persp. -25

0

0

0

-50

-50

-50

-50 Perceived slant [deg]

3083

0

50

-50

0

50

-50

Mean StdErr of 50 Persp. 0 experiment data: 2.5deg

0

50

50

50

50

0

0

0

-50

Persp. 25 -50

0

50

-50

50

Perspective-dominated Disparity-dominated Averaging percept Experiment data

0 Mean difference between model prediction and -50 experiment data: 6.5deg -50

0

Persp. 50

-50

-50 0 50 Disparity-specified slant [deg]

Persp. 70 -50

0

50

Figure 9: Experimental and simulated bistability. Our model produces both averaging and bistability in perceived slant as a function of perspective-specified and disparity-specified slant. Each panel shows the perceived slant versus the disparity-specified slant for a particular perspective specified slant. “Persp.” indicates perspective-specified slant. The X and circle symbols indicate the perceived slants when the percept is bistable for disparity-dominated and perspective-dominated percepts, respectively. The + symbols indicate the perceived slant when the percept corresponds to stable averaging of the disparity and perspective-specified slants. Our slant rivalry neural network replicates perceived bistable slants obtained under various experimental conditions (van Ee et al., 2002; van Ee, Krumina, et al., 2005; van Dam & van Ee, 2005). The black dots indicate the perceived slants reported in van Ee, et al. (2005). The mean absolute difference between the model predictions and the experimental data is 6.5 degrees (mean standard error of the experimental data was 2.5 degrees).

of the slants signaled by the two cues, with the weights gradually equalizing as the transition border to the averaging regime is approached. The black dots in Figure 9 indicate the perceived slants reported in van Ee, Krumina et al. (2005). The mean absolute difference between the model predictions and the experimental data is 6.5degrees. For comparison, the mean standard error of the experimental data was 2.5 degrees. 3.2 Signal Integration as a Function of Cue Conflict: Mean Percept Duration. Figure 10 shows how the degree of cue conflict affects the mean

3084

A. Koene 9.5 Disparity-dominated percept Perspective-dominated percept

9

7.5 7 6.5

Averaging regime

8

Bi-stable regime

Mean percept duration [s]

8.5

Values used in 'voluntary control' simulations

6 5.5 5 4.5

0 25 50 75 100 125 150 175 Conflict between disparity and perspective specified slant [deg]

Figure 10: Mean percept duration versus conflict between disparity and perspective-specified slants. The circle and diamond symbols indicate the mean percept duration for the disparity-dominated and the perspective-dominated percept, respectively, showing that the mean percept duration decreases with increasing cue conflict (asymptotically approaching 1s) due to the reduction in inhibitory feedback connection strength for increasingly different slants. This behavior, which depends essentially on the use of inhibitory feedback, has recently been supported by experimental results (Brouwer et al., 2004). Models that rely on excitatory feedback to strengthen the current percept, rather than on inhibitory feedback to weaken neighboring competitors, do not produce this behavior.

percept durations. For large-cue conflicts (i.e., in the bistable regime), the mean percept duration is negatively correlated with the degree of cue conflict. Experimental data by Brouwer et al. (2004) show the same trend in the rate of percept changes, that is, a decrease in mean percept duration, as a function of increasing stimulus incongruence. Brouwer et al. further reported that the change in mean percept duration is more closely correlated to subjective rather than objective cue conflict. This agrees with our model since the change in mean percept duration as a function of cue conflict depends on the tuning width of the layer 1 receptive fields and the size of the feedback projection from layer 3, both of which are internal parameters that may well be subject dependent. The positive correlation between degree of cue conflict and size of cortical activation reported by Brouwer et al. is also predicted by our model since larger-cue conflict results in less overlap in

Perceptual Averaging and Bistability Model

3085

the population of layer 1 neurons that is activated by the perspective and disparity cues. 3.3 Modulation of Bistability Behavior Through Voluntary Top-Down Biasing. Figure 11 shows the probability density distributions for the simulated percept durations in the bistable regime and the effect of top-down modulation. The top row shows the results for the “natural” condition without top-down bias. The second and third rows show the results for the “hold” control-exertions where a top-down bias was added favoring one over the two alternative percepts. The bottom row depicts the results for the speed-up control exertion where the top-down bias was always assigned to the percept that was currently suppressed. Our slant rivalry neural network replicates the experimentally found percept duration probability distributions obtained under these control exertion conditions (van Ee, Krumina et al. 2005). For each experimental condition and for all percept durations, the simulation error falls within the standard deviation of the experimentally found probability densities. 3.4 Robustness Analysis. The following robustness analysis was performed to illustrate how the model behavior is affected by the choice of its parameter values. Rather than exhaustively testing the model for all possible parameter sets, we opted to test each parameter separately by analyzing the effect of a 10% change in the tested parameter:

r

r

r

Receptive field width of the layer 1 neurons: Provided the ratio between layer 1 receptive field width and the width of the inhibitory projective field of the layer 3 feedback is held constant, increasing or decreasing the layer 1 receptive field width has the same effect as decreasing or increasing cue conflict. Width of the inhibitory projective field in the feedback from layer 3 to layer 1: If the layer 1 bottom-up receptive field tuning width is held constant, changing the width of the inhibition area changes the mean percept duration. The relationship between mean percept duration and degree of cue conflict (see Figure 10) is not affected. At the level of cue conflict used in our “voluntary control” simulations, a 10% increase or decrease in inhibition field area results in a 50% increase or 33% decrease in mean percept durations. Strength of the bottom-up input signals into layer 1: Increasing the strength of one bottom-up input to 10% more than the competing input shifts the distribution of percept durations such that the mean percept duration of the stronger cue is about 2.5 times longer than the mean percept duration of the weaker cue. Note that since our model concerns high-level stimulus conflicts, layer 1 in our model does not correspond to the sensor level. An N-fold increase in stimulus strength

3086

A. Koene Perspective-dominated

Disparity-dominated

Speed Up

Probability density [%] Hold Perspective Hold Disparity

Natural

0.4 0.2 0 0 0.4 0.2 0 0 0.4 0.2 0 0 0.4 0.2 0 0

0.4 Mean: 6.17s Mean error: 0.01s M. exp. std: 0.02s

10

20

0.2

30

Mean: 9.60s Mean error: 0.01s M. exp. std: 0.03s

10

20

0.2

30

Mean: 3.66s Mean error: 0.01s M. exp. std: 0.02s

10

20

0 0 0.4 0.2

30

Mean: 3.69s Mean error: 0.01s M. exp. std: 0.02s

10 20 Duration [s]

0 0 0.4

0 0 0.4 0.2

30

0 0

Mean: 5.84s Mean error: 0.01s M. exp. std: 0.02s

10

20

30

Mean: 3.56s Mean error: 0.01s M. exp. std: 0.02s

10

20

30

Mean: 9.22s Mean error: 0.01s M. exp. std: 0.02s

10

20

30

Mean: 3.60s Mean error: 0.01s M. exp. std: 0.02s

10 20 Duration [s]

30

Figure 11: Percept durations produced by the slant rivalry neural network. The left and right columns depict the probability density distributions for the disparity-dominated and perspective-dominated percepts durations, respectively. The top row shows the results for the natural control exertion condition without top-down bias. The second and third rows show the results for the hold control exertion where a top-down bias was added favoring one over the two alternative percepts. The bottom row depicts the results when the top-down bias was always assigned to the percept that was currently not prevailing. All simulations were done using the same fixed set of model parameters. The model parameters were determined by the natural control exertion condition, assuming no top-down bias. A single bias input was sufficient to fit each of the control exertion conditions. Our slant rivalry neural network replicates experimentally found percept duration probability density distributions obtained under the described control exertion conditions (van Ee, Krumina et al., 2005) within one standard error. Mean error indicates the mean difference between simulation results and experimental data, m.expt.std indicates mean standard deviation of the experimental data.

Perceptual Averaging and Bistability Model

r r r

r

3087

therefore does not automatically correspond to an N-fold increase in the input signal to layer 1. Amplitude of the signal noise: Increasing or decreasing the noise level by 10% decreases or increases the mean percept durations by approximately 10%. Initial strength of the inhibitory feedback signal: Increasing or decreasing the initial feedback strength by 10% results in an approximately 60% increase or 30% decrease in the mean percept durations. Gain change of the “winning” layer 1 neuron as a function of percept duration: A 10% increase or decrease in parameter α, β, or γ (see equation A.1) results in approximately a 1% decrease or increase, 10% decrease or increase, or 40% increase or 25% decrease in mean percept duration, respectively. Top-down activation bias related to voluntary control: Increasing or decreasing the strength of the voluntary control top-down bias by 10% increases or decreases the mean percept duration of the attended percept by 5% and decreases or increases the mean percept duration of the alternative percept by 5%.

4 Discussion We have presented a neural network model of signal integration that seamlessly combines averaging and stochastic bistability behavior, as well as voluntary control, in the perception of ambiguous slant stimuli. The network behavior changes from averaging to bistability as a function of the degree of conflict between the disparity-specified and perspective-specified slants and shows increasing rates of perceptual slant reversals for increasing stimulus incongruence. The voluntary control of perception is achieved through the use of a top-down bias input. Most previous mechanistic models for perceptual signal integration modeled either the generation of unified stable (binocular) perception (reviewed by Howard & Rogers, 2002) or bistability (Vickers, 1972; Sugie, 1982; Matsuoka, 1984; Kawamoto & Anderson, 1985; Lehky, 1988; Blake, 1989; Mueller & Blake, 1989; Mueller, 1990; Ditzinger & Haken, 1989; Lehky & Blake, 1991; Lumer, 1998; Dayan, 1998; Kalarickal & Marshall, 2000; Laing & Chow, 2002; Merk & Schnakenberg, 2002; Stollenwerk & Bode, 2003; Wilson, 2003; Zhou et al., 2004) without explicitly addressing the transition from one type of behavior to the other. To our knowledge, the only previous models that account for these perceptual transitions are a Bayesian signal integration model by van Ee et al. (2003) and a binocular vision model by Hayashi, Maeda, Shimojo, and Tachi (2004). Van Ee et al. provided a framework to describe the behavior of the perceptual system but did not address the neural network responsible for this perceptual behavior. Hayashi et al.’s

3088

A. Koene

model did provide a mechanistic framework for the switch from fusion to rivalry but is not easily generalized to other signal integration cases since it relies heavily on mechanisms that are specific to the role of half-occluded unpaired points in solving the binocular correspondence problem. The model of stereo depth acuity and transparency by Lehky and Sejnowski (1990) has some similarity to the population-coding scheme here. In that model, going to a transparent surface (two depths) corresponded to going from a unimodal to bimodal distribution of activity in the population, similar to here. However, in that model, there were no oscillations, as the percept of transparency is stable. The dynamics of the percept changes in our model are governed by the gain decay or fatigue function (see the appendix, “gain control”) of the layer 1 neurons that gradually decreases the stability of the percept and the signal-to-noise ratio of the input signals to layer 1. We have reproduced the results of a set of experiments (van Ee et al., 2002; van Ee, Krumina et al., 2005; van Ee, van Dam et al., 2005; van Dam & van Ee, 2005, Brouwer et al., 2004) that showed a transition from perceptual averaging of the disparity- and the perspective-specified slants to bistability (see Figure 9), changes in mean bistable percept duration as a function of perceptual cue conflict (see Figure 10), and the effect of voluntary control on the probability density distribution of bistable percept durations (see Figure 11). The results of our simulations provided a good fit with the experimental data. All of the qualitative behavior was reproduced, and the predicted percept durations (see Figure 11) were all within one standard deviation of the experimental results. 4.1 Purpose of the Inhibitory Feedback from the Current Percept. Most mechanistic models of bistable perception contain some mechanism (i.e., feedback or temporal integration) that allows the percept at time t to affect the way the input signals are being perceived at time t + δt (Matsuoka, 1984; Wilson, 2003; Laing & Chow, 2002; Mueller & Blake, 1989; Lehky, 1988). By strengthening the signals related to the current percept, the stability of the network behavior is enhanced, reducing the perceptual consequences of signal noise. In our model this stabilizing mechanism is implemented by the inhibitory feedback from layer 3 to layer 1. Without the inhibitory feedback, the combination of network noise and wide receptive fields in layer 1 would result in small random shifts of the location of peak activation, even in the absence of a cue conflict. This would cause the percept to wobble or jitter. Note, however, that the inhibitory feedback does not ensure consistency of the percept over separate presentations. On each presentation, the network noise affects at which exact value the peak is initially located. This location is then held stable. For unambiguous stimuli, the internal noise can be seen in the variance of perceived slant when the same stimulus is offered repeatedly. Since the purpose of the feedback is to enhance perceptual reliability by reducing the consequences of internal

Perceptual Averaging and Bistability Model

3089

noise, its effect on the ability of the visual system to detect changes in the environment should be minimized. Excitatory feedback, strengthening the signals related to the current percept, would have the same net effect as inhibitory feedback uniformly reducing all other signals in the network, resulting in a uniform reduction in the sensitivity to changes in the stimulus. Our inhibitory feedback signal allows a differentiated approach. By applying strong inhibition to the cells that are functionally close to the current percept and decreasing the inhibition strength with distance from the current percept, we minimize the reduction of the salience of big changes in the stimulus while suppressing noise-induced percept instability. An additional property of this type of differentiated inhibitory feedback is that it predicts a gradual increase in the rate of percept changes (decrease in mean percept duration) as the cue conflict is increased (Brouwer et al., 2004). Models that rely on excitatory feedback to strengthen the current percept do not produce this behavior. 4.2 Multiple Stages of Binocular Rivalry. The model of Hayashi et al. (2004) integrates stereopsis and binocular rivalry, which are usually treated separately, into a single framework of binocular vision. In this model rivalry arises due to interocular inhibition between representations of monocularly visible regions. In the hierarchy of visual perception, this can be considered as low- or midlevel rivalry occurring just after binocular convergence at the correspondence problem-solving stage. While this model provides an explanation for the occurrence of bistability in binocular rivalry, it does not explain slant rivalry data since the Hayashi et al. model is specifically tailored toward binocular fusion or rivalry. The Hayashi et al. model also does not predict the slight shift toward the nonperceived percept that was found in slant rivalry experiments. In our model, the population code in layer 1 encodes only the perceptual property of input signals, not the sensory origin of the stimulus cue from which the signal is derived. The percept derived from our signal integration network is therefore blind to its sensory origin and will not be affected by interchanging the sensory origin of stimulus cues. When applied to binocular rivalry, our network can thus be understood as modeling a network involved in higher-level stimulus rivalry (Logothetis, Leopold, & Sheinberg, 1996) rather than classical binocular rivalry. Thus, our model and the model by Hayashi et al. (2004) can be considered complementary parts in a two- (or more) stage process and as such may be linked to the multiple stage rivalry model suggested by Wilson (2003). 4.3 Predictions and Possible Extensions 4.3.1 Weighted Cue Combination. Many investigations of sensory cue combination have shown that cue reliability is taken into account in the percept formation (e.g., Landy, Maloney, Johnston, & Young, 1995; Oruc,

3090

A. Koene

Maloney, & Landy, 2003; Ernst & Banks, 2002). The cue reliability is either statistically determined (e.g., Bayesian estimators) or derived from ancilliary cues (related cues that provide a context for the cue estimation). A simple way in which to implement reliability-dependent cue weighting in our neural network is by adjusting the activation strength of the input signals to layer 1 (see, equation A.1: a 0, j (t)). This kind of cue weighting might implement a bias based on past experience of cue reliability. The predicted effect of biased cue weighting is very similar to the effect of the voluntary top-down bias (see section 3). Since our model uses population codes to signal the stimulus property, the model also implements online cue reliability-based weighting (Ernst & Buelthoff, 2004) through the layer 1 receptive fields. If a cue is unreliable, that is, there is a lot of variance in the layer 1 input provided by this cue, the area in layer 1 that is activated by the cue will smear, becoming broader with a lower peak. The broadening of the activated layer 1 area will have the same effect as using larger receptive fields in layer 1 would (see section 2). The decrease of the activation peak would have an effect that is similar to a voluntary top-down bias in favor of the more reliable cue. 4.3.2 Multistable Percepts with More Than Two Input Cues. In order to extend our signal integration network for the condition where there are more than two input cues, the additional cues can simply be added as further inputs to layer 1 of the network. For slant perception, for instance, surface texture and shading could be added as additional slant cues in exactly the same way as perspective and disparity (in equation A.1, simply add j = 3 and j = 4). The predicted percept(s) in this condition will again depend on the relative differences between the input signals (analogous to Figure 5). Due to the differentiated inhibitory feedback in our model, which decreases with functional distance, our model predicts preferential switching between pairs of perceptual states during multistable perception, as reported by Suzuki and Grabowecky (2002). The preferred switch will always be to the state that is most different from the current percept since that state is least inhibited. In slant perception, for instance, having four cues that signal 0, 45, 90, and 135degree slant, respectively, would result in preferential switching between 0 and 90degrees and between 45 and 135degrees since these are farthest from each other and thus inhibit each other least (note that 0degree slant is the same as 180degree slant). 5 Conclusion Our neural network provides a possible mechanism for explaining processes in which continuously variable information from different sources is integrated for perceptual awareness. It successfully incorporates both averaging and stochastic bistability behavior in slant perception. The transition in our model behavior from a percept-averaging regime to a regime

Perceptual Averaging and Bistability Model

3091

of bistability is a natural consequence of the combination of population coding and wide receptive field tuning common to higher cortical areas. Voluntary control of perception is achieved through the use of a single top-down bias input. Differentiated inhibitory feedback plays an important role in our network for both increased percept stability without reduced sensitivity to larger stimulus changes and increasing alternation rates as a function of stimulus incongruence. In this article, we have validated the model by applying it to slant rivalry instigated by conflicting perspective and disparity signals. The model architecture, however, does not contain any assumption that would limit it to this particular example of stimulus rivalry. Key elements of this model are the use of population coding and inhibitory feedback with wide receptive and projective fields. Appendix: Mathematical Description of the Model and Its Parameters The model was implemented in Matlab5 and is available on request. A.1 Layer 1. The activation of the layer 1 neurons by their bottom-up excitatory inputs and top-down inhibitory inputs is given by  a 1,i (t) = G 1,i (τ ) 



 k=i

   (m1,i − m0, j )2 a 0, j (t) exp − σ12 j=1

2 

 

(m1,i − m3,k )2 a 3,k (t) exp − σ32 β

G 1,i (τ ) = e −(τ +α) + γ ,





+ ε1,i (t) + α1,i (t) .

where α = 0.3, β = 0.38 and γ = 0.51, (A.1)

where a 1,i (t) is the activation of the ith layer 1 neuron at time t; a 0, j (t) is the activation of the jth bottom-up input at time t; m1,i is the center of the receptive field of the ith layer 1 neuron (i.e., the slant orientation coded by this cell); m0, j is the slant orientation signaled by the jth bottom-up input; σ1 is the width of the layer 1 receptive fields (for simplicity, all layer 1 neurons are assumed to have the same receptive field tuning width); a 3,k (t) is the activation of the kth layer 3 neuron at time t; m3,k is the center of the receptive field of the ith layer 1 neuron (i.e., the slant orientation coded by this cell); σ3 is the width of the layer 3 inhibitory projective fields (for simplicity, all layer 3 neurons are assumed to have the same projective field width); ε1,i (t) is the noise acting on the ith layer 1 neuron at time t; and α1,i (t) is the activation bias due to attention affecting the ith neuron of layer 1at time t. G 1,i (τ ) is the gain of the ith layer 1 where τ is the duration that this layer 1 neuron has signaled the current percept, that is, the perceived

3092

A. Koene

slant was the same as the slant coded by the ith neuron. For layer 1 neurons that do not code, the current percept τ = 0 (i.e., G(τ ) = 1). A.2 Winner-Take-All Interaction in Layer 1. A biologically plausible implementation could take the form of a network of lateral inhibition such that a 1,i (t + δt) = a 1,i (t) −

N  a 1, j (t) j=1

N

where a 1,i (t + δt) is the activation of the ith layer 1 neuron at time t + δt; a 1,i (t) is the activation of the ith layer 1 neuron at time t; N is the total number of layer 1 neurons, and a 1, j (t) is the activation of the ith layer 1 neuron at time t. After a couple of iterations, this will reduce the activity of all neurons, except the maximally activated neuron, to zero. A.3 Layer 2. The function of layer 2 is to accommodate the network for the processing time required by physiologically plausible winner-take-all processes. The activation of the layer 2 neurons by their bottom-up inputs is given by a 2,i (t) = Ea 1,i (t) −



I a 1,k (t),

k=i

with

E  I,

where a 2,i (t) is the activation of the ith layer 2 neuron at time t; E is the gain of the excitatory projections from layer 1 to layer 2; a 1,i (t) is the activation of the ith layer 1 neuron at time t; I is the gain of the inhibitory projections from layer 1 to layer 2; and a 1,k (t) is the activation of the kth layer 1 neuron at time t. A.4 Layer 3. The activation of the layer 3 neurons is given by  a 3,i (t) = min a 3,i (t − δt) + E a 2,i (t) −



 I a 2,k (t), S ,

k=i

with

E  I.

where a 3,i (t) is the activation of the ith layer 3 neuron at time t; a 3,i (t − δt) is the activation of the ith layer 3 neuron at time t − δt; E is the gain of the excitatory projections from layer 2 to layer 3; a 2,i (t) is the activation of the ith layer 2 neuron at time t; I is the gain of the inhibitory projections from

Perceptual Averaging and Bistability Model

3093

layer 2 to layer 3; and a 2,k (t) is the activation of the kth layer 2 neuron at time t. S is the neural activity saturation level. Acknowledgments Part of the work was developed while A.K. was at the INSERM U534, Lyon, France, in 2003. Major parts of the work have been presented at the Conf´erence Ladislav Tauc en Neurobiologie in 2003 and the European Conference on Visual Perception in 2004. A.K. was partly supported by a Human Frontiers Research Project grant assigned to A. Johnston and partly supported by a Gatsby charitable foundation grant assigned to L. Zhaoping. The help of R. van Ee in providing the slant rivalry data files is sincerely appreciated. References Bear, M. F., & Malenka, R. C. (1994). Synapic plasticity: LTP and LTD. Current Opinion in Neurobiology, 4, 389–399. Blake, R. (1989). A neural theory of binocular rivalry. Psychological Review, 96, 145– 167. Borsellino, A., de Marco, A., Allazetta, A., Rinesi, A., & Bartolini, B. (1972). Reversal time distribution in the perception of visual ambiguous stimuli. Kybernetik, 10, 139–144. Brascamp, J. W., van Ee, R., Pestman, W. R., & van den Berg A.V. (2005). Distribution of alternation rates in various forms of bistable perception. Journal of Vision, 5, 287–298. Brouwer, G. J., Tong, F., Schwarzbach, J., & van Ee, R. (2004). Neural correlates of stereoscopic depth perception in visual cortex. Society for NeuroScience, 664.13. Budd, J. M., & Kisvarday, Z. F. (2001). Local lateral connectivity of inhibitory clutch cells in layer 4 of cat visual cortex (area 17). Experimental Brain Research, 140(2), 245–250. Chong, S. C., Tadin, D., & Blake R. (2005). Endogenous attention prolongs dominance durations in binocular rivalry. Journal of Vision, 5, 1004–1012. Crook, J. M., Kisvarday, Z. F., & Eysel, U. T. (1998). Evidence for a contribution of lateral inhibition tuning and direction selectivity in cat visual cortex: Reversible inactivation of functionally characterized sites combined with neuroanatomical tracing techniques. European Journal of Neuroscience, 10(6), 2056–2075. Dayan, P. (1998). A hierarchical model of binocular rivalry. Neural Computation, 10, 1119–1135. Ditzinger, T., & Haken, H. (1989). Oscillations in the perception of ambiguous patterns. Biological Cybernetics, 61, 279–287. Ernst, M. O., & Banks, M. S. (2002). Humans integrate visual and haptic information in a statistically optimal fashion. Nature, 415(6870), 429–433. Ernst, M. O., & Buelthoff, H. H. (2004). Merging the senses into a robust percept. Trends in Cognitive Sciences, 8(4), 162–169.

3094

A. Koene

Fox, R., & Herrman, J. (1967). Stochastic properties of binocular rivalry alternations. Perception and Psychophysics, 2, 432–436. Gomez, C., Argandona, E. D., Solier, R. G., Angulo, J. C., & Vazquez, M. (1995). Timing and competition in networks representing ambiguous figures. Brain and Cognition, 29, 103–114. Goryo, K., Robinson, J. O., & Wilson, J. A. (1984). Selective looking and the MullerLyer illusion: The effect of changes in the focus of attention on the Muller-Lyer illusion. Perception, 13, 647–654. Hayashi, R., Maeda, T., Shimojo, S., & Tachi, S., (2004). An integrative model of binocular vision: A stereo model utilizing interocularly unpaired points produces both depth and binocular rivalry. Vision Research, 44(20), 2367–2380. Hol, K., Koene, A., & van Ee, R. (2003). Attention-biased multi-stable surface perception in three-dimensional structure-from-motion. Journal of Vision, 3, 486– 498. Howard, I. P., & Rogers, B. J. (2002). Seeing in depth, Vol 2: Depth perception. Toronto: I. Porteous. Hubel, D. H., & Wiesel, T. N. (1959). Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. Journal of Physiology (London), 148, 574–591. Hubel, D. H., & Wiesel, T. N. (1979). Brain mechanisms of vision. Scientific American, 241(3), 150–162. Kalarickal, G. J., & Marshall, J. A. (2000). Neural model of temporal and stochastic properties of binocular rivalry. Neurocomputing, 32–33, 843–853. Kawamoto, A. H., & Anderson, J. A. (1985). A neural network model of multistable perception. Acta Psychologica, 59, 35–65. Lack, L. C. (1978). Selective attention and the control of binocular rivalry. The Hague: Mouton. Laing, C. R., & Chow, C. C. (2002). A spiking neuron model for binocular rivalry. Journal of Computational Neuroscience, 12, 39–53. Landy, M. S., Maloney, L. T., Johnston, E. B., & Young, M. (1995). Measurement and modeling of depth Cue combination: In defense of weak fusion. Vision Research, 35(3), 389–412. Lee, S. H. (2004). Binocular battles on multiple fronts. Trends in Cognitive Sciences, 8(4), 148–151. Lehky, S. (1988). An astable multivibrator model of binocular rivalry. Perception, 17, 215–228. Lehky, S. R., (1995). Binocular rivalry is not chaotic. Proceedings of the Royal Society London B, 259, 71–76. Lehky, S. R., & Blake, R. (1991). Organization of binocular pathways: Modeling and data related to rivalry. Neural Computation, 3, 44–53. Lehky, S. R., & Sejnowski, T. J. (1990). Neural network model of stereoacuity and depth interpolation based on a distributed representation of stereo disparity. Journal of Neuroscience, 10, 2281–2299. Levelt, W. J. M. (1966). The alternation process in binocular rivalry. British Journal of Physiology, 57, 225–238. Logothetis, N. K., Leopold, D. A., & Sheinberg, D. L. (1996). What is rivalling during binocular rivalry? Nature, 380, 621–624.

Perceptual Averaging and Bistability Model

3095

Lumer, E. D. (1998). A neural model of binocular integration and rivalry based on the coordination of action-potential timing in primary visual cortex. Cerebral Cortex, 8, 553–561. Lund, J. S., Angelucci, A., & Bressloff, P. C. (2003). Anatomical substrates for functional columns in macaque monkey primary visual cortex. Cerebral Cortex, 13(1), 15–24. Marr, D., & Poggio, T. (1977). A computational theory of human stereo disparity. Science, 194, 283–287. Matsuoka, K. (1984). The dynamic model of binocular rivalry. Biological Cybernetics, 49, 201–208. McClelland, J. L., & Rumelhart, D. E. (1981). An interactive activation model of context effects in letter perception: I. An account of basic findings. Psychological Review, 88(5), 375–407. Meng, M., & Tong, F. (2004). Can attention selectively bias bistable perception? Differences between binocular rivalry and ambiguous figures. Journal of Vision, 4, 539–551. Merk, I., & Schnakenberg, J. (2002). A stochastic model of multistable visual perception. Biological Cybernetics, 86, 111–116. Moldakarimov, S., Rollenhagen, J. E., Olson, C. R., & Chow, C. C. (2005). Competitive dynamics in cortical responses to visual stimuli. Journal of Neurophysiology, 94(5), 3388–3396. Mueller, T. J. (1990). A physiological model of binocular rivalry. Visual Neuroscience, 4, 63–73. Mueller, T. J., & Blake, R. (1989). A fresh look at the temporal dynamics of binocular rivalry. Biological Cybernetics, 61, 223–232. Oruc, I., Maloney, L. T., & Landy, M. S. (2003). Weighted linear cue combination with possibly correlated error. Vision Research, 43, 2451–2468. Palmer, S. E. (1999). Vision science: Photons to phenomenology. Cambridge, MA: MIT Press. Shulman, G. L. (1992). Attentional modulation of figural aftereffect. Perception, 21, 7–19. Stollenwerk, L., & Bode, M. (2003). Lateral neural model of binocular rivalry. Neural Computation, 15, 2863–2882. Sugie, N. (1982). Neural models of brightness perception and retinal rivalry. Biological Cybernetics, 43, 13–21. Suzuki, S., & Grabowecky, M. (2002). Evidence for perceptual “trapping” and adaptation in multistable binocular rivalry. Neuron, 36, 143–157. Toppino, T. C. (2003). Reversible-figure perception: Mechanisms of intentional control. Perception and psychophysics, 65(8), 1285–1295. Treue, S., Hol, K., & Rauber, H.-J. (2000). Seeing multiple directions of motionphysiology and psychophysics. Nature Neuroscience, 3(3), 270–276. Tsal, Y. (1984). A Muller-Lyer illusion induced by selective attention. Quarterly Journal of Experimental Psychology, Human Experimental Psychology, 36A, 319– 333. van Dam, L. C. J., & van Ee, R. (2005). The role of (micro)saccades and blinks in perceptual bi-stability from slant rivalry. Vision Research, 45(18), 2417– 2435.

3096

A. Koene

van Ee, R. (2005). Dynamics of perceptual bi-stability for stereoscopic slant rivalry and a comparison with grating, house-face, and Necker cube rivalry. Vision Research, 45, 29–40. van Ee, R., Adams, W. J., & Mamassian, P. (2003). Bayesian modeling of cue interaction: Bi-stability in stereoscopic slant perception. Journal of the Optical Society of America A, 20, 1398–1406. van Essen, D. C., Anderson, C. H., & Fellman, D. J. (1992). Information processing in the primate visual system: An integrated systems perspective. Science, 255(5043), 419–423. van Ee, R., Krumina, G., Pont, S. P., & van der Ven, S. (2005). Voluntarily controlled bi-stable slant perception of real and photographed surfaces. Proceedings of the Royal Society London. Series B, Biological Sciences, 272(1559), 141–148. van Ee, R., van Dam L. C. J., & Brouwer, G. J. (2005). Voluntary control and the dynamics of perceptual bi-stability. Vision Research, 45, 41–55. van Ee, R., van Dam, L. C. J., & Erkelens, C. J. (2002). Bi-stability in perceived slant when binocular disparity and monocular perspective specify different slants. Journal of Vision, 2, 597–607. Vickers, D. (1972). A cyclic decision model of perceptual alternation. Perception, 1, 31–48. Wilson, H. R. (2003). Computational evidence for a rivalry hierarchy in vision. Proceedings of the National Academy of Sciences of the United States of America, 100(24), 14499–14503. Wilson, H. R., Blake, R., & Lee, S. H. (2001). Dynamics of travelling waves in visual perception. Nature, 412, 907–910. Zhou, Y. H., Gao, J. B., White, K. D., & Yao, K. (2004). Perceptual dominance time distributions in multistable visual perception. Biological Cybernetics, 90, 256–263.

Received January 4, 2006; accepted April 27, 2006.

A Model for Perceptual Averaging and Stochastic ...

It should be noted, however, that this does not mean that layer 1 corre- sponds to MT. .... asymmetrically skewed (gamma-function-like) (e.g., Levelt, 1966; Fox & ...

610KB Sizes 0 Downloads 256 Views

Recommend Documents

A dynamic stochastic general equilibrium model for a small open ...
the current account balance and the real exchange rate. ... a number of real frictions, such as habit formation in consumption, investment adjustment costs ...... also define the following equations: Real imports. (. ) m t t t t m Q c im. = +. (A30).

Stochastic cell transmission model (SCTM) A stochastic dynamic ...
Stochastic cell transmission model (SCTM) A stochastic ... model for traffic state surveillance and assignment.pdf. Stochastic cell transmission model (SCTM) A ...

Bayesian Model Averaging for Spatial Econometric ...
Aug 11, 2005 - represents a cross-section of regions located in space, for example, counties, states, or countries. y ¼ rWy ю ... If the sample data are to determine the posterior model probabilities, the prior probabilities ..... averaged estimate

Bayesian Model Averaging for Spatial Econometric ...
Aug 11, 2005 - There is a great deal of literature on Bayesian model comparison for nonspatial .... structure of the explanatory variables in X into account. ...... Further computational savings can be achieved by noting that the grid can be.

Bayesian Model Averaging for Spatial Econometric ...
11 Aug 2005 - We extend the literature on Bayesian model comparison for ordinary least-squares regression models ...... with 95 models having posterior model probabilities 40.1%, accounting for. 83.02% probability ...... choices. 2 MATLAB version 7 s

A Simple Stochastic Model for Generating Broken ...
Jan 5, 2009 - cloud-top height retrieved from the Moderate Resolution Imaging Spectroradiometer (MODIS). .... data. Schmidt et al. (2007) used cloud fields simulated by the last ..... President's program Leading Scientific Schools (Grant.

A discrete stochastic model for investment with an ...
A discrete stochastic model for investment with an application to the transaction costs case. Laurence Carassus a,), Elyes Jouini b. ` a UniХersite de Paris 7, CREST and CERMSEM, Paris, France. ´ b CREST and CERMSEM, UniХersite de Paris, 1 Pantheo

Approximated stochastic realization and model ...
applied to array processing by means of state space models. Jean-Pierre ... (I) (j 2 =-I) k= I where. m is the number of sources, a k and 'pk represent respectively ...

Perceptual Reasoning for Perceptual Computing
Department of Electrical Engineering, University of Southern California, Los. Angeles, CA 90089-2564 USA (e-mail: [email protected]; dongruiw@ usc.edu). Digital Object ... tain a meaningful uncertainty model for a word, data about the word must be

Default correlations derived with an averaging model
default correlation between groups of similar clients from a large rep- resentative data set. This approach assumes that all the elements in the correlation matrix ...

A Stochastic Inventory Model with Price Quotation
associated expenditures such as driving, telephone calls, computer fees, magazine ...... V1, we fix S2 and search for an optimal (r, S1) by reducing their values by 1 until a stopping condition is met. ...... algebra to get the desired expression.

A Stochastic Volatility Swap Market Model Sami Attaoui
Phone: 0140463170. E-mail: [email protected]. I would like to thank P. Poncet, ..... We compute swaption prices through the FFrFT and compare them, w.r.t. ...

Unaffected Perceptual Thresholds for Biological and ... - ScienceOpen
Oct 18, 2010 - dot kinematograms (RDKs), a number of studies have reported that participants with ASC had .... noise dot had the same trajectory of one of the dots in the walker. In the SO and UO conditions, ..... Thurman SM, Grossman ED (2008) Tempo

Similarity-Based Perceptual Reasoning for Perceptual ...
Dongrui Wu, Student Member, IEEE, and Jerry M. Mendel, Life Fellow, IEEE. Abstract—Perceptual reasoning (PR) is ... systems — fuzzy logic systems — because in a fuzzy logic system the output is almost always a ...... in information/intelligent

A stochastic representation for fully nonlinear PDEs and ...
where u(t, x) is a solution of PDE (0.1)-(0.2) and (Ys,Zs)s∈[t,T] is a unique pair of adapted .... enization by analytic approaches is also an interesting subject.

Fast Averaging
MapReduce. Conclusions. Motivation. Large data center (a cluster of computers). Used by Microsoft, Google, Amazon, Facebook, ... What functions ... 15/17. Introduction. Algorithm. MapReduce. Conclusions. Why it works. Estimated Frequency. Facebook. F

Fast Averaging
Laboratory for Information and Decision Systems. Massachusetts Institute of Technology. {bodas, devavrat}@mit.edu. Abstract—We are interested in the following question: given n numbers x1,...,xn, what sorts of approximation of average xave = 1 n (x

A No-reference Perceptual Quality Metric for ... - Research at Google
free energy of this inference process, i.e., the discrepancy between .... such that their encoding bit rates are all higher than 100 ..... approach in the DCT domain.

A Behavioural Model for Client Reputation - A client reputation model ...
The problem: unauthorised or malicious activities performed by clients on servers while clients consume services (e.g. email spam) without behavioural history ...

A Reconstruction Decoder for the Perceptual Computer
Abstract—The Word decoder is a very important approach for decoding in the Perceptual Computer. It maps the computing with words (CWW) engine output, ...

Unaffected Perceptual Thresholds for Biological and Non ... - PLOS
Oct 18, 2010 - 1 Department of Cognitive Science, University of California San Diego, La Jolla, California, United States of America, 2 Institute ... College London, London, United Kingdom .... that 4 years before taking part in this experiment.

Using Stochastic NTCC to Model Biological Systems
Computer Science. ... calling into play different degrees of precision (i.e. partial information) about temporal or .... ntcc provides the following constructors:.