zbMATH — the first resource for mathematics

Modeling attention to salient proto-objects. (English) Zbl 1112.92017
Summary: Selective visual attention is believed to be responsible for serializing visual information for recognizing one object at a time in a complex scene. But how can we attend to objects before they are recognized? In coherence theory of visual cognition, so-called proto-objects form volatile units of visual information that can be accessed by selective attention and subsequently validated as actual objects. We propose a biologically plausible model of forming and attending to proto-objects in natural scenes. We demonstrate that the suggested model can enable a model of object recognition in cortex to expand from recognizing individual objects in isolation to sequentially recognizing all objects in a more complex scene.

92C20 Neural biology
91E30 Psychophysics and psychophysiology; perception
Full Text: DOI
[1] Braun, J., Visual-search among items of different salience — removal of visual-attention mimics a lesion in extrastriate area V4, Journal of neuroscience, 14, 2, 554-567, (1994)
[2] Burt, P.J.; Adelson, E.H., The Laplacian pyramid as a compact image code, IEEE transactions on communications, COM-31, 4, 532-540, (1983)
[3] Chelazzi, L.; Miller, E.K.; Duncan, J.; Desimone, R., Responses of neurons in macaque area V4 during memory-guided visual search, Cerebral cortex, 11, 8, 761-772, (2001)
[4] Connor, C.E.; Preddie, D.C.; Gallant, J.L.; van Essen, D.C., Spatial attention effects in macaque area V4, Journal of neuroscience, 17, 9, 3201-3214, (1997)
[5] Deco, G.; Schürmann, B., A hierarchical neural system with attentional top-down enhancement of the spatial resolution for object recognition, Vision research, 40, 20, 2845-2859, (2000)
[6] Duncan, J., Selective attention and the organization of visual information, Journal of experimental psychology: general, 113, 4, 501-517, (1984)
[7] Duncan, J., Integrated mechanisms of selective attention, Current opinion in biology, 7, 255-261, (1997)
[8] Egly, R.; Driver, J.; Rafal, R.D., Shifting visual attention between objects and locations: evidence from normal and parietal lesion subjects, Journal of experimental psychology general, 123, 2, 161-177, (1994)
[9] Eriksen, C.W.; St. James, J.D., Visual attention within and around the field of focal attention: A zoom Lens model, Perception and psychophysics, 40, 4, 225-240, (1986)
[10] Feldman, J.A.; Ballard, D.H., Connectionist models and their properties, Cognitive science, 6, 3, 205-254, (1983)
[11] Gandhi, S.P.; Heeger, D.J.; Boynton, G.M., Spatial attention affects brain activity in human primary visual cortex, Proceedings of the national Academy of sciences of the USA, 96, 6, 3314-3319, (1999)
[12] Grossberg, S., Adaptive pattern classification and universal recoding: I. parallel development and coding of neural feature detectors, Biological cybernetics, 23, 3, 121-134, (1976) · Zbl 0339.92004
[13] Grossberg, S., Adaptive pattern classification and universal recoding: II. feedback, expectation, olfaction, illusions, Biological cybernetics, 23, 4, 187-202, (1976) · Zbl 0339.92005
[14] Hamker, F.H., The reentry hypothesis: the putative interaction of the frontal eye field, ventrolateral prefrontal cortex, and areas V4, IT for attention and eye movement, Cerebral cortex, 15, 4, 431-447, (2005)
[15] Hamker, F.H., The emergence of attention by population-based inference and its role in distributed processing and cognitive control of vision, Computer vision and image understanding, 100, 1-2, 64-106, (2005)
[16] Indiveri, G., Winner-take-all networks with lateral excitation, Analog integrated circuits and signal processing, 13, 1/2, 185-193, (1997)
[17] Intriligator, J.; Cavanagh, P., The spatial resolution of visual attention, Cognitive psychology, 43, 3, 171-216, (2001)
[18] Itti, L. (2000). Models of bottom-up and top-down visual attention. Ph.D. thesis. California Institute of Technology
[19] Itti, L.; Koch, C., A saliency-based search mechanism for overt and covert shifts of visual attention, Vision research, 40, 10-12, 1489-1506, (2000)
[20] Itti, L.; Koch, C., Feature combination strategies for saliency-based visual attention systems, Journal of electronic imaging, 10, 1, 161-169, (2001)
[21] Itti, L.; Koch, C.; Niebur, E., A model of saliency-based visual attention for rapid scene analysis, IEEE transactions on pattern analysis and machine intelligence, 20, 11, 1254-1259, (1998)
[22] Jordan, H.; Tipper, S.P., Spread of inhibition across an object’s surface, British journal of psychology, 90, 4, 495-507, (1999)
[23] Kahneman, D.; Treisman, A., Changing views of attention and automaticity, (), 29-61
[24] Kastner, S.; De Weerd, P.; Desimone, R.; Ungerleider, L.G., Mechanisms of directed attention in the human extrastriate cortex as revealed by functional MRI, Science, 282, 5386, 108-111, (1998)
[25] Koch, C.; Ullman, S., Shifts in selective visual-attention — towards the underlying neural circuitry, Human neurobiology, 4, 219-227, (1985)
[26] Liu, S.-C., A normalizing avlsi network with controllable winner-take-all properties, Analog integrated circuits and signal processing, 31, 1, 47-53, (2002)
[27] Logothetis, N.K.; Pauls, J.; Bülthoff, H.H.; Poggio, T., View-dependent object recognition by monkeys, Current biology, 4, 5, 401-414, (1994)
[28] Lowe, D.G., Distinctive image features from scale-invariant keypoints, International journal of computer vision, 60, 2, 91-110, (2004)
[29] Luck, S.J.; Chelazzi, L.; Hillyard, S.A.; Desimone, R., Neural mechanisms of spatial selective attention in areas V1, V2, and V4 of macaque visual cortex, Journal of neurophysiology, 77, 1, 24-42, (1997)
[30] Martin, D.; Fowlkes, C.; Malik, J., Learning to detect natural image boundaries using local brightness, color and texture cues, IEEE transactions on pattern analysis and machine intelligence, 26, 5, 530-549, (2004)
[31] McAdams, C.J.; Maunsell, J.H.R., Attention to both space and feature modulates neuronal responses in macaque area V4, Journal of neurophysiology, 83, 3, 1751-1755, (2000)
[32] McAdams, C.J.; Reid, R.C., Attention modulates the responses of simple cells in monkey primary visual cortex, Journal of neuroscience, 25, 47, 11023-11033, (2005)
[33] Mitchell, J.F.; Stoner, G.R.; Fallah, M.; Reynolds, J.H., Attentional selection of superimposed surfaces cannot be explained by modulation of the gain of color channels, Vision research, 43, 12, 1323-1328, (2003)
[34] Moore, C.M.; Yantis, S.; Vaughan, B., Object-based visual selection: evidence from perceptual completion, Psychological science, 9, 2, 104-110, (1998)
[35] Moran, J.; Desimone, R., Selective attention gates visual processing in the extrastriate cortex, Science, 229, 4715, 782-784, (1985)
[36] Motter, B.C., Neural correlates of attentive selection for color or luminance in extrastriate area V4, Journal of neuroscience, 14, 4, 2178-2189, (1994)
[37] Navalpakkam, V.; Itti, L., Modeling the influence of task on attention, Vision research, 45, 2, 205-231, (2005)
[38] O’Connor, D.H.; Fukui, M.M.; Pinsk, M.A.; Kastner, S., Attention modulates responses in the human lateral geniculate nucleus, Nature neuroscience, 5, 11, 1203-1209, (2002)
[39] Olshausen, B.A.; Anderson, C.H.; Van Essen, D.C., A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information, Journal of neuroscience, 13, 11, 4700-4719, (1993)
[40] Posner, M.I., Orienting of attention, Quarterly journal of experimental psychology, 32, 1, 3-25, (1980)
[41] Rensink, R.A., Seeing, sensing, and scrutinizing, Vision research, 40, 10-12, 1469-1487, (2000)
[42] Rensink, R.A., The dynamic representation of scenes, Visual cognition, 7, 1/2/3, 17-42, (2000)
[43] Rensink, R.A.; Oregan, J.K.; Clark, J.J., To see or not to see: the need for attention to perceive changes in scenes, Psychological science, 8, 5, 368-373, (1997)
[44] Reynolds, J.H.; Pasternak, T.; Desimone, R., Attention increases sensitivity of V4 neurons, Neuron, 26, 3, 703-714, (2000)
[45] Reynolds, J.H.; Alborzian, S.; Stoner, G.R., Exogenously cued attention triggers competitive selection of surfaces, Vision research, 43, 1, 59-66, (2003)
[46] Riesenhuber, M.; Poggio, T., Are cortical models really bound by the “binding problem”?, Neuron, 24, 1, 87-93, (1999), 111-125
[47] Riesenhuber, M.; Poggio, T., Hierarchical models of object recognition in cortex, Nature neuroscience, 2, 11, 1019-1025, (1999)
[48] Riesenhuber, M.; Poggio, T., How visual cortex recognizes objects: the tale of the standard model, (), 1640-1653
[49] Roelfsema, P.R.; Lamme, V.A.F.; Spekreijse, H., Object-based attention in the primary visual cortex of the macaque monkey, Nature, 395, 6700, 376-381, (1998)
[50] Rosen, E. (2003). Face representation in cortex: Studies using a simple and not so special model, CBCL Paper #228/AI Memo #2003-010. Technical report. Massachusetts Institute of Technology. June
[51] Rosenfeld, A.; Pfaltz, J.L., Sequential operations in digital picture processing, Journal of the association for computing machinery, 13, 471-494, (1966) · Zbl 0143.41803
[52] Rutishauser, U., Walther, D., Koch, C., & Perona, P. (2004). Is attention useful for object recognition? In International conference on computer vision and pattern recognition: Vol. 2 (pp. 37-44)
[53] Rybak, I.A.; Gusakova, V.I.; Golovan, A.V.; Podladchikova, L.N.; Shevtsova, N.A., A model of attention-guided visual perception and recognition, Vision research, 38, 15-16, 2387-2400, (1998)
[54] Saarinen, J.; Julesz, B., The speed of attentional shifts in the visual field, Proceedings of the national Academy of sciences of the USA, 88, 5, 1812-1814, (1991)
[55] Sagi, D.; Julesz, B., Enhanced detection in the aperture of focal attention, Nature, 321, 693-695, (1986)
[56] Serences, J.T.; Yantis, S., Selective visual attention and perceptual coherence, Trends in cognitive sciences, 10, 1, 38-45, (2006)
[57] Serre, T.; Wolf, L.; Poggio, T., Object recognition with features inspired by visual cortex, (), 994-1000
[58] Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., & Poggio, T. (2006). Object recognition with cortex-like mechanisms. IEEE Transactions on Pattern Analysis and Machine Intelligence (in press)
[59] Shi, J.; Malik, J., Normalized cuts and image segmentation, IEEE transactions on pattern analysis and machine intelligence, 22, 8, 888-905, (2000)
[60] Shulman, G.L.; Wilson, J., Spatial frequency and selective attention to spatial location, Perception, 16, 1, 103-111, (1987)
[61] Simons, D.J.; Levin, D.T., Failure to detect changes to people during a real-world interaction, Psychonomic bulletin and review, 5, 4, 644-649, (1998)
[62] Spitzer, H.; Desimone, R.; Moran, J., Increased attention enhances both behavioral and neuronal performance, Science, 240, 4850, 338-340, (1988)
[63] Sun, Y.; Fisher, R., Object-based visual attention for computer vision, Artificial intelligence, 20, 11, 77-123, (2003) · Zbl 1082.68839
[64] Tipper, S.P.; Jordan, H.; Weaver, B., Scene-based and object-centered inhibition of return: evidence for dual orienting mechanisms, Perception and psychophysics, 61, 1, 50-60, (1999)
[65] Torralba, A., Modeling global scene factors in attention, Journal of the optical society of America A, 20, 7, 1407-1418, (2003)
[66] Treisman, A.M.; Gelade, G., A feature-integration theory of attention, Cognitive psychology, 12, 1, 97-136, (1980)
[67] Tsotsos, J.K., Analyzing vision at the complexity level, Behavioral and brain sciences, 13, 3, 423-445, (1990)
[68] Tsotsos, J.K.; Culhane, S.M.; Wai, W.Y.K.; Lai, Y.H.; Davis, N.; Nuflo, F., Modeling visual-attention via selective tuning, Artificial intelligence, 78, 507-545, (1995)
[69] Vetter, T., & Blanz, V. (1999). A morphable model for the synthesis of 3D faces. In SIGGRAPH (pp. 187-194)
[70] Walther, D. (2006). Appendix A.2. Color opponencies for bottom-up attention. In Interactions of visual attention and object recognition: Computational modeling, algorithms, and psychophysics. Ph.D. thesis (pp. 98-99). California Institute of Technology. http://resolver.caltech.edu/CaltechETD:etd-03072006-135433
[71] Walther, D.; Itti, L.; Riesenhuber, M.; Poggio, T.; Koch, C., Attentional selection for object recognition — a gentle way, (), 472-479 · Zbl 1033.68929
[72] Walther, D.; Rutishauser, U.; Koch, C.; Perona, P., Selective visual attention enables learning and recognition of multiple objects in cluttered scenes, Computer vision and image understanding, 100, 1-2, 41-63, (2005)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.