Robust Object Recognition with Cortex-like Mechanisms
Overview
Affiliations
We introduce a new general framework for the recognition of complex visual scenes, which is motivated by biology: We describe a hierarchical system that closely follows the organization of visual cortex and builds an increasingly complex and invariant feature representation by alternating between a template matching and a maximum pooling operation. We demonstrate the strength of the approach on a range of recognition tasks: From invariant single object recognition in clutter to multiclass categorization problems and complex scene understanding tasks that rely on the recognition of both shape-based as well as texture-based objects. Given the biological constraints that the system had to satisfy, the approach performs surprisingly well: It has the capability of learning from only a few training examples and competes with state-of-the-art systems. We also discuss the existence of a universal, redundant dictionary of features that could handle the recognition of most object categories. In addition to its relevance for computer vision, the success of this approach suggests a plausibility proof for a class of feedforward models of object recognition in cortex.
Unraveling the complexity of rat object vision requires a full convolutional network and beyond.
Muratore P, Alemi A, Zoccolan D Patterns (N Y). 2025; 6(2):101149.
PMID: 40041851 PMC: 11873012. DOI: 10.1016/j.patter.2024.101149.
RTify: Aligning Deep Neural Networks with Human Behavioral Decisions.
Cheng Y, Rodriguez I, Chen S, Kar K, Watanabe T, Serre T ArXiv. 2025; .
PMID: 39764401 PMC: 11703321.
Waschke L, Kamp F, van den Elzen E, Krishna S, Lindenberger U, Rutishauser U Nat Commun. 2025; 16(1):236.
PMID: 39747026 PMC: 11696175. DOI: 10.1038/s41467-024-55406-4.
Luthra S, Razin R, Tierney A, Holt L, Dick F bioRxiv. 2024; .
PMID: 39386708 PMC: 11463673. DOI: 10.1101/2024.09.21.614258.
Miyagi Y, Habara T, Hirata R, Hayashi N Reprod Med Biol. 2024; 23(1):e12612.
PMID: 39351129 PMC: 11442056. DOI: 10.1002/rmb2.12612.