Narrowing Attention in Capsule Networks

Thiele, Sebastian; Risse, Benjamin

Forschungsartikel in Sammelband (Konferenz) | Peer reviewed

Zusammenfassung

Despite their recent success capsule networks (CapsNets) are still very computationally intensive and fail to achieve state-of-the-art performances on advanced datasets. As a consequence CapsNets are usually combined with additional conventional feature extraction layers to solve complex tasks. Based on the hypothesis that more efficient and distinct routing can alleviate these drawbacks, we propose a novel CapsNet algorithm, which utilises narrowed attention to determine the coupling coefficients between lower and higher level capsules. In particular, we employ tiny subnetworks with sigmoid activation functions to enforce concise routing decisions, thus reducing the tendency of CapsNets to explain the entire image rather than focusing on the essential information for a given task. This non-iterative routing strategy is computationally fast and memory efficient, results in interpretable coupling decisions and can be easily integrated into existing models due to its strong alignment with capsule theory. In addition, these solely capsule-based models are robust to a wide range of image transformations, have stable convergence characteristics and can be further improved by capsule-specific yet straightforward applications of dropout and batch normalisation. In a series of experiments, we demonstrate that narrowed attention routing enables the training of deep capsule networks without the need for additional feature extraction layers, while outperforming existing CapsNet architectures on a variety of well-known benchmark datasets.

Details zur Publikation

Herausgeber*innenIEEE
Buchtitel26th International Conference on Pattern Recognition
Seitenbereich2679-2685
VerlagWiley-IEEE Press
Erscheinungsort26th International Conference on Pattern Recognition (ICPR)
StatusVeröffentlicht
Veröffentlichungsjahr2022
Sprache, in der die Publikation verfasst istEnglisch
KonferenzInternational Conference on Pattern Recognition (ICPR2022), Montréal Quèbec, Kanada
StichwörterDeep Learning; Artificial Intelligence; Computer Vision

Autor*innen der Universität Münster

Risse, Benjamin
Professur für Geoinformatics for Sustainable Development (Prof. Risse)
Thiele, Sebastian
Institut für Informatik