|
|
|
Mohamad Abou Ali, Fadi Dornaika and Ignacio Arganda-Carreras
Deep learning (DL) has made significant advances in computer vision with the advent of vision transformers (ViTs). Unlike convolutional neural networks (CNNs), ViTs use self-attention to extract both local and global features from image data, and then ap...
ver más
|
|
|
|
|
|
|
Qian Zhou, Hua Zou and Huanhuan Wu
Vision Transformers (ViTs) have shown their superiority in various visual tasks for the capability of self-attention mechanisms to model long-range dependencies. Some recent works try to reduce the high cost of vision transformers by limiting the self-at...
ver más
|
|
|
|
|
|
|
A. Vits, D. Weitzenbürger, H. Hamann, and O. Distl
Pág. 1511 -
|
|
|
|