|
|
|
Shichen Lu, Ruimin Hu, Jing Liu, Longteng Guo and Fei Zheng
In the task of image captioning, learning the attentive image regions is necessary to adaptively and precisely focus on the object semantics relevant to each decoded word. In this paper, we propose a convolutional attention module that can preserve the s...
ver más
|
|
|