Sen Du, Hong Zhu, Guangfeng Lin, Dong Wang and Jing Shi
Automatically describing the content of an image is a challenging task at the intersection of natural language processing and computer vision. Current image captioning models describe objects that appear frequently in the training set very well, but...