|
|
|
Huy Manh Nguyen, Tomo Miyazaki, Yoshihiro Sugaya and Shinichiro Omachi
Visual-semantic embedding aims to learn a joint embedding space where related video and sentence instances are located close to each other. Most existing methods put instances in a single embedding space. However, they struggle to embed instances due to ...
ver más
|
|
|