Tomasz Walczyna and Zbigniew Piotrowski
The proliferation of "Deep fake" technologies, particularly those facilitating face-swapping in images or videos, poses significant challenges and opportunities in digital media manipulation. Despite considerable advancements, existing methodologies ofte...
Jiajia Peng and Tianbing Tang
Image captioning, also recognized as the challenge of transforming visual data into coherent natural language descriptions, has persisted as a complex problem. Traditional approaches often suffer from semantic gaps, wherein the generated textual descript...
Ioannis Karampinis, Lazaros Iliadis and Athanasios Karabinis
Structures inevitably suffer damage after an earthquake, with severity ranging from minimal damage of nonstructural elements to partial or even total collapse, possibly with loss of human lives. Thus, it is essential for engineers to understand the cruci...
Ulzhan Bissarinova, Aidana Tleuken, Sofiya Alimukhambetova, Huseyin Atakan Varol and Ferhat Karaca
This paper introduces a deep learning (DL) tool capable of classifying cities and revealing the features that characterize each city from a visual perspective. The study utilizes city view data captured from satellites and employs a methodology involving...
Hongye Liu and Xiai Chen
Person re-identification aims to identify the same pedestrians captured by various cameras from different viewpoints in multiple scenarios. Occlusion is the toughest problem for practical applications. In video-based ReID tasks, motion information can be...
Ning Li, Tianrun Ye, Zhihua Zhou, Chunming Gao and Ping Zhang
In the domain of automatic visual inspection for miniature capacitor quality control, the task of accurately detecting defects presents a formidable challenge. This challenge stems primarily from the small size and limited sample availability of defectiv...
Shancheng Tang, Ying Zhang, Zicheng Jin, Jianhui Lu, Heng Li and Jiqing Yang
The number of defect samples on the surface of aluminum profiles is small, and the distribution of abnormal visual features is dispersed, such that the existing supervised detection methods cannot effectively detect undefined defects. At the same time, t...
Qiuyue Li, Hao Sheng, Mingxue Sheng and Honglin Wan
Efficient document recognition and sharing remain challenges in the healthcare, insurance, and finance sectors. One solution to this problem has been the use of deep learning techniques to automatically extract structured information from paper documents...
Yuhuan Wu and Yonghong Wu
Salient object detection (SOD) aims to identify the most visually striking objects in a scene, simulating the function of the biological visual attention system. The attention mechanism in deep learning is commonly used as an enhancement strategy which e...
Rushi Li and Mincheng Wu
Urban color, primarily emanating from building façades and roofs, plays a pivotal role in shaping a city's image and influencing people's overall impression. Understanding the nuances of color patterns contributes significantly to unraveling the uniquene...