|
|
|
Meng Li, Jiqiang Liu and Yeping Yang
Data governance is an extremely important protection and management measure throughout the entire life cycle of data. However, there are still data governance issues, such as data security risks, data privacy breaches, and difficulties in data management...
ver más
|
|
|
|
|
|
|
Li Tan, Jiayi Jiang, Meng Guo and Yujia Zhong
Land use types other than specialized athletic fields provide a variety of jogging environments, addressing the shortage of urban fitness facilities and promoting urban health as well as sustainability. Currently, there is limited research comparing the ...
ver más
|
|
|
|
|
|
|
Shiqian Guo, Yansun Huang, Baohua Huang, Linda Yang and Cong Zhou
This paper proposed a method for improving the XLNet model to address the shortcomings of segmentation algorithm for processing Chinese language, such as long sub-word lengths, long word lists and incomplete word list coverage. To address these issues, w...
ver más
|
|
|
|
|
|
|
Zepeng Wang, Yuan Chen and Juwei Zhang
In practical applications, the accuracy of domain terminology translation is an important criterion for the performance evaluation of domain machine translation models. Aiming at the problem of phrase mismatch and improper translation caused by word-by-w...
ver más
|
|
|
|
|
|
|
Sardar Parhat, Mutallip Sattar, Askar Hamdulla and Abdurahman Kadir
In this study, based on a morpheme segmentation framework, we researched a text keyword extraction method for Uyghur, Kazakh and Kirghiz languages, which have similar grammatical and lexical structures. In these languages, affixes and a stem are joined t...
ver más
|
|
|
|
|
|
|
Xiaohui Cui, Yu Yang, Dongmei Li, Xiaolong Qu, Lei Yao, Sisi Luo and Chao Song
Recently, researchers have extensively explored various methods for electronic medical record named entity recognition, including character-based, word-based, and hybrid methods. Nonetheless, these methods frequently disregard the semantic context of ent...
ver más
|
|
|
|
|
|
|
Yu Tong, Weiming Tan, Jingzhi Guo, Bingqing Shen, Peng Qin and Shuaihe Zhuo
In the last decade, blockchain smart contracts emerged as an automated, decentralized, traceable, and immutable medium of value exchange. Nevertheless, existing blockchain smart contracts are not compatible with legal contracts. The automatic execution o...
ver más
|
|
|
|
|
|
|
Guizhe Song, Degen Huang and Zhifeng Xiao
Multilingual characteristics, lack of annotated data, and imbalanced sample distribution are the three main challenges for toxic comment analysis in a multilingual setting. This paper proposes a multilingual toxic text classifier which adopts a novel fus...
ver más
|
|
|
|
|
|
|
Arda Tezcan, Bram Bulté and Bram Vanroy
We identify a number of aspects that can boost the performance of Neural Fuzzy Repair (NFR), an easy-to-implement method to integrate translation memory matches and neural machine translation (NMT). We explore various ways of maximising the added value o...
ver más
|
|
|
|
|
|
|
Xi Kuai, Renzhong Guo, Zhijun Zhang, Biao He, Zhigang Zhao and Han Guo
Georeferencing by place names (known as toponyms) is the most common way of associating textual information with geographic locations. While computers use numeric coordinates (such as longitude-latitude pairs) to represent places, people generally refer ...
ver más
|
|
|
|