Redirigiendo al acceso original de articulo en 20 segundos...
ARTÍCULO
TITULO

An Empirical Comparison of Portuguese and Multilingual BERT Models for Auto-Classification of NCM Codes in International Trade

Roberta Rodrigues de Lima    
Anita M. R. Fernandes    
James Roberto Bombasar    
Bruno Alves da Silva    
Paul Crocker and Valderi Reis Quietinho Leithardt    

Resumen

Classification problems are common activities in many different domains and supervised learning algorithms have shown great promise in these areas. The classification of goods in international trade in Brazil represents a real challenge due to the complexity involved in assigning the correct category codes to a good, especially considering the tax penalties and legal implications of a misclassification. This work focuses on the training process of a classifier based on bidirectional encoder representations from transformers (BERT) for tax classification of goods with MCN codes which are the official classification system for import and export products in Brazil. In particular, this article presents results from using a specific Portuguese-language-pretrained BERT model, as well as results from using a multilingual-pretrained BERT model. Experimental results show that Portuguese model had a slightly better performance than the multilingual model, achieving an MCC 0.8491, and confirms that the classifiers could be used to improve specialists? performance in the classification of goods.

 Artículos similares

       
 
Lan Wang, Mingjiang Xie, Min Pan, Feng He, Bing Yang, Zhigang Gong, Xuke Wu, Mingsheng Shang and Kun Shan    
Harmful algal blooms (HABs) have been deteriorating global water bodies, and the accurate prediction of algal dynamics using the modelling method is a challenging research area. High-frequency monitoring and deep learning technology have opened up new ho... ver más
Revista: Water

 
Jianting Zhu    
A method was developed to integrate the truncated power-law distribution of solid volumetric fraction into the widely used Kozeny?Carman (KC)-type equations to assess the potential uncertainty of permeability. The focus was on the heterogeneity of porosi... ver más
Revista: Hydrology

 
Luigi Calabrese, Massimiliano Galeano and Edoardo Proverbio    
In this paper, time/frequency domain data processing was proposed to analyse the EN signal recorded during stress corrosion cracking on precipitation-hardening martensitic stainless steel in a chloride environment. Continuous Wavelet Transform, albeit wi... ver más

 
Katherine Ho and Rebecca Loraamm    
Animal movements are realizations of complex spatiotemporal processes. Central to these processes are the varied environmental contexts in which animals move, which fundamentally impact the movement trajectories of individuals at fine spatial and tempora... ver más

 
Xiangrui Xiong, Yanhui Wang, Cheng Ma and Yuwei Chi    
The Large Machine Factory (LMF) was built in the complex historical context of the late Qing Dynasty (1840?1912). Its space and construction faithfully record the architectural and cultural fusion between Chinese and western traditions and mark the begin... ver más
Revista: Buildings