Inicio  /  Informatics  /  Vol: 10 Par: 1 (2023)  /  Artículo
ARTÍCULO
TITULO

SMPT: A Semi-Supervised Multi-Model Prediction Technique for Food Ingredient Named Entity Recognition (FINER) Dataset Construction

Kokoy Siti Komariah    
Ariana Tulus Purnomo    
Ardianto Satriawan    
Muhammad Ogin Hasanuddin    
Casi Setianingsih and Bong-Kee Sin    

Resumen

To pursue a healthy lifestyle, people are increasingly concerned about their food ingredients. Recently, it has become a common practice to use an online recipe to select the ingredients that match an individual?s meal plan and healthy diet preference. The information from online recipes can be extracted and used to develop various food-related applications. Named entity recognition (NER) is often used to extract such information. However, the problem in building an NER system lies in the massive amount of data needed to train the classifier, especially on a specific domain, such as food. There are food NER datasets available, but they are still quite limited. Thus, we proposed an iterative self-training approach called semi-supervised multi-model prediction technique (SMPT) to construct a food ingredient NER dataset. SMPT is a deep ensemble learning model that employs the concept of self-training and uses multiple pre-trained language models in the iterative data labeling process, with a voting mechanism used as the final decision to determine the entity?s label. Utilizing the SMPT, we have created a new annotated dataset of ingredient entities obtained from the Allrecipes website named FINER. Finally, this study aims to use the FINER dataset as an alternative resource to support food computing research and development.

 Artículos similares

       
 
Emin Mercan    
This study aimed to determine the effects of the ozone treatment of film-forming solutions (FFSs) containing whey protein concentrate (WPC) and gelatine on biopolymer films? physical, mechanical, and thermal properties. Film samples were produced from a ... ver más
Revista: Applied Sciences

 
Yongzhen Zhang, Yanbo Hui, Ying Zhou, Juanjuan Liu, Ju Gao, Xiaoliang Wang, Baiwei Wang, Mengqi Xie and Haonan Hou    
Moldy corn produces aflatoxin and gibberellin, which can have adverse effects on human health if consumed. Mold is a significant factor that affects the safe storage of corn. If not detected and controlled in a timely manner, it will result in substantia... ver más
Revista: Applied Sciences

 
Nuno Marques de Almeida and Adolfo Crespo    
The frequency and severity of natural or human-induced disaster events, such as floods, earthquakes, hurricanes, fires, pandemics, hazardous material spills, groundwater contamination, structural failures, explosions, etc., as well as their impacts, have... ver más
Revista: Applied Sciences

 
Giulia Polizzi, Loriana Casalino, Marika Di Paolo, Alma Sardo, Valeria Vuoso, Carlos Manuel Franco and Raffaele Marrone    
The selection of starter cultures with different technological profiles and suitable microclimatic conditions is among the main tools used to improve the technological quality and safety of dry-cured salami. The aim of this study is to evaluate the effec... ver más
Revista: Applied Sciences

 
Francisca Espincho, Rúben Pereira, Sabrina M. Rodrigues, Diogo M. Silva, C. Marisa R. Almeida and Sandra Ramos    
The present work aims to evaluate the MP contamination of zooplankton and its impact on MP trophic transfers at the lower levels of the food web in a field study. During 1 year, seasonal surveys were conducted to collect zooplankton and water samples fro... ver más
Revista: Water