Inicio  /  Applied Sciences  /  Vol: 12 Par: 3 (2022)  /  Artículo
ARTÍCULO
TITULO

Comparing Deep Learning and Shallow Learning Techniques for API Calls Malware Prediction: A Study

Angelo Cannarile    
Vincenzo Dentamaro    
Stefano Galantucci    
Andrea Iannacone    
Donato Impedovo and Giuseppe Pirlo    

Resumen

Recognition of malware is critical in cybersecurity as it allows for avoiding execution and the downloading of malware. One of the possible approaches is to analyze the executable?s Application Programming Interface (API) calls, which can be done using tools that work in sandboxes, such as Cuckoo or CAPEv2. This chain of calls can then be used to classify if the considered file is benign or malware. This work aims to compare six modern shallow learning and deep learning techniques based on tabular data, using two datasets of API calls containing malware and goodware, where the corresponding chain of API calls is expressed for each instance. The results show the quality of shallow learning approaches based on tree ensembles, such as CatBoost, both in terms of F1-macro score and Area Under the ROC curve (AUC ROC), and training time, making them optimal for making inferences on Edge AI solutions. The results are then analyzed with the explainable AI SHAP technique, identifying the API calls that most influence the process, i.e., those that are particularly afferent to malware and goodware.

 Artículos similares

       
 
Myoung-Su Choi, Dong-Hun Han, Jun-Woo Choi and Min-Soo Kang    
Sleep apnea has emerged as a significant health issue in modern society, with self-diagnosis and effective management becoming increasingly important. Among the most renowned methods for self-diagnosis, the STOP-BANG questionnaire is widely recognized as... ver más
Revista: Applied Sciences

 
Atefe Sedaghat, Homayoon Arbabkhah, Masood Jafari Kang and Maryam Hamidi    
This research introduces an online system for monitoring maritime traffic, aimed at tracking vessels in water routes and predicting their subsequent locations in real time. The proposed framework utilizes an Extract, Transform, and Load (ETL) pipeline to... ver más

 
Olivier Pantalé    
Finite element (FE) simulations have been effective in simulating thermomechanical forming processes, yet challenges arise when applying them to new materials due to nonlinear behaviors. To address this, machine learning techniques and artificial neural ... ver más
Revista: Algorithms

 
Szabolcs Deák, Paul Levine, Joseph Pearlman and Bo Yang    
We construct a New Keynesian (NK) behavioural macroeconomic model with bounded-rationality (BR) and heterogeneous agents. We solve and simulate the model using a third-order approximation for a given policy and evaluate its properties using this solution... ver más
Revista: Algorithms

 
William Villegas-Ch and Jaime Govea    
This article addresses the need for early emergency detection and safety monitoring in public spaces using deep learning techniques. The problem of discerning relevant sound events in urban environments is identified, which is essential to respond quickl... ver más