On data mining for software repositories

Dmitry Namiot

Vladimir Romanov

Resumen

The article discusses issues related to the use of data science and data mining methods for software repositories. The paper attempts to provide an overview of the technologies that are used in the analysis of programs and are based on static data that can be extracted directly from the code or the code repositories. The paper reviews papers using deep learning methods (recurrent neural networks), classification methods based on other machine learning models, and the use of clustering in software engineering. Practical applications of the methods under consideration include, for example, classification and prediction of errors, determining the characteristics of code change over time, searching for duplicate fragments, automatically detecting design errors, recommending code refactoring.

Acceso

PÁGINAS

pp. 18 - 23

NÚMERO

Volumen: 6 Número: 4 Parte: 0 (2018)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Water
Inteligencia Artificial
Infrastructures

Artículos similares

Vehicle Driving Safety of Underground Interchanges Using a Driving Simulator and Data Mining Analysis

Acceso

Zhen Liu, Qifeng Yang, Anlue Wang and Xingyu Gu

In the process of driving in an underground interchange, drivers are faced with many challenges, such as being in a closed space, visual changes alternating between light and dark conditions, complex road conditions in the confluence section, and dense s... ver más

Revista: Infrastructures

Anomaly Detection and Identification Method for Shield Tunneling Based on Energy Consumption Perspective

Acceso

Min Hu, Fan Zhang and Huiming Wu

Various abnormal scenarios might occur during the shield tunneling process, which have an impact on construction efficiency and safety. Existing research on shield tunneling construction anomaly detection typically designs models based on the characteris... ver más

Revista: Applied Sciences

Multi-Identity Recognition of Darknet Vendors Based on Metric Learning

Acceso

Yilei Wang, Yuelin Hu, Wenliang Xu and Futai Zou

Dark web vendor identification can be seen as an authorship aliasing problem, aiming to determine whether different accounts on different markets belong to the same real-world vendor, in order to locate cybercriminals involved in dark web market transact... ver más

Revista: Applied Sciences

Topic Extraction: BERTopic?s Insight into the 117th Congress?s Twitterverse

Acceso

Margarida Mendonça and Álvaro Figueira

As social media (SM) becomes increasingly prevalent, its impact on society is expected to grow accordingly. While SM has brought positive transformations, it has also amplified pre-existing issues such as misinformation, echo chambers, manipulation, and ... ver más

Revista: Informatics

Using ML to Predict User Satisfaction with ICT Technology for Educational Institution Administration

Acceso

Hamad Almaghrabi, Ben Soh and Alice Li

Effective and efficient use of information and communication technology (ICT) systems in the administration of educational organisations is crucial to optimise their performance. Earlier research on the identification and analysis of ICT users? satisfact... ver más

Revista: Information

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas