Applied Sciences, Vol. 11, Issue 24 (2021)
ARTICLE
TITLE

Anatomy of a Data Science Software Toolkit That Uses Machine Learning to Aid "Bench-to-Bedside" Medical Research: With Essential Concepts of Data Mining and Analysis Explained

László Beinrohr    
Eszter Kail    
Péter Piros    
Erzsébet Tóth    
Rita Fleiner and Krasimir Kolev    

Abstract

Data science and machine learning are buzzwords of the early 21st century. Now that they pervade human civilization, how do these concepts translate to use by researchers and clinicians in the life-science and medical fields? Here, we describe a software toolkit that is just large enough in scale to be maintained and extended by a small team and that is optimised for problems arising in small and medium-sized laboratories. In particular, a single person may manage the system from data ingestion, through statistics and preparation, to predictions. At the system's core is a graph-type database, which keeps it flexible when handling the irregular, constantly changing data types that are common during explorative research. At the system's outermost shell, the concept of "user stories" is introduced to help end-user researchers perform tasks separated by their expertise: these range from simple data input and data curation, through statistics, to predictions via machine learning algorithms. We compiled a sizable list of existing, modular Python libraries for data analysis that may serve as a reference in the field and may be incorporated into this software. We also provide an insight into basic concepts, such as labelled vs. unlabelled data, supervised vs. unsupervised learning, regression vs. classification, and evaluation by different error metrics, as well as the more advanced concept of cross-validation. Finally, we show some examples from our laboratory using blood sample and blood clot data from thrombosis patients (sufferers of stroke, heart and peripheral thrombosis disease), and we show how such tools can help to set realistic expectations and reveal caveats.
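
The abstract names regression, error metrics and cross-validation without showing them in action. The following is a minimal sketch of how these three concepts fit together, not the authors' toolkit: it fits a one-feature least-squares line and scores it by mean absolute error (MAE) and root mean squared error (RMSE) on each held-out fold. The function names (`fit_line`, `k_fold_indices`, `cross_validate`) and the synthetic data are assumptions made purely for illustration, and only the Python standard library is used.

```python
import math
import random


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept (one feature)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def k_fold_indices(n, k, seed=0):
    """Shuffle the indices 0..n-1 and deal them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cross_validate(xs, ys, k=5, seed=0):
    """Hold out each fold once; return a list of (MAE, RMSE) per fold."""
    scores = []
    for held_out in k_fold_indices(len(xs), k, seed):
        held = set(held_out)
        train = [i for i in range(len(xs)) if i not in held]
        slope, intercept = fit_line([xs[i] for i in train],
                                    [ys[i] for i in train])
        # Errors are measured only on samples the model never saw.
        errors = [ys[i] - (slope * xs[i] + intercept) for i in held_out]
        mae = sum(abs(e) for e in errors) / len(errors)
        rmse = math.sqrt(sum(e * e for e in errors) / len(errors))
        scores.append((mae, rmse))
    return scores


# Synthetic, made-up data (not from the article): y = 2x + 1 plus noise.
rng = random.Random(42)
xs = [i / 2 for i in range(40)]
ys = [2 * x + 1 + rng.gauss(0, 0.5) for x in xs]

scores = cross_validate(xs, ys, k=5)
for fold, (mae, rmse) in enumerate(scores, start=1):
    print(f"fold {fold}: MAE={mae:.3f} RMSE={rmse:.3f}")
```

Reporting the spread of scores across folds, rather than a single number from one train/test split, is one way such a procedure can help set the realistic expectations the abstract mentions.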

Similar articles

Lewis Urquhart, Andrew Wodehouse, Brian Loudon and Craig Fingland    
Algorithmic design harnesses the power of computation to generate a form based on input data and rules. In the product design setting, a major advantage afforded by this approach is the ability to automate the customization of design variations in accord...
Journal: Applied Sciences

 
Anuja Arora, Ambikesh Jayal, Mayank Gupta, Prakhar Mittal and Suresh Chandra Satapathy    
Brain tumor segmentation seeks to separate healthy tissue from tumorous regions. This is an essential step in diagnosis and treatment planning to maximize the likelihood of successful treatment. Magnetic resonance imaging (MRI) provides detailed informat...
Journal: Computers

 
Hossam El-Din Hassanien and Ahmed Elragal    
Transforming the state-of-the-art definition and anatomy of enterprise systems (ESs) seems to some academics and practitioners to be an unavoidable destiny. Value depletion led by early retirement and/or replacement of ESs solutions has been a constant thr...
Journal: Informatics

 
Xin Chen, Hong Zhao and Ping Zhou    
In anatomy, the lung can be divided by lung fissures into several pulmonary lobe units with specific functions. Identifying the lung lobes and the distribution of various diseases among different lung lobes from CT images is important for disease diagnos...
Journal: Algorithms

 
Khaja Moiduddin, Syed Hammad Mian, Wadea Ameen, Mohammed Alkindi, Sundar Ramalingam and Osama Alghamdi    
Mandibular reconstruction is a complicated task because of the complex nature of the regional anatomy. Computer-assisted tools are a promising means of improving the precision and safety of such complex surgeries. The digital techniques utilized in the r...
Journal: Applied Sciences