Inicio  /  Algorithms  /  Vol: 15 Par: 8 (2022)  /  Artículo
ARTÍCULO
TITULO

Building a Technology Recommender System Using Web Crawling and Natural Language Processing Technology

Nathalie Campos Macias    
Wilhelm Düggelin    
Yesim Ruf and Thomas Hanne    

Resumen

Finding, retrieving, and processing information on technology from the Internet can be a tedious task. This article investigates if technological concepts such as web crawling and natural language processing are suitable means for knowledge discovery from unstructured information and the development of a technology recommender system by developing a prototype of such a system. It also analyzes how well the resulting prototype performs in regard to effectivity and efficiency. The research strategy based on design science research consists of four stages: (1) Awareness generation; (2) suggestion of a solution considering the information retrieval process; (3) development of an artefact in the form of a Python computer program; and (4) evaluation of the prototype within the scope of a comparative experiment. The evaluation yields that the prototype is highly efficient in retrieving basic and rather random extractive text summaries from websites that include the desired search terms. However, the effectivity, measured by the quality of results is unsatisfactory due to the aforementioned random arrangement of extracted sentences within the resulting summaries. It is found that natural language processing and web crawling are indeed suitable technologies for such a program whilst the use of additional technology/concepts would add significant value for a potential user. Several areas for incremental improvement of the prototype are identified.

 Artículos similares

       
 
Hongguo Ren, Yujun Wang, Jing Zhang, Ziming Zheng and Qingqin Wang    
As the quality of life and the spiritual and cultural well-being of the inhabitants progress, the current rural infrastructure has challenges in adequately addressing the physical and psychological requirements of individuals. This work presents a method... ver más
Revista: Applied Sciences

 
Fenfang Li, Zhengzhang Zhao, Li Wang and Han Deng    
Sentence Boundary Disambiguation (SBD) is crucial for building datasets for tasks such as machine translation, syntactic analysis, and semantic analysis. Currently, most automatic sentence segmentation in Tibetan adopts the methods of rule-based and stat... ver más
Revista: Applied Sciences

 
Fajia Zheng, Bin Zhang, Yuqiong Zhao, Jiakun Li, Fei Long and Qibo Feng    
Key errors of machine tools have a significant impact on their accuracy, however accurately and quickly measuring the geometric errors of machine tools is essential for key error identification. Fortunately, a quick and direct laser measurement method an... ver más
Revista: Applied Sciences

 
Xiaojun Zhang, Zhuo Li, Zheng Wei and Wenxue Gao    
Blasting technology is widely applied in various engineering applications due to its cost-effectiveness and high efficiency, such as in mining, transport infrastructure construction, and building demolition. However, the occurrence of cracking in the rea... ver más
Revista: Applied Sciences

 
Qiankun Wang, Ke Zhu, Peiwen Guo, Jiaji Zhang and Zhihua Xiong    
Faced with the challenges of global climate change, zero-carbon buildings (ZCB) serve as a crucial means to achieve carbon peak and carbon neutrality goals, particularly in the development of tropical island regions. This study aims to establish a ZCB te... ver más
Revista: Applied Sciences