REVISTA
Information

TODAS

Redirigiendo al acceso original de articulo en 17 segundos...

Inicio / Information / Vol: 12 Par: 8 (2021) / Artículo

ARTÍCULO

TITULO

A Study of Analogical Density in Various Corpora at Various Granularity

Rashel Fam and Yves Lepage

Resumen

In this paper, we inspect the theoretical problem of counting the number of analogies between sentences contained in a text. Based on this, we measure the analogical density of the text. We focus on analogy at the sentence level, based on the level of form rather than on the level of semantics. Experiments are carried on two different corpora in six European languages known to have various levels of morphological richness. Corpora are tokenised using several tokenisation schemes: character, sub-word and word. For the sub-word tokenisation scheme, we employ two popular sub-word models: unigram language model and byte-pair-encoding. The results show that the corpus with a higher Type-Token Ratio tends to have higher analogical density. We also observe that masking the tokens based on their frequency helps to increase the analogical density. As for the tokenisation scheme, the results show that analogical density decreases from the character to word. However, this is not true when tokens are masked based on their frequencies. We find that tokenising the sentences using sub-word models and masking the least frequent tokens increase analogical density.

Palabras claves

proportional analogy - language productivity - automatic acquisition

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 12 Parte: 8 (2021)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Water
Journal of Science and Applicative Technology
Aceh International Journal of Science and Technology

DOI

https://doi.org/10.3390/info12080314

Artículos similares

Optimal Allocation Model of Water Resources Based on the Prospect Theory

Acceso

Huaxiang He, Aiqi Chen, Mingwan Yin, Zhenzhen Ma, Jinjun You, Xinmin Xie, Zhizhang Wang and Qiang An

The rational allocation of water resources in the basin/region can be better assisted and performed using a suitable water resources allocation model. Rule-based and optimization-based simulation methods are utilized to solve medium- and long-term water ... ver más

Revista: Water

A graph theory approach to the dormitory room placement problem

Acceso

Sri Efrinita Irwan, Triyana Muliawati Pág. 111 - 118

One of the important areas in mathematics is graph theory. A graph is a mathematical structure used to model pairwise relations between objects. The theory of graph can be applied in various problems. The purpose of this paper is to solve the dormitory r... ver más

Revista: Journal of Science and Applicative Technology

Behaviour Analysis of Strengthened-RC Beam with Natural Fiber Reinforced Polymer (NFRP) based on Abaca Fiber by Using Finite Element Method

Acceso

Taufiq Saidi,Taufiq Saidi,Muttaqin Hasan,Muttaqin Hasan,Zahra Amalia,Muhammad Iqbal,Muhammad Iqbal Pág. 155 - 164

The use of synthetic Fiber Reinforced Polymer (FRP) as a composite material is an alternative material that has been widely used for strengthening and repairing reinforced concrete structures. However, the high price is one of the obstacles in applying s... ver más

Revista: Aceh International Journal of Science and Technology

Antioxidant capacity of phytic acid purified from rice bran - doi: 10.4025/actascitechnol.v34i4.16358

Acceso

Cristiane Canan, Fernanda Delaroza, Rúbia Casagrande, Marcela Maria Baracat, Massami Shimokomaki, Elza Iouko Ida (Author) Pág. 457 - 463

Rice bran is a by-product of rice processing industry, with high levels of phytic acid or phytate. Considering phytic acid antioxidant activity, its various applications and its high concentration in rice bran, this study had the objective of evaluating ... ver más

Revista: Acta Scientiarum: Technology

Research on Ship Resistance Prediction Using Machine Learning with Different Samples

Acceso

Yunfei Yang, Zhicheng Zhang, Jiapeng Zhao, Bin Zhang, Lei Zhang, Qi Hu and Jianglong Sun

Resistance serves as a critical performance metric for ships. Swift and accurate resistance prediction can enhance ship design efficiency. Currently, methods for determining ship resistance encompass model tests, estimation techniques, and computational ... ver más

Revista: Journal of Marine Science and Engineering

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas