METHOD OF DEVELOPING ARTIFICIAL PATTERNS TO ANALYZING SEMANTIC SIMILARITY OF DOCUMENTS
Keywords:
semantic similarity of documents, latent feature.Abstract
A technique for creating artificial patterns using data from textual documents is considered. There is a need for patterns to reduce the dimensionality of the feature space and analyse the semantic similarity of thematic documents. The composition of patterns is formed according to the rules of hierarchical agglomerative grouping. The formation process is based on the selection of a latent feature space using the technology for calculating generalized assessments of objects.
References
Ignatyev N. A. Structure Choice for Relations between Objects in Metric Classification Algorithms // Pattern Recognition and Image Analysis. 2018. V. 28. № 4. P. 590–597.
N. A. Ignatev, U. Y. Tuliyev, “Semantic structuring of text documents based on patterns of natural language entities”, Computer Research and Modeling, 14:5 (2022), 1185–1197 http://doi.org/10.20537/2076-7633-2022-14-5-1185-1197
N. Abdurakhmonova, U. Tuliyev and A. Gatiatullin, "Linguistic functionality of Uzbek Electron Corpus: uzbekcorpus.uz," 2021 International Conference on Information Science and Communications Technologies (ICISCT), Tashkent, Uzbekistan, 2021, pp. 1-4, http://doi.org/10.1109/ICISCT52966.2021.9670043
Tuliyev U. (2021). Space formation for the description of thematic documents. AIP Conference Proceedings. 2365. 070007. http://doi.org/10.1063/5.0056963
Тулиев У. Ю. Кластерный анализ текстовых документов по отношению их связности // Проблемы вычислительной и прикладной математики. — 2019, No 6(24). — С. 102–109.
N.A. Ignatyev, Sh.F.Madrakhimov, D.Y.Saidov. Stability of object classes and selection of the latent features // International journal of engineering technology and sciences, 2017, Malaysia, Vol. 7, pp. 1-10.
Игнатьев Н.А., Саидов Д.Ю. Анализ данных и принятие решений с помощью логических закономерностей в форме полуплоскостей // Известия СамНЦ, 2017, Том 19, № 4(2), С. 294-300.