Research on Letter and Word Frequency and Mathematical Modeling of Frequency Distributions in the Modern Bulgarian Language


Георгиева-Трифонова, Цветанка (2014) Research on Letter and Word Frequency and Mathematical Modeling of Frequency Distributions in the Modern Bulgarian Language Contemporary Advancements in Information Technology Development in Dynamic Environments, Book chapter, IGI Global Publishing, 2014, pp.111-139, ISBN: 978-146666253-7;1466662522;978-146666252-0, https://www.scopus.com/record/display.uri?eid=2-s2.0-84949772000&origin=resultslist&sort=plf-f&src=s&sid=fe3a1e48c11c53ca8309bac5f2c854f0&sot=autdocs&sdt=autdocs&sl=18&s=AU-ID%2824366500900%29&relpos=1&citeCnt=0&searchTerm=


 The purpose of this article is to present current research on the modern Bulgarian language. It is one of the oldest European languages. An information system for the management of the electronic archive with texts in Bulgarian language is described. It provides the possibility for processing the collected text information. The detailed and comprehensive researches on the letter and the word frequency in the modern Bulgarian language from varied sources (fiction, scientific and popular science literature, press, legal texts, government bulletins, etc.) are performed and the obtained results are represented. The index of coincidence of the Bulgarian language as a whole and for the individual sources is computed. The results can be utilized by different specialists – computer scientists, linguists, cryptanalysts and others. Furthermore, with mathematical modeling we found the letter and word frequency distributions and their models and we estimated their standard deviations by documents.
  Част от книга / Глава от книга
 letter frequency, bigram, trigram, word frequency, Cyrillic alphabet, Bulgarian language, letter and the word frequency distributions, mathematical modeling


Природни науки, математика и информатика
Природни науки, математика и информатика Информатика и компютърни науки

Natural sciences, mathematics and informatics
Natural sciences, mathematics and informatics Informatics and Computer Science

 Издадено
  6955
 Цветанка Георгиева-Трифонова

Научният архив поддържа инициативата за отворен достъп OAI 2.0 с начален адрес: http://da.uni-vt.bg/oai2/