Research interests: Infometrics and Scientometrics, Informational technologies in Library; Integration of electronic resources; NLP. Certified trainer of Thomson Reuters. One of the authors and tutors of distance course "Curator of content" (NTU "KhPI").
AI and ML-2017 ongoing
Formation of the Text Corpus and Identification of Author Style in Academic Works
Qualitative scientific work must be original. Today's information systems are used to detect possible plagiarism to confirm originality. Due to on a lot of possible variants for text presentation, such systems have limited capabilities. But they can be used as additional means of detecting plagiarism. This research shows the features of text corpus creation for the development and testing of software tools for plagiarism detection; experiments with the determination of statistical estimates and lingometry parameters.