Лексика как классифицирующий признак современной поэзии [Vocabulary as a Classifying Feature of Russian Postmodern Poetry]

Boris Orekhov


The article examines whether poetry books can be classified on the basis of their vocabulary. The distance between 190 poetry collections is calculated as the Euclidean distance between the books' vocabularies, with a TF-IDF (term frequency – inverse document frequency) value computed for each vocabulary item; under this method each book is described by 190 dimensions. Using t-SNE (t-distributed stochastic neighbor embedding), these 190 dimensions are reduced to two, and K-means clustering is applied to the resulting two-dimensional structure. This method groups poets by the originality or similarity of their vocabulary, which in turn helps move beyond more traditional classifications based on poets' generations or literary schools.
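The first step described above (TF-IDF weights, then pairwise Euclidean distances) can be sketched roughly as follows. This is a minimal illustration, not the author's implementation: the toy corpus, the function names `tfidf_vectors` and `distance_matrix`, and the plain `tf * log(N / df)` weighting are all assumptions, and the article may use a different TF-IDF variant or preprocessing. Reading "190 dimensions per book" as one row of the distance matrix is likewise an interpretation of the abstract.

```python
import math
from collections import Counter


def tfidf_vectors(books):
    """Turn tokenized books into TF-IDF vectors over a shared vocabulary.

    `books` is a list of non-empty token lists (hypothetical input format).
    Weighting assumed here: (count / book length) * log(N / document frequency).
    """
    n_books = len(books)
    vocab = sorted({token for book in books for token in book})
    # Document frequency: in how many books each word occurs at least once.
    doc_freq = Counter(token for book in books for token in set(book))
    vectors = []
    for book in books:
        counts = Counter(book)
        vectors.append([
            (counts[token] / len(book)) * math.log(n_books / doc_freq[token])
            for token in vocab
        ])
    return vocab, vectors


def distance_matrix(vectors):
    """Pairwise Euclidean distances; row i holds book i's distance to every book."""
    return [[math.dist(a, b) for b in vectors] for a in vectors]


# Toy corpus standing in for the 190 collections (invented for illustration).
toy_books = [
    "snow night snow window".split(),
    "snow night lamp window".split(),
    "protocol server night error".split(),
]

vocab, vectors = tfidf_vectors(toy_books)
distances = distance_matrix(vectors)

for row in distances:
    print([round(value, 3) for value in row])
```

With N books, each row of `distances` has N entries, which is one way to read the abstract's 190 dimensions per book. The later steps would take these rows as input, for example via scikit-learn's `TSNE(n_components=2)` followed by `KMeans`; the choice of that library is an assumption, since the abstract does not name the software used.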

Keywords: 21st-Century Russian Poetry, Lexis, t-Distributed Stochastic Neighbor Embedding, Digital Humanities.


DOI: https://doi.org/10.22601/SR.2019.06.08


ISSN 2346-5824 (print)
ISSN 2504-7531 (online)