Developing an image-based classifier for detecting poetic content in historic newspaper collections

Elizabeth Lorang, Leen Kiat Soh, Maanas Varma Datla, Spencer Kulwicki

Research output: Contribution to journalArticlepeer-review

10 Scopus citations


The Image Analysis for Archival Discovery (Aida) project team is investigating the use of image analysis to identify poetic content in historic newspapers. The project seeks both to augment the study of literary history by drawing attention to the magnitude of poetry published in newspapers and by making the poetry more readily available for study, as well as to advance work on the use of digital images in facilitating discovery in digital libraries and other digitized collections. We have recently completed the process of training our classifier for identifying poetic content, and as we prepare to move to the deployment stage, we are making available our methods for classification and testing in order to promote further research and discussion. The precision and recall values achieved during the training (90.58%; 79.4%) and testing (74.92%; 61.84%) stages are encouraging. In addition to discussing why such an approach is needed and relevant and situating our project alongside related work, this paper analyzes preliminary results, which support the feasibility and viability of our approach to detecting poetic content in historic newspaper collections.

Original languageEnglish (US)
Article number5
JournalD-Lib Magazine
Issue number7-8
StatePublished - 2015

ASJC Scopus subject areas

  • Library and Information Sciences


Dive into the research topics of 'Developing an image-based classifier for detecting poetic content in historic newspaper collections'. Together they form a unique fingerprint.

Cite this