ATHENA: Automatic Text Height ExtractioN for the Analysis of old handwritten manuscripts

Loading...
Thumbnail Image
Date
2013
Journal Title
Journal ISSN
Volume Title
Publisher
The Eurographics Association
Abstract
A massive digital acquisition of huge sets of deteriorating historical documents is mandatory due to their value and delicacy. The study and the browsing of such digital libraries is becoming crucial for scholars in the Cultural Heritage field, but it requires automatic tools for analyzing and indexing those dataset items. We present here a layout analysis method to perform automatic text height estimation, without the need of any kind of manual intervention and user defined parameters. It proves to be a robust technique in the case of very noisy and damaged handwritten manuscripts. The effectiveness of the method is demonstrated on a huge heterogeneous corpus of medieval manuscripts, with different writing styles, and affected by other uncontrollable factors, such as ink bleed-through, background noise, and overtyping text lines.
Description

        
@inproceedings{
10.1109:DigitalHeritage.2013.6743802
, booktitle = {
Digital Heritage International Congress
}, editor = {
-
}, title = {{
ATHENA: Automatic Text Height ExtractioN for the Analysis of old handwritten manuscripts
}}, author = {
Pintus, Ruggero
and
Yang, Ying
and
Rushmeier, Holly
}, year = {
2013
}, publisher = {
The Eurographics Association
}, ISBN = {}, DOI = {
10.1109/DigitalHeritage.2013.6743802
} }
Citation
Collections