Accepted papers

DATeCH 2014

DATeCH2014 proceedings are available at:

Accepted oral presentations

  • David Hebert, Thomas Palfray, Pierrick Tranouez, Stéphane Nicolas and Thierry Paquet. Automatic Article Extraction in Old Newspapers Digitized Collections

  • Christian Clausner, Stefan Pletschacher and Apostolos Antonacopoulos. Document Representation Refinement for Precise Region Description

  • Sajid Saleem, Fabian Hollaus and Robert Sablatnig. Recognition of Degraded Ancient Characters Based on Dense SIFT

  • Klaus Schulz, Christoph Ringlstetter and Florian Fink. Automated Assignment of Topics to OCRed Historical Texts

  • Petar Mitankin, Stefan Gerdjikov and Stoyan Mihov. An approach to unsupervised historical text normalisation

  • Arianna Ciula and Øyvind Eide. Reflections on Cultural Heritage and Digital Humanities: Modelling in Practice and Theory

  • John Evershed and Kent Fitch. Correcting Noisy OCR: Context beats Confusion

  • Günter Mühlberger, Johannes Zelger, David Sagmeister and Albert Greinöcker. User-driven correction of OCR errors. Combing crowdsourcing and information retrieval technology

  • Thorsten Vobl, Annette Gotscharek, Ulrich Reffle, Christoph Ringlstetter and Klaus Schulz. PoCoTo – An Open Source System for Efficient Interactive Postcorrection of OCRed Historical Texts

  • Truyen Van Phan and Masaki Nakagawa. Construction of a Text Digitization System for Nôm Historical Documents

  • Uwe Springmann, Dietmar Najock, Hermann Morgenroth, Helmut Schmid, Annette Gotscharek and Florian Fink. OCR of Historical Printings of Latin Texts: Problems, Prospects, Progress

  • Christine Roughan. Digital Editions and Diplomatic Diagrams

  • Gregory Crane, Bridget Almas, Alison Babeu, Lisa Cerrato, Anna Krohn, Frederik Baumgardt, Monica Berti, Greta Franzini and Simona Stoyanova. Cataloging for a billion word library of Greek and Latin

  • Maximilian Hadersbeck, Alois Pichler, Florian Fink and Øyvind Liland Gjesdal. Wittgenstein’s Nachlass: WiTTFind and Wittgenstein Advanced Search Tools (WAST)

  • Beatrice Alex and John Burns. Estimating and Rating the Quality of Optically Character Recognised Text

  • Alicia Fornés, Josep Llados, Joan Mas, Joana Maria Pujades and Anna Cabré. A Bimodal Crowdsourcing Platform for Demographic Historical Manuscripts

Accepted posters

  • Joan Andreu Sánchez, Vicente Boch, Verónica Romero, Katrien Depuydt and Jesse de Does. Handwritten text recognition for historical documents in the tranScriptorium project

  • Paula Estrella and Pablo Paliza. OCR correction of documents generated during Argentina’s National Reorganization Process

  • Vicent Bosch, Isabel Bordes Cabrera, Paloma Cuenca Muñoz, Celio Hernández Tornero, Luis Leiva, Moisés Pastor Gadea, Verónica Romero, Alejandro Toselli and Enrique Vidal. Transcribing a XVII Manuscript from Scratch using Computer-Assisted Transcription Technology

  • Basilis Gatos, Nikolaos Stamatopoulos, Georgios Louloudis and Stavros Perantonis. H-DocPro: A Document Image Processing Platform for Historical Documents

  • Dimitrios Kokkinakis, Mats Malm, Jenny Bergenmar and Ann Ighe. Semantics in Storytelling in Swedish Fiction

  • Nicholas Hayward. From lighthouse to framework: visualising digital scholarly editions with pathways and histories

  • Adam Dudczak, Aleksandra Nowak and Tomasz Parkoła. Creation of custom recognition profiles for historical documents

  • Alexandros Papandreou, Basilis Gatos and Georgios Louloudis. An Adaptive Zoning Technique for Efficient Word Retrieval Using Dynamic Time Warping

  • Roberto Theron, Carlos Seguín, Laura de La Cruz and María Vaquero. Highly interactive and natural user interfaces: Enabling visual analysis in historical lexicography

  • Martin Reynaert.On OCR ground truths and OCR post-correction gold standards, tools and formats

  • Bruce Robertson, Christoph Dalitz and Fabian Schmitt. Automated Page Layout Simplification of Patrologia Graeca

  • David Hebert, Thomas Palfray, Stéphane Nicolas, Pierrick Tranouez and Thierry Paquet. PIVAJ: displaying and augmenting digitized newspapers on the Web Experimental feedback from the “Journal de Rouen” Collection

  • Rafael C. Carrasco. An open-source OCR evaluation tool

  • Iuliu Konya and Stefan Eickeler. Logical Structure Recognition for Heterogeneous Periodical Collections

  • Karel Kučera and Martin Stluka. Data Processing and Lemmatization in Digitized 19th-Century Czech Texts

  • Hervé Déjean. Using Ancestral Layout Models for Document Digitization