DATeCH 2014

Digitisation Days Schedule

Monday, May 19

8.30-9-00: Registration

9.00-9.10: Welcome

Ana Santos (BNE director), Klaus Schulz and Apostolos Antonacopoulos (Programme chairs)

9.10-9.30: Presentation by the BNE

9.30-11: Session 1. Document Analysis and OCR

  • Automatic Article Extraction in Old Newspapers Digitized Collections.
    David Hebert, Thomas Palfray, Pierrick Tranouez, Stéphane Nicolas and Thierry Paquet.

  • Document Representation Refinement for Precise Region Description.
    Christian Clausner, Stefan Pletschacher and Apostolos Antonacopoulos.

  • Recognition of Degraded Ancient Characters Based on Dense SIFT.
    Sajid Saleem, Fabian Hollaus and Robert Sablatnig.

11.00-11.10: Company presentation 1

11.10-12.00: Coffee break with exhibits

12.00-13.30: Session 2. Linguistic processing and encoding

    • Automated Assignment of Topics to OCRed Historical Texts.
      Klaus Schulz, Christoph Ringlstetter and Florian Fink.

    • An approach to unsupervised historical text normalisation.
      Petar Mitankin, Stefan Gerdjikov and Stoyan Mihov.

    • Reflections on Cultural Heritage and Digital Humanities: Modelling in Practice and Theory.
      Arianna Ciula and Øyvind Eide.

13.30-13.40: Company presentation 2

13.40-15.00: Lunch break

15.00-16.30: Session 3. Postcorrection

    • Correcting Noisy OCR: Context beats Confusion.
      John Evershed and Kent Fitch.

    • User-centered correction of noisy OCR text in large document collections.
      Günter Mühlberger, Johannes Zelger, David Sagmeister and Albert Greinöcker.

    • PoCoTo – An Open Source System for Efficient Interactive Postcorrection of OCRed Historical Texts.
      Thorsten Vobl, Annette Gotscharek, Ulrich Reffle, Christoph Ringlstetter and Klaus Schulz.

16.30-17.00: Tea with exhibits

17.00-18.20: Panel session: IPR and digitised content

18.20: End of first Digitisation Day

Tuesday, May 20

9.00-11.00: Session 4. Best practices and experiences

    • Construction of a Text Digitization System for Nôm Historical Documents.
      Truyen Van Phan and Masaki Nakagawa.

    • OCR of Historical Printings of Latin Texts: Problems, Prospects, Progress.
      Uwe Springmann, Dietmar Najock, Hermann Morgenroth, Helmut Schmid, Annette Gotscharek and Florian Fink.

    • Digital Editions and Diplomatic Diagrams.
      Christine Roughan

    • Cataloging for a billion word library of Greek and Latin.
      Gregory Crane, Bridget Almas, Alison Babeu, Lisa Cerrato, Anna Krohn, Frederik Baumgardt, Monica Berti, Greta Franzini and Simona Stoyanova.

11.00-11.10: Company presentation 3

11.10-11.50: Coffee break with exhibits

11.50-13.00: Panel session: The digitisation of cultural heritage: modern utopia?

    • Milagros del Corral, UNESCO

    • Jill Cousins, Europeana

    • Frank Frischmuth, German Digital Library

    • Michael Keller, Stanford University Library

    • Steven Krauwer, Utrecht University

    • Andrew Prescott, King’s College London

13.00-13.20: Succeed awards

Delivered by EC representative, Succeed and Impact Centre of Competence directors.

13.20-14.00: Closing ceremony

Hildelies Balk, European Commision, S.A.R Príncipe de Asturias

14.00-15.30: Lunch

Session 5. Enrichment

    • Wittgenstein’s Nachlass: WiTTFind and Wittgenstein Advanced Search Tools (WAST).
      Maximilian Hadersbeck, Alois Pichler, Florian Fink and Øyvind Liland Gjesdal.

    • Estimating and Rating the Quality of Optically Character Recognised Text.
      Beatrice Alex and John Burns.

    • A Bimodal Crowdsourcing Platform for Demographic Historical Manuscripts.
      Alicia Fornés, Josep Llados, Joan Mas, Joana Maria Pujades and Anna Cabré.

17.00-17.20: 1-minute poster madness

17.20-18.30: Poster session and cocktail

    • Handwritten text recognition for historical documents in the tranScriptorium project. Joan Andreu Sánchez, Vicente Boch, Verónica Romero, Katrien Depuydt and Jesse de Does

    • OCR correction of documents generated during Argentina’s National Reorganization Process,Paula Estrella and Pablo Paliza,

    • Transcribing a XVII Manuscript from Scratch using Computer-Assisted Transcription Technology. Vicent Bosch, Isabel Bordes Cabrera, Paloma Cuenca Muñoz, Celio Hernández Tornero, Luis Leiva, Moisés Pastor Gadea, Verónica Romero, Alejandro Toselli and Enrique Vidal

    • H-DocPro: A Document Image Processing Platform for Historical Documents. Basilis Gatos, Nikolaos Stamatopoulos, Georgios Louloudis and Stavros Perantonis

    • Semantics in Storytelling in Swedish Fiction. Dimitrios Kokkinakis, Mats Malm, Jenny Bergenmar and Ann Ighe

    • From lighthouse to framework: visualising digital scholarly editions with pathways and histories,Nicholas Hayward,

    • Creation of custom recognition profiles for historical documents. Adam Dudczak, Aleksandra Nowak and Tomasz Parkoła

    • An Adaptive Zoning Technique for Efficient Word Retrieval Using Dynamic Time Warping. Alexandros Papandreou, Basilis Gatos and Georgios Louloudis

    • Highly interactive and natural user interfaces: Enabling visual analysis in historical lexicography. Roberto Theron, Carlos Seguín, Laura de La Cruz and María Vaquero.

    • On OCR ground truths and OCR post-correction gold standards, tools and formats

    • Martin Reynaert,

    • Automated Page Layout Simplification of Patrologia Graeca. Bruce Robertson, Christoph Dalitz and Fabian Schmitt.

    • PIVAJ: displaying and augmenting digitized newspapers on the Web Experimental feedback from the “”Journal de Rouen”” Collection”. David Hebert, Thomas Palfray, Stéphane Nicolas, Pierrick Tranouez and Thierry Paquet

    • An open-source OCR evaluation tool. Rafael C. Carrasco

    • Logical Structure Recognition for Heterogeneous Periodical Collections. Iuliu Konya and Stefan Eickeler

    • Data Processing and Lemmatization in Digitized 19th-Century Czech Texts. Karel Kučera and Martin Stluka

    • Using Ancestral Layout Models for Document Digitization. Hervé Déjean

18.30-18.50: Wrap up

18.50: End of the Digitisation Days