Schedule

DATeCH 2014

Digitisation Days Schedule

Monday, May 19

8.30-9.00: Registration

9.00-9.10: Welcome

Ana Santos (BNE director), Klaus Schulz and Apostolos Antonacopoulos (Programme chairs)

9.10-9.30: Presentation by the BNE

9.30-11.00: Session 1. Document Analysis and OCR

- Automatic Article Extraction in Old Newspapers Digitized Collections.
  David Hebert, Thomas Palfray, Pierrick Tranouez, Stéphane Nicolas and Thierry Paquet.
- Document Representation Refinement for Precise Region Description.
  Christian Clausner, Stefan Pletschacher and Apostolos Antonacopoulos.
- Recognition of Degraded Ancient Characters Based on Dense SIFT.
  Sajid Saleem, Fabian Hollaus and Robert Sablatnig.

11.00-11.10: Company presentation 1

11.10-12.00: Coffee break with exhibits

12.00-13.30: Session 2. Linguistic processing and encoding

- Automated Assignment of Topics to OCRed Historical Texts.
  Klaus Schulz, Christoph Ringlstetter and Florian Fink.
- An approach to unsupervised historical text normalisation.
  Petar Mitankin, Stefan Gerdjikov and Stoyan Mihov.
- Reflections on Cultural Heritage and Digital Humanities: Modelling in Practice and Theory.
  Arianna Ciula and Øyvind Eide.

13.30-13.40: Company presentation 2

13.40-15.00: Lunch break

15.00-16.30: Session 3. Postcorrection

- Correcting Noisy OCR: Context beats Confusion.
  John Evershed and Kent Fitch.
- User-centered correction of noisy OCR text in large document collections.
  Günter Mühlberger, Johannes Zelger, David Sagmeister and Albert Greinöcker.
- PoCoTo – An Open Source System for Efficient Interactive Postcorrection of OCRed Historical Texts.
  Thorsten Vobl, Annette Gotscharek, Ulrich Reffle, Christoph Ringlstetter and Klaus Schulz.

16.30-17.00: Tea with exhibits

17.00-18.20: Panel session: IPR and digitised content

18.20: End of first Digitisation Day

Tuesday, May 20

9.00-11.00: Session 4. Best practices and experiences

- Construction of a Text Digitization System for Nôm Historical Documents.
  Truyen Van Phan and Masaki Nakagawa.
- OCR of Historical Printings of Latin Texts: Problems, Prospects, Progress.
  Uwe Springmann, Dietmar Najock, Hermann Morgenroth, Helmut Schmid, Annette Gotscharek and Florian Fink.
- Digital Editions and Diplomatic Diagrams.
  Christine Roughan.
- Cataloging for a billion word library of Greek and Latin.
  Gregory Crane, Bridget Almas, Alison Babeu, Lisa Cerrato, Anna Krohn, Frederik Baumgardt, Monica Berti, Greta Franzini and Simona Stoyanova.

11.00-11.10: Company presentation 3

11.10-11.50: Coffee break with exhibits

11.50-13.00: Panel session: The digitisation of cultural heritage: modern utopia?

- Milagros del Corral, UNESCO
- Jill Cousins, Europeana
- Frank Frischmuth, German Digital Library
- Michael Keller, Stanford University Library
- Steven Krauwer, Utrecht University
- Andrew Prescott, King’s College London

13.00-13.20: Succeed awards

Presented by an EC representative and the directors of Succeed and the Impact Centre of Competence.

13.20-14.00: Closing ceremony

Hildelies Balk, European Commission, S.A.R. Príncipe de Asturias

14.00-15.30: Lunch

Session 5. Enrichment

- Wittgenstein’s Nachlass: WiTTFind and Wittgenstein Advanced Search Tools (WAST).
  Maximilian Hadersbeck, Alois Pichler, Florian Fink and Øyvind Liland Gjesdal.
- Estimating and Rating the Quality of Optically Character Recognised Text.
  Beatrice Alex and John Burns.
- A Bimodal Crowdsourcing Platform for Demographic Historical Manuscripts.
  Alicia Fornés, Josep Llados, Joan Mas, Joana Maria Pujades and Anna Cabré.

17.00-17.20: 1-minute poster madness

17.20-18.30: Poster session and cocktail

- Handwritten text recognition for historical documents in the tranScriptorium project. Joan Andreu Sánchez, Vicent Bosch, Verónica Romero, Katrien Depuydt and Jesse de Does
- OCR correction of documents generated during Argentina’s National Reorganization Process. Paula Estrella and Pablo Paliza
- Transcribing a XVII Manuscript from Scratch using Computer-Assisted Transcription Technology. Vicent Bosch, Isabel Bordes Cabrera, Paloma Cuenca Muñoz, Celio Hernández Tornero, Luis Leiva, Moisés Pastor Gadea, Verónica Romero, Alejandro Toselli and Enrique Vidal
- H-DocPro: A Document Image Processing Platform for Historical Documents. Basilis Gatos, Nikolaos Stamatopoulos, Georgios Louloudis and Stavros Perantonis
- Semantics in Storytelling in Swedish Fiction. Dimitrios Kokkinakis, Mats Malm, Jenny Bergenmar and Ann Ighe
- From lighthouse to framework: visualising digital scholarly editions with pathways and histories. Nicholas Hayward
- Creation of custom recognition profiles for historical documents. Adam Dudczak, Aleksandra Nowak and Tomasz Parkoła
- An Adaptive Zoning Technique for Efficient Word Retrieval Using Dynamic Time Warping. Alexandros Papandreou, Basilis Gatos and Georgios Louloudis
- Highly interactive and natural user interfaces: Enabling visual analysis in historical lexicography. Roberto Theron, Carlos Seguín, Laura de La Cruz and María Vaquero
- On OCR ground truths and OCR post-correction gold standards, tools and formats. Martin Reynaert
- Automated Page Layout Simplification of Patrologia Graeca. Bruce Robertson, Christoph Dalitz and Fabian Schmitt
- PIVAJ: displaying and augmenting digitized newspapers on the Web Experimental feedback from the “Journal de Rouen” Collection. David Hebert, Thomas Palfray, Stéphane Nicolas, Pierrick Tranouez and Thierry Paquet
- An open-source OCR evaluation tool. Rafael C. Carrasco
- Logical Structure Recognition for Heterogeneous Periodical Collections. Iuliu Konya and Stefan Eickeler
- Data Processing and Lemmatization in Digitized 19th-Century Czech Texts. Karel Kučera and Martin Stluka
- Using Ancestral Layout Models for Document Digitization. Hervé Déjean

18.30-18.50: Wrap up

18.50: End of the Digitisation Days
