Historical Newspaper Content Mining: Revisiting the impresso Project’s Challenges in Text and Image Processing, Design and Historical Scholarship

paper, specified "long paper"
Authorship
  1. 1. Maud Ehrmann

    École Polytechnique Fédérale de Lausanne (EPFL)

  2. 2. Estelle Bunout

    Luxembourg Centre for Contemporary and Digital History (C2DH) - University of Luxembourg / Universität Luxemburg

  3. 3. Simon Clematide

    Universität Zürich (University of Zurich)

  4. 4. Marten Düring

    Luxembourg Centre for Contemporary and Digital History (C2DH) - University of Luxembourg / Universität Luxemburg

  5. 5. Andreas Fickers

    Luxembourg Centre for Contemporary and Digital History (C2DH) - University of Luxembourg / Universität Luxemburg

  6. 6. Roman Kalyakin

    Luxembourg Centre for Contemporary and Digital History (C2DH) - University of Luxembourg / Universität Luxemburg

  7. 7. Frédéric Kaplan

    École Polytechnique Fédérale de Lausanne (EPFL)

  8. 8. Matteo Romanello

    École Polytechnique Fédérale de Lausanne (EPFL)

  9. 9. Paul Schroeder

    Luxembourg Centre for Contemporary and Digital History (C2DH) - University of Luxembourg / Universität Luxemburg

  10. 10. Philipp Ströbel

    Universität Zürich (University of Zurich)

  11. 11. Thijs van Beek

    Luxembourg Centre for Contemporary and Digital History (C2DH) - University of Luxembourg / Universität Luxemburg

  12. 12. Martin Volk

    Universität Zürich (University of Zurich)

  13. 13. Lars Wieneke

    Luxembourg Centre for Contemporary and Digital History (C2DH) - University of Luxembourg / Universität Luxemburg

Work text
This plain text was ingested for the purpose of full-text search, not to preserve original formatting or readability. For the most complete copy, refer to the original conference program.

impresso. Media Monitoring of the Past is an interdisciplinary research project in which a team of computational linguists, designers and historians collaborate on the datafication of a multilingual corpus of digitised historical newspapers. The primary goals of the project are to improve text mining tools for historical text, to enrich historical newspapers with (semi-) automatically generated data and to integrate such data into historical research workflows by means of a newly developed user interface. In this paper we discuss our efforts to overcome inherent challenges and to integrate text mining and data visualisation applications in general historical research practices which are characterised by search operations as well as the need to create topical collections.

If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.

Conference Info

In review

ADHO - 2020
"carrefours / intersections"

Hosted at Carleton University, Université d'Ottawa (University of Ottawa)

Ottawa, Ontario, Canada

July 20, 2020 - July 25, 2020

475 works by 1078 authors indexed

Conference cancelled due to coronavirus. Online conference held at https://hcommons.org/groups/dh2020/. Data for this conference were initially prepared and cleaned by May Ning.

Conference website: https://dh2020.adho.org/

References: https://dh2020.adho.org/abstracts/

Series: ADHO (15)

Organizers: ADHO