Current Issue
Preview Issue
Previous Issues
Preview Issue
Previous Issues
- 2023: 17.3
- 2023: 17.2
- 2023: 17.1
- 2022: 16.4
- 2022: 16.3
- 2022: 16.2
- 2022: 16.1
- 2021: 15.4
- 2021: 15.3
- 2021: 15.2
- 2021: 15.1
- 2020: 14.4
- 2020: 14.3
- 2020: 14.2
- 2020: 14.1
- 2019: 13.4
- 2019: 13.3
- 2019: 13.2
- 2019: 13.1
- 2018: 12.4
- 2018: 12.3
- 2018: 12.2
- 2018: 12.1
- 2017: 11.4
- 2017: 11.3
- 2017: 11.2
- 2017: 11.1
- 2016: 10.4
- 2016: 10.3
- 2016: 10.2
- 2016: 10.1
- 2015: 9.4
- 2015: 9.3
- 2015: 9.2
- 2015: 9.1
- 2014: 8.4
- 2014: 8.3
- 2014: 8.2
- 2014: 8.1
- 2013: 7.3
- 2013: 7.2
- 2013: 7.1
- 2012: 6.3
- 2012: 6.2
- 2012: 6.1
- 2011: 5.3
- 2011: 5.2
- 2011: 5.1
- 2010: 4.2
- 2010: 4.1
- 2009: 3.4
- 2009: 3.3
- 2009: 3.2
- 2009: 3.1
- 2008: 2.1
- 2007: 1.2
- 2007: 1.1
ISSN 1938-4122
Announcements
DHQ: Digital Humanities Quarterly
2024 18.1
Articles
Cuneiform Stroke Recognition and Vectorization in 2D Images
Adéla Hamplová, Czech University of Life Sciences Prague; Avital Romach, Yale University; Josef Pavlíček, Czech University of Life Sciences Prague; Arnošt Veselý, Czech University of Life Sciences Prague; Martin Čejka, Czech University of Life Sciences Prague; David Franc, Czech University of Life Sciences Prague; Shai Gordin, Ariel University; Open University of Israel
Abstract
[en]
A vital part of the publication process of ancient cuneiform tablets is creating hand-copies, which are 2D line art representations of
the 3D cuneiform clay tablets, created manually by scholars. This research provides an innovative method using Convolutional Neural
Networks (CNNs) to identify strokes, the constituent parts of cuneiform characters, and display them as vectors — semi-automatically
creating cuneiform hand-copies. This is a major step in optical character recognition (OCR) for cuneiform texts, which would contribute
significantly to their digitization and create efficient tools for dealing with the unique challenges of Mesopotamian cultural heritage.
Our research has resulted in the successful identification of horizontal strokes in 2D images of cuneiform tablets, some of them from
very different periods, separated by hundreds of years from each other. With the Detecto algorithm, we achieved an F-measure of 81.7%
and an accuracy of 90.5%. The data and code of the project are available on GitHub.
Exploring Combinatorial Methods to Produce Sonnets: An Overview of the Oupoco
Project
Frédérique Mélanie-Becquet, LATTICE (CNRS & ENS/PSL & Univ. Sorbonne Nouvelle); Clément Plancq, LATTICE (CNRS & ENS/PSL & Univ. Sorbonne Nouvelle); Claude Grunspan, LATTICE (CNRS & ENS/PSL & Univ. Sorbonne Nouvelle); Mylène Maignant, LATTICE (CNRS & ENS/PSL & Univ. Sorbonne Nouvelle); Matthieu Raffard, Atelier Raffard-Roussel; Mathilde Roussel, Atelier Raffard-Roussel; Fiammetta Ghedini, RIVA Illustrations; Thierry Poibeau, LATTICE (CNRS & ENS/PSL & Univ. Sorbonne Nouvelle)
Abstract
[en]
In this paper, we describe Oupoco (l’Ouvroir de Poésie Combinatoire), a system producing new sonnets by
recombining lines of poetry from existing sonnets, following an idea that Queneau described in his
book Cent Mille Milliards de poèmes (A Hundred Thousand Billion
Poems, 1961). We first give the rationale of the project and review past experiments in poetry generation
using combinatorial methods. We then demonstrate different outputs of our implementation (a Web site, a Twitter bot
and a specifically developed device, called the Boîte à poésie) based on
a corpus of 19th century French poetry. We describe how this project was an opportunity to work with artists and
reach a new audience through the Boîte à poésie, and also through a
video clip that frequently served as an introduction to the project. Our goal is to revive people’s interest in
poetry by giving access to automatically produced sonnets through original and entertaining channels and
devices.
Recognition and Analysis of the Proceedings of the Greek Parliament after WWII
Epameinondas-Konstantinos Barmpounis, Athens University of Economics and Business; John Pavlopoulos, Athens University of Economics and Business; Panos Louridas, Athens University of Economics and Business; Dritsa Konstantina, Athens University of Economics and Business
Abstract
[en]
The first post-WWII years in Greece were devastating. After a brutal Nazi occupation, the Greek Civil War (1946–1949) erupted. It
wrecked the economy and the country's infrastructure and altered politics and the social fabric for decades to come. A study of the
issues discussed in the Greek Parliament during the tense and unstable first years of the conflict (1946-1947) could
facilitate our understanding of the society at the time. An obstacle is that parliament proceedings are publicly available in a
machine-readable form beginning in 1989; before that only scanned images of the original records exist. We show that text recognition
followed by natural language processing can unlock this corpus for historical research. Using Transkribus, we trained a text recogniser
(1.5% CER) that we applied to 3,156 images from 1946 and 1947. As low-quality recognition is inevitable, we trained a language model on the
transcribed text and applied it to recognised text, discarding records with high average cross-entropy. Using information extraction
techniques, we sampled speeches that were applauded and we introduced the first quantification of issues that were thus received.
All our resources are made available at https://zenodo.org/record/8302990.
Cross-codex Learning for Reliable Scribe Identification in Medieval
Manuscripts
Julius Weißmann, Media and Digital Technologies, St. Pölten University of Applied Sciences; Markus Seidl, Media and Digital Technologies, St. Pölten University of Applied Sciences; Anya Dietrich, MEG Unit, Brain Imaging Center; Martin Haltrich, Library, Klosterneuburg Abbey
Abstract
[en]
Historic scribe identification is a substantial task for obtaining information about the past. Uniform script
styles, such as the Carolingian minuscule, make it a difficult task for classification to focus on meaningful
features. Therefore, we demonstrate in this paper the importance of cross-codex training data for CNN based
text-independent off-line scribe identification, to overcome codex dependent overfitting. We report three main
findings: First, we found that preprocessing with masked grayscale images instead of RGB images clearly
increased the F1-score of the classification results. Second, we trained different neural networks on our
complex data, validating time and accuracy differences in order to define the most reliable network
architecture. With AlexNet, the network with the best trade-off between F1-score and time, we achieved for
individual classes F1-scores of up to 0,96 on line level and up to 1.0 on page level in classification. Third,
we could replicate the finding that the CNN output can be further improved by implementing a reject option,
giving more stable results. We present the results on our large scale open source dataset – the Codex
Claustroneoburgensis database (CCl-DB) – containing a significant number of writings from different scribes in
several codices. We demonstrate for the first time on a dataset with such a variety of codices that
paleographic decisions can be reproduced automatically and precisely with CNNs. This gives manifold new and
fast possibilities for paleographers to gain insights into unlabeled material, but also to develop further
hypotheses.
Fingerprints of British Book History: A Feminist Labor History of EEBO
Ana Quiring, University of Missouri — St. Louis
Abstract
[en]
In this essay, I give a labor history of the commonly used database Early English Books Online. EEBO began its
life as a mirofilm archive produced beginning in 1940: a massive book-copying project was undertaken during World
War II to protect rare books from German bombs. These reproductions were made largely by unnamed woman archivists
and led by a team of woman photographers, academics, and secretaries. In this essay, I draw on existing theories
and histories of EEBO while highlighting these women's work, which manifests in the photographs through the shadows
of the fingertips they used to pin down the fragile books. This archive of reproductive labor exemplifies the
artificial divide between auteur and stenographer, artist and secretary, that animated many of the early twentieth
century’s avant-garde movements, including ones in which EEBO photographers were intimately involved. The fingers
represent the custodial labor of women working across artistic and administrative modes during World War II and the
decades that followed.
Building an Interface as an Argument? The Case Study of Untangling the Cordel
Elina Leblanc, University of Geneva
Abstract
[en]
The project Untangling the cordel (2020-2024) aims at studying and promoting a collection
of 19th-century Spanish chapbooks via a digital library (DL). This resource is composed of
digital scholarly editions of chapbooks and of a catalogue of woodcuts, which decorate the
first page of almost all the documents. In this paper, after presenting the project’s
editorial workflow, we focus our attention of the way we design the interface of this DL
to represent the different facets of chapbooks (document, text and illustrations). For
that, we have chosen to follow a method, proposed by Andrews and van Zundert in 2018, that
consider an interface as an argument editors made about their data and their digital
editions. Through this case study, we demonstrate the feasibility of this approach, where
each component of an interface contributes to the scientific discourse a project made
about its goals and its perception of digital editing. We also stress the impact of this
method on user experience and on a project itself, as another way to see data and their
modelling.
Reviews
Tractable Tensions: A Review of Digital Humanities: Knowledge and Critique in a
Digital Age by David M. Berry and Anders Fagerjord
Onyekachi Henry Ibekwe, University of Nigeria
Abstract
[en]
This book analyzes the tensions that arise when the numerical, predictive exactitude of digital computing embraces
the humanities. Such tensions are thrown into frequent and sharper relief, whereas the sciences
proceed on the basis of skepticism, and the humanities proceed on the basis of interpretation, criticism and
dialectical engagement with given issues. In addition, the pervasive nature of digital computing heightens the risk
of over-shadowing the humanities. The authors argue for a critical turn in digital humanities.
Author Biographies
URL: http://www.digitalhumanities.org/dhq/preview/index.html
Comments: dhqinfo@digitalhumanities.org
Published by: The Alliance of Digital Humanities Organizations and The Association for Computers and the Humanities
Affiliated with: Digital Scholarship in the Humanities
DHQ has been made possible in part by the National Endowment for the Humanities.
Copyright © 2005 -
Unless otherwise noted, the DHQ web site and all DHQ published content are published under a Creative Commons Attribution-NoDerivatives 4.0 International License. Individual articles may carry a more permissive license, as described in the footer for the individual article, and in the article’s metadata.
Comments: dhqinfo@digitalhumanities.org
Published by: The Alliance of Digital Humanities Organizations and The Association for Computers and the Humanities
Affiliated with: Digital Scholarship in the Humanities
DHQ has been made possible in part by the National Endowment for the Humanities.
Copyright © 2005 -
Unless otherwise noted, the DHQ web site and all DHQ published content are published under a Creative Commons Attribution-NoDerivatives 4.0 International License. Individual articles may carry a more permissive license, as described in the footer for the individual article, and in the article’s metadata.