{"id":23426,"date":"2026-03-11T15:25:55","date_gmt":"2026-03-11T14:25:55","guid":{"rendered":"https:\/\/dhd-blog.org\/?p=23426"},"modified":"2026-03-11T15:25:55","modified_gmt":"2026-03-11T14:25:55","slug":"from-modelling-to-transcription-workshop-notes-from-dhd2026","status":"publish","type":"post","link":"https:\/\/dhd-blog.org\/?p=23426","title":{"rendered":"From Modelling to Transcription: Workshop Notes from DHd2026"},"content":{"rendered":"\n<p>During DHd2026 in Vienna, many discussions revolved around how digital tools shape the way we work with texts and data. Instead of trying to summarise the entire conference, this blog post focuses on the workshops I attended during the first days and on a few ideas that stayed with me throughout the week.<\/p>\n\n\n\n<p>Looking back at my notes, I realised they already suggested a structure for this post. The workshops I attended raised questions about modelling, transcription, and data that later reappeared in other panels and keynotes during the conference.<\/p>\n\n\n\n<h2><strong>Note 1: Starting with Practice<\/strong><\/h2>\n\n\n\n<p>My first two days at the conference were shaped by workshops, and that felt like a good way to begin. Rather than starting with big claims about digital humanities, I started by sitting down with tools, notebooks, scripts, and a lot of practical questions.<\/p>\n\n\n\n<p>On the first day, I attended the workshop <strong>\u201cBeyond Entities: Inhaltsbasierte Erschlie\u00dfung digitaler Editionen mit KI.\u201d<\/strong> We worked with Python, APIs, and Jupyter notebooks to extract RDF triples from TEI-encoded early modern letters. What I found especially interesting was that the workflow did not stop at named entities. Instead, it tried to model conceptual relations in the texts. Themes such as emotion, illness, or social order became part of a semantic structure that could then be visualised and analysed further.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"2560\" height=\"1460\" src=\"https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/IMG_20260223_140125-scaled.jpg\" alt=\"Titelfolie des Workshops &quot;Beyond Entities&quot; im Rahmen der DHd2026 in Wien.\" class=\"wp-image-23432\" srcset=\"https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/IMG_20260223_140125-scaled.jpg 2560w, https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/IMG_20260223_140125-300x171.jpg 300w, https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/IMG_20260223_140125-1024x584.jpg 1024w, https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/IMG_20260223_140125-768x438.jpg 768w, https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/IMG_20260223_140125-1536x876.jpg 1536w, https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/IMG_20260223_140125-2048x1168.jpg 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><\/figure>\n\n\n\n<p><span style=\"font-weight: 400\">I liked that this workshop stayed close to the material while still asking what kinds of structures can be made visible through computational methods. It also made very clear that modelling is never neutral. Even at the level of prompts and extraction rules, decisions shape what the final data looks like.<\/span><\/p>\n<h2><b>Note 2: From Audio to Text<\/b><\/h2>\n<p><span style=\"font-weight: 400\">The second workshop I attended, <\/span><b>\u201cVom Audio zum Text: Automatisierte Transkriptionen mit Whisper,\u201d<\/b><span style=\"font-weight: 400\"> shifted the focus from written material to spoken language. We looked at automated transcription workflows, compared tools, and worked through Python-based pipelines for transcription and speaker diarisation.<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1368\" height=\"774\" src=\"https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/Titelfolie_Workshop_Von-Audio-zu-Text_DHd2026.jpg\" alt=\"Titelfolie des Workshops &quot;Von Audio zu Text&quot; vom Data Science Center im Rahmen der DHd2026\" class=\"wp-image-23433\" srcset=\"https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/Titelfolie_Workshop_Von-Audio-zu-Text_DHd2026.jpg 1368w, https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/Titelfolie_Workshop_Von-Audio-zu-Text_DHd2026-300x170.jpg 300w, https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/Titelfolie_Workshop_Von-Audio-zu-Text_DHd2026-1024x579.jpg 1024w, https:\/\/dhd-blog.org\/app\/uploads\/2026\/03\/Titelfolie_Workshop_Von-Audio-zu-Text_DHd2026-768x435.jpg 768w\" sizes=\"auto, (max-width: 1368px) 100vw, 1368px\" \/><\/figure>\n\n\n\n<p><span style=\"font-weight: 400\">What stayed with me most was the discussion after the practical part. Our reflections quickly clustered around three terms: <\/span><b>transfer, opportunities, and limits<\/b><span style=\"font-weight: 400\">.<\/span><\/p>\n<p><span style=\"font-weight: 400\">The question of transfer came up in relation to both teaching and research. We talked about how automated transcription might be built into thesis work, methods courses, or training materials. There were ideas about shared standards, open educational resources, and also about making these workflows usable for people who are not deeply technical.<\/span><\/p>\n<p><span style=\"font-weight: 400\">The opportunities were easy to see. Automated transcription can save time, give a quick overview of larger amounts of audio material, and make certain kinds of corpus building much more realistic. Reusable code and adaptable workflows also make it easier to test different research setups.<\/span><\/p>\n<p><span style=\"font-weight: 400\">At the same time, the workshop discussion was just as much about the limits. In qualitative work especially, transcripts are never just raw text. Things like pauses, laughter, overlap, hesitation, and speaker dynamics matter, and automated systems do not capture all of this equally well. We also kept coming back to transparency, validation, and the need to check what a model is actually doing.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">For me, this workshop was useful precisely because it did not present automation as a magic solution. It showed where these tools can help, but also where they flatten the material.<\/span><\/p>\n<h2><b>Note 3: The Keynote as a Frame<\/b><\/h2>\n<p><span style=\"font-weight: 400\">Only after these two workshops came the opening keynote by Miriah Meyer, <\/span><i><span style=\"font-weight: 400\">\u201cData As ___________: Exploring the Plurality of Data in Visualization.\u201d<\/span><\/i><span style=\"font-weight: 400\"> By that point, I already had the workshop experiences in mind, and that made the keynote even more interesting to listen to.<\/span><\/p>\n<p><span style=\"font-weight: 400\">What I took from it was not one single definition of data, but the opposite. Data appeared here as something plural, shaped, and dependent on context. Meyer spoke about data as entangled, as design material, and as connection. That fit surprisingly well with what I had just seen in the workshops. Whether we are extracting semantic triples from letters or producing transcripts from audio, data do not simply appear ready-made. They are produced through tools, settings, modelling choices, and research interests.<\/span><\/p>\n<p><span style=\"font-weight: 400\">The question <\/span><i><span style=\"font-weight: 400\">\u201cIs data graffiti data?\u201d <\/span><\/i><span style=\"font-weight: 400\">while looking at sticker emojis museum visitors made on a data sheet, stayed with me because it made this point in a way that was funny and sharp at the same time. It pushed against the idea of data as something clean and self-evident.<\/span><\/p>\n<h2><b>Looking Back at the Rest of the Week<\/b><\/h2>\n<p><span style=\"font-weight: 400\">For me, the workshops were an amazing start to DHd2026 because they made it possible to move back and forth between trying things out and thinking about what those methods actually imply for research practice.<\/span><\/p>\n<p><span style=\"font-weight: 400\">As the conference continued, I noticed that many of the themes from the workshops reappeared in other sessions. Panels such as \u201cNot Just Text, Intertext!\u201d and projects like NAKAR returned to questions of modelling, connection, and interpretation. The final keynote by Katharina Kinder-Kurlanda then made the political and epistemic side of data work even more explicit.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Looking back at my notes now, this connection between experimentation and reflection is probably my main takeaway from the week. The interesting part is not only that digital tools can do more and more. They also force us to ask more precise questions about modelling, interpretation, transparency, and what we even mean when we call something data. In that sense, the workshops were not just an introduction to tools, but also to the questions that come with using them.<\/span><\/p>\n<p>\u00a0<\/p>\n<p><i><span style=\"font-weight: 400\">This blog post was written as part of a travel grant for the DHd 2026 conference. My sincere thanks go to NFDI4Memory for supporting my participation, and to the conference organizers for their excellent work in making the event such a positive experience.<\/span><\/i><\/p>\n","protected":false},"excerpt":{"rendered":"<p>During DHd2026 in Vienna, many discussions revolved around how digital tools shape the way we work with texts and data. Instead of trying to summarise the entire conference, this blog post focuses on the workshops I attended during the first days and on a few ideas that stayed with me throughout the week. Looking back [&hellip;]<\/p>\n","protected":false},"author":459,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1,103,534,10],"tags":[1943,206,1424,1946,98],"class_list":["post-23426","post","type-post","status-publish","format-standard","hentry","category-allgemein","category-community","category-konferenz","category-reflektion","tag-dhd2026","tag-python","tag-text-2","tag-whisper","tag-workshop"],"_links":{"self":[{"href":"https:\/\/dhd-blog.org\/index.php?rest_route=\/wp\/v2\/posts\/23426","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dhd-blog.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dhd-blog.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dhd-blog.org\/index.php?rest_route=\/wp\/v2\/users\/459"}],"replies":[{"embeddable":true,"href":"https:\/\/dhd-blog.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=23426"}],"version-history":[{"count":2,"href":"https:\/\/dhd-blog.org\/index.php?rest_route=\/wp\/v2\/posts\/23426\/revisions"}],"predecessor-version":[{"id":23434,"href":"https:\/\/dhd-blog.org\/index.php?rest_route=\/wp\/v2\/posts\/23426\/revisions\/23434"}],"wp:attachment":[{"href":"https:\/\/dhd-blog.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=23426"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dhd-blog.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=23426"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dhd-blog.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=23426"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}