Make it (net)work! - MLA2S Networking Seminar #07
7th Network Seminar of the Thematic Platform MLA²S.
Seminarrooms 1 and 2, PSK 3rd floor
Zoom Connection: [removed]
To register: click "register" below here.
-
-
13:00
→
13:30
News from MLA2S and Make it (net)work! 30m
Including a short round table news update.
Speakers: Dr Claudius Krause (HEPHY Vienna (ÖAW)), Claus Trost (Erich Schmid Institute of Materials Science of theAustrian Academy of Sciences), Jan Odstrčilík (IMAFO), Kati Heinrich (IGF | ÖAW), Nicki Holighaus (Acoustics Research Institute, Austrian Academy of Sciences) -
13:30
→
13:35
Introduction to Research at ACDH 5mSpeaker: Alexandra Lenz (ACDH)
-
13:35
→
14:00
Using LLMs to write a historical dialect dictionary: challenges and opportunities for the WBÖ 25m
In this talk, we investigate the potential of LLMs in supporting lexicographic work on non[1]standard linguistic varieties using data from the Dictionary of Bavarian Dialects in Austria (WBÖ). Based on approx. 2.4 million digitized and TEI-encoded dialect paper slips published via the Lexical Information System Austria (LIÖ), we construct a domain-specific corpus and evaluate LLMs in semantic classification and dictionary entry generation. Key preparatory steps include metadata enrichment, glossary and ontology development, and prompt engineering combined with Retrieval-Augmented Generation (RAG) techniques. Preliminary results suggest that LLMs can assist in organizing dialectal material into coherent semantic groupings. However, challenges persist regarding data preprocessing, structural conformity, and selection of representative examples. We discuss methodological implications and outline future directions, including the integration of agent-based systems and fine-tuning approaches tailored to dialect resources. This study contributes to the broader discourse on AI-assisted lexicography, highlighting both the potential and limitations of current LLM technologies in handling underrepresented language varieties.
Speakers: Daniel Elsner (ACDH), Katharina Korecky-Kröll (ACDH), Philipp Stöckle (ACDH), Wolfgang Koppensteiner (ACDH) -
14:00
→
14:15
Discussion 15m
-
14:15
→
14:40
Using Word Embeddings to analyse differences and shifts of semantics over time, space, and corpora: Overview of architectures, implementations, and evaluations, with a focus of applications on the Austria Media Corpus. 25m
Word Embeddings are at the core of the majority of today's NLP architectures. They allow for a mathematical approximation of semantics by learning the contextual patterns
of words and by mapping those to high dimensional vector spaces. In that space, words which are semantically related to each other, also map to vectors that are equally close to each other. We utilize these architectures to detect differences and shifts of
meanings of words by comparing their respective vectors across time, space, and corpora. In this talk, we will give an overview of the used architectures and their implementations: static word embeddings suchs as word2vec, GloVe, fastText, and contextual ones
such as BERT and RoBERTa, and how they are evaluated and applied on the Austrian Media Corpus.Speaker: Stefan Resch (ACDH) -
14:40
→
14:55
Discussion 15m
-
14:55
→
15:05
Break to walk a few steps and get some fresh air in. 10m
-
15:05
→
15:30
Model Context Protocol (MCP) Servers for Research, Automation, and AI Experiments 25m
The Model Context Protocol is an open standard introduced by Anthropic in November 2024 that enables Large Language Models to interact with external tools, APIs, and data sources in a structured manner. MCP servers function as a standardized interface and solve the problem of fragmented integrations: Instead of developing individual connections for each combination of AI application and data source, MCP enables a universal architecture. Hosts, clients, and servers communicate via JSON-RPC 2.0. The three core primitives – Tools, Resources, and Prompt Templates – enable modular, reusable solutions and foster a rapidly growing open-source ecosystem for scalable, context-aware AI systems. In my presentation, I will briefly explain the fundamentals of the protocol before presenting application examples from my scholarly practice and self-developed servers.
Speaker: Robert Klugseder (ACDH) -
15:30
→
15:45
Discussion 15m
-
15:45
→
16:45
Networking with refreshments / further discussions 1h
-
13:00
→
13:30