Skip to content

Welcome

Hispanistica documents Digital Humanities projects in Hispanic linguistics at Philipps University of Marburg, highlighting their technical foundations, versioned development workflows, and adherence to FAIR principles. It provides an overview of research infrastructures, corpora, and digital teaching tools developed under the direction of Prof. Dr. Felix Tacke.


Approach

In my Digital Humanities projects, I draw on recent developments in Large Language Models, generative AI, and modern web technologies to make research data and teaching materials transparent, versioned, and sustainable over time. Lightweight and reproducible technical infrastructures now make it possible to develop specialised tools for Hispanic linguistics without relying on extensive institutional IT resources.

I work openly, transparently, and in accordance with FAIR principles: data, workflows, and technical decisions should be understandable and reusable. For this reason, the complete source code of my projects — including build configurations, release workflows, and documentation — is freely accessible on GitHub:

Projects

CO.RA.PAN (Corpus Radiofónico Panhispánico)

CO.RA.PAN Logo

A large-scale corpus of contemporary broadcast Spanish from nearly all Spanish-speaking countries.

CO.RA.PAN documents pluricentric Spanish through professionally produced radio news, supported by a robust technical pipeline: raw audio is processed into time-aligned, linguistically annotated JSON (POS, lemma, morphology, dependency), which is indexed through BlackLab Server for fast corpus search. A custom web application integrates transcript navigation, metadata filters, and synchronized audio playback, offering researchers a unified interface for exploring more than 1.4 million words of spoken data.

MAR.ELE

MAR.ELE Logo

A pronunciation corpus capturing spoken Spanish from learners across proficiency levels and linguistic backgrounds.

The project combines controlled recordings with transparent annotation workflows to support phonetic and interlanguage research. Its technical foundation is a containerized Flask web application featuring audio–text synchronization, speaker metadata views, and tools for segment-level comparison across recordings. The architecture ensures reproducibility, modularity, and long-term maintainability, making MAR.ELE a scalable platform for empirical studies in second-language pronunciation.

Spanish Linguistics @ School

Spanisch Toolkit Logo

A digital reference book for Spanish language teachers with a focus on linguistic awareness and classroom applicability.

Implemented using MkDocs and the Material for MkDocs theme, the project leverages page navigation, integrated search, and responsive layouts to emulate the readability of a structured textbook in a fully static-site environment. Through custom HTML/JS/CSS extensions, the toolkit supports Leaflet-based interactive maps, embedded audio players, and specialized components for linguistic examples, enabling dynamic didactic materials far beyond the standard MkDocs feature set. The platform supports collaborative OER development and sustainable long-term expansion.


GitHub Hispanistica – @ Marburg