1. In the Polimedia.nl project we linked the proceedings (Handelingen) of the Dutch parliament to media databases, such as the daily newspapers and the texts read out in the radio news bulletins. Anyone can use this application to see, at the push of a button, what the media wrote about what was discussed and done in parliament. In the current project, Talk of Europe, the proceedings of the European Parliament will be curated and linked to national databases. We (Max Kemman, Martijn Kleppe, Jill Briggeman and I) carry out this project together with the VU, Dans, NISV, and the initiator Clarin.nl/Clarin Eric, with funding from NWO/OCW. A brief explanation of the project follows below.
ToE-TCC has three ambitions. The first is to translate the proceedings of the European Parliament debates (europarl) to CLARIN standards. In other words, the europarl data is curated into linked data so that it can be linked to and reused by other datasets and services. An interesting aspect of this dataset is that it is available in 21 languages, which makes it possible to link to other datasets in as many different languages.

The second ambition is to research how this linked data can be exploited by humanities scholars. The combination of linked datasets with a base available in 21 languages should make it possible to address research questions that have not been feasible so far. As such, interlingual comparative research through digital tools becomes possible. For comparison, where PoliMedia focused on how Dutch debates were covered in the Dutch media, the europarl data could be analysed to compare how European Parliament debates are covered in Dutch versus Polish media. Political scientists could analyse how subjects from EP debates relate to subjects in debates of national parliaments. In order to build helpful tools, we will investigate user requirements and possible research questions from humanities scholars. This second aim will be undertaken by Max Kemman, Martijn Kleppe and Henri Beunders from the History department of EUR.

The third ambition is to spark a transnational, European collaboration to create tools for scholars to analyse linked datasets. To this end, three week-long meetings will be organised, in which teams from all CLARIN countries are invited to participate. Think hackathon, but a week long and with a European focus.

2. A large European research project, funded by the EU, in which I take part is the AXES project. This project aims to build a multimedia search engine that makes all existing audiovisual archive material easier to find and analyse. With my Erasmus team (Martijn Kleppe, Max Kemman, Jill Briggeman) we investigate what the various groups of citizens, namely professionals, academics and home users, want and are able to do when it comes to finding information and moving images on the internet and in archives. A brief explanation of the project follows below.
The goal of AXES is to develop tools that provide various types of users with new engaging ways to interact with audiovisual libraries, helping them discover, browse, navigate, search and enrich archives. In particular, apart from a search-oriented scheme, we will explore how suggestions for audiovisual content exploration can be generated via a myriad of information trails crossing the archive. This will be approached from three perspectives (or axes): users, content, and technology.

Within AXES, innovative indexing techniques are developed in close cooperation with a number of user communities through tailored use cases and validation stages. Rather than just starting new investments in technical solutions, the project proposes the co-development of innovative paradigms of use and novel navigation and search facilities. We will target media professionals, educators, students, amateur researchers, and home users.

Based on an existing Open Source service platform for digital libraries, novel navigation and search functionalities will be offered via interfaces tuned to user profiles and workflow. To this end, AXES will develop tools for content analysis deploying weakly supervised classification methods. Information in scripts, audio tracks, wikis or blogs will be used for the cross-modal detection of people, places, events, etc., and for link generation between audiovisual content. Users will be engaged in the annotation process: with the support of selection and feedback tools, they will enable the gradual improvement of tagging performance.

AXES technology will open up audiovisual digital libraries, increasing their cultural value and their exposure to the European public and academia at large. The consortium is a perfect match to the multi-disciplinary nature of the project, with professional content owners, academic and industrial experts in audiovisual analysis, retrieval, and user studies, and partners experienced in system integration and project management.

3. A national project led by my staff members Max Kemman and Martijn Kleppe and me is a Clarin project entitled Polimedia. This project, carried out together with the TU Delft, the KB and Beeld & Geluid, aims to create links between the proceedings (Handelingen) of the Tweede Kamer (the Dutch House of Representatives), the newspapers, radio bulletins and TV programmes. A brief explanation of the project follows below. For more information, see www.polimedia.nl

Analysing media coverage across several types of media outlets is a challenging task for (media) historians. Up until now, the focus has been on newspaper articles: being generally available in digital, computer-readable format, these can be studied relatively easily. Cross-media comparisons between different types of media outlets have, however, rarely been undertaken, even though such comparisons have top priority on the wish-list of (media) historians, as they could give better insight into the choices that different media outlets make. A specific example of media coverage research investigates the coverage of political debates and how the representation of topics and people changes over time. The PoliMedia project aims to showcase the potential of cross-media analysis for research in the humanities, by (i) curating automatically detected semantic links between four data sets of different media types, and (ii) developing a demonstrator application that allows researchers to deploy such an interlinked collection for quantitative and qualitative analysis of media coverage of debates in the Dutch parliament.

Prof. Dr. Henri Beunders (project leader), Max Kemman MSc, Martijn Kleppe MA, Jill Briggeman MA.