Farrukh Baratov, co-supervised by Niki van Stein in the ICAI lab AI4Oversight: XAI-driven Compliance Detection in Dutch Regulatory Contexts (2025-now).
Sabijn Perdijk, co-supervised by Max van Duijn and Gijs Wijnholds: StorytimeLM – Improving the Effectiveness and Efficiency of Language Models (LMs) with Narrative Data (2025-now, LIACS)
Jia-Huei (Dylan) Ju, co-supervised with Andrew Yates in the Hybrid Intelligence consortium (2024-now, UvA)
Jujia Zhao, co-supervised by Zhaochun Ren: Integrating search and recommendation with generative models (2024-now, LIACS)
Ali Satvati, RUG, co-supervised by Fatih Turkmen: privacy and security of Large Language Models (2023-now, NWA LESSEN)
Murad Bozik: Emotion-aware text-to-speech (2023-now, LIACS and DAISYS)
Andras Paraskeva: Parsimonious architectures for Large Language Models, co-supervised with dr. Jan van Rijn (2023-now, LIACS, NWA LESSEN)
Yumeng Wang: Explainability for conversational agents (2023-now, LIACS, NWA LESSEN)
Sophia Althammer, TU Wien, co-supervised with Allan Hanbury on the project Addressing Data Availability and Document-to-Document Retrieval for Domain-specific Neural Rankers, part of the H2020 DoSSIER project (2020-2023)
Gineke Wiggers: The Relevance of Impact. Bibliometric-enhanced legal information retrieval (2017-2022, collaboration with Faculty of Law). PhD defence: 8 March 2023
Anne Dirkson: Knowledge Discovery from Patient Forums. Gaining novel medical insights from patient experiences. (2018-2022, collaboration with LUMC). PhD defence: 6 December 2022
Xue Wang: Multi Modal Representation Learning and Cross-Modal Semantic Matching (2019-2022, LIACS, collaboration with Xi’an Jiaotong University). PhD defence: 24 June 2022.
Alex Brandsen: Digging in Documents. Utilizing text mining to access the hidden knowledge in archeological grey literature (2017-2021, collaboration with Faculty of Archaeology). PhD defence: 15 February 2022
Benjamin van der Burgh, research assistant on health NLP projects, 2017-2019
Current projects I am involved in
The 4D PICTURE Project: Design-based Data-Driven Decision-support Tools: Producing Improved Cancer Outcomes Through User-centred Research, 2022-2026 (news article)
LESSEN: Low-Resource Chat-based Conversational Intelligence, 2022-2026. Coordinated by the University of Amsterdam. Funded by NWO in the NWA-ORC programme.
SENTENCES: Social mEdia aNalysis To promotE caNCEr Screening, 2020-2024. Coordinated by the University of Amsterdam (Gert-Jan de Bruijn). Funded by ZonMW in the call ‘Vroege Opsporing’.
BEL (Behaviour, Energy transition, Low income): “Tenants’ behavioural responses to residential energy transition: are intended energy savings feasible?”, 2020-2026. Coordinated by TUe (Ioulia Ossokina). Funded by NWO in the call ‘Transities en Gedrag’.
DoSSIER: Domain Specific Systems for Information Extraction and Retrieval (co-applicant), 2020-2024. Funded by the European Union’s Horizon 2020 Innovative Training Networks (ITN) programme under the Marie Skłodowska-Curie actions
RISE_SMA: Social Media Analytics for Society and Crisis Communication (co-applicant), 2019-2023. Funded by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement
Digital tools for knowledge extraction for (rare) cancers (co-applicant), Voucher project funded by the Ministry of Health. With 4 cancer patient communities, in collaboration with TNO.
SmartFile: from keyboard to patient, and the follow-up project, ‘Learning from registration’, (co-applicant), in collaboration with Hogeschool Codarts, the startup company ‘SmartFile’, 10 sports physiotherapy practices and the Dutch Association for Physical Therapy in Sport Healthcare (Funded by RAAK-SIA)
The reach of junk news on Facebook, in collaboration with Nieuwscheckers.
Empowerment of patients (co-applicant), pilot project for hypothesis generation based on text mining from patient forums, in collaboration with TNO and LUMC (funded by SIDN)
DISCOSUMO: Discussion Thread Summarization for Mobile Devices (financed by NWO Creative Industries), with Tilburg University and Sanoma Media BV (2015-2019)
Wisdom of the crowds, Patient empowerment in online support communities (casus 3) (co-applicant), with RIVM and Kanker.nl (2016)
QUINN (main applicant): Query Updates for News moNitoring (financed by a COMMIT valorization grant), with TNO and LexisNexis (2015)
SWELL: smart reasoning systems for well-being at work and at home (financed by COMMIT), with TNO, Philips, Noldus, University of Twente, Roessingh R&D, Innovalor, and Sense/Almende (2012-2016)
RemBench (main applicant): A Digital Workbench for Rembrandt Research (financed by CLARIN-NL), with Huygens ING and RKD (2013-2014)
Rembrandt Documents: A new digital infrastructure for accessing, analysing, and interpreting original written and printed documents related to the life and art of the world-renowned Dutch painter Rembrandt van Rijn (1606-1669), with Huygens ING and the Rembrandthuis Museum (2012-2013).
PoliticalMashup: Automatic classification of political texts, with University of Amsterdam (2012-2013)
Route 66 explorative text mining, for De Baak (2012)
ComPoli: Communicatie en revalidatie digiPoli, with Sint Maartenskliniek (2011-2012)