Irena SpasicIrena Spasić is a Professor in Cardiff’s School of Computer Science and Informatics, where she is also the Director of Research and the leader of the Text & Data Mining theme. She has a long track record of active interdisciplinary collaboration. Her research interests include text mining, knowledge representation, machine learning and their applications in social sciences, social media, life sciences and healthcare. Her team was ranked first in a 2008 NIH-funded challenge for disease status classification from hospital discharge summaries (https://www.i2b2.org/NLP/). She also led a team that was ranked third on information extraction from discharge summaries (2009), and ranked first in extracting types of information that proved the most difficult to model. Irena is overseeing the design and construction of the web-based infrastructure that brings together the corpus enquiry tools, taggers and pedagogic toolkit into a single web-based tool for the CorCenCC corpus.