Research

I am a researcher at the Department of Computer Science, University of Pisa, and a member of the Knowledge Discovery and Data Mining Laboratory (KDD Lab), a joint group with CNR-ISTI in Pisa. With a background in Digital Humanities (BS and MS, both cum laude) and a PhD in Computer Science—building on work at the CoLingLab—my research sits at the intersection of Natural Language Processing, text analytics, and computational social science.
I design machine learning and explainable methods and large-scale language pipelines to analyse opinions and behaviours, with a particular focus on human migration; my PhD explored how Big Data and sentiment analysis can be combined to study migration flows.
I have contributed to several Horizon 2020 projects (SoBigData, SoBigData++, HumMingBird, MeMind) and I am currently involved in SoBigData.it.
I have also been the course holder of the Text Analytics course in the MSc programmes in Digital Humanities and in Data Science and Business Informatics at the University of Pisa.

Publications

Thesis Supervision

  • Melasi, FabioPOPOLARE: A Populism and Polarization Classification Framework for Italian Texts. Corso di studi: Data Science and Business Informatics.
  • Coda-Giorgio, LucaSIMVALE: A Generalizable Simulator Validation Framework Combining Embedding Clustering and Feature-Based Metrics. Corso di studi: Data Science and Business Informatics.
  • Maggio, FrancescoBISON: predizione del successo interpretabile nei contenuti basati su NFT in Blockchain Online Social Media. Corso di studi: Informatica Umanistica.
  • Vasta, MarcoCloud-Based Data Warehousing: Migration and Reporting with SAP Datasphere and SAP Analytics Cloud. Corso di studi: Data Science and Business Informatics.
  • Gneri, JacopoModeling Toxic Users via Feature Extraction: An Interpretable Data-Driven Approach. Corso di studi: Data Science and Business Informatics.
  • Rossi, Martina FelicinaStudio delle lacune nei modelli di fake news detection: un'indagine data-driven. Corso di studi: Informatica Umanistica.
  • Ayushi, AyushiDesign and Implementation of a Cloud-Based AI System for Automated Document Processing: Integrating Google Cloud Storage, Document AI, and Vertex AI. Corso di studi: Data Science and Business Informatics.
  • Giada, PetraCRISP-DM IF: A Specialised Framework for Improving Demand Forecasting Pipelines with ReOrder Point and Clustering. Corso di studi: Data Science and Business Informatics.
  • Di Mauro, GianmarcoLinkedIn Data: uno sguardo attraverso LinkedIn alle carriere degli alumni e delle alumnae degli atenei toscani. Corso di studi: Informatica Umanistica.
  • Ricciarelli, GianmarcoScholarly Career Paths: Brain Drain and Exterophily. Corso di studi: Informatica.
  • Morini, VirginiaPolarizzazione politica & echo chamber: una metodologia per l'identificazione e analisi su Reddit. Corso di studi: Data Science and Business Informatics.
ETD