Valentine
ICDE 2021 · VLDB Demo 2021
Schema matching made accessible to everyone — an open library and benchmark for evaluating matching techniques on tabular data.
Postdoctoral Research Associate at NYU VIDA.
I build data integration solutions for practitioners and developers, working across biomedical and healthcare, open government, and organizational data. My research covers data matching at the schema and value levels, metadata generation, and the composite harmonization workflows that bring these components together.
Before NYU, I received my PhD from the Web Information Systems group at TU Delft, advised by Asterios Katsifodimos and Christoph Lofi. I obtained my MPhil at HKUST and my MEng in ECE at NTUA.
VLDB 2025
Joinability discovery in data products using Graph Neural Networks. Developed during my Applied Scientist internship at AWS in 2022.
Patterns · SIGMOD Demo 2026
A programmable and conversational toolkit for biomedical data harmonization. Pairs traditional matchers with LLM-driven workflows.
VIS 2025 · SIGMOD Demo 2026
An interactive visualization system that helps domain experts verify and curate schema matches, with LLM-powered validation built in.
VLDB 2026/2027 · ICDE 2026/2027 · SIGMOD 2026 · ICDE 2024 Demonstration Track · ECML/PKDD 2024 Applied Data Science Track · DBML 2022–2024, 2026 (ICDE Workshop) · AIDB 2026 (VLDB Workshop)
VLDB 2023/2024 · IEEE ICDE 2019/2023 · EDBT 2024 · ACM SIGMOD 2018/2021 · ACM SIGSPATIAL 2017 · SSTD 2017
Data & Knowledge Engineering (DKE) · Patterns (Cell Press) · Information Systems