Christos Koutras

Postdoctoral Research Associate at NYU VIDA.

I build data integration solutions for practitioners and developers, working across biomedical and healthcare, open government, and organizational data. My research covers data matching at the schema and value levels, metadata generation, and the composite harmonization workflows that bring these components together.

Before NYU, I received my PhD from the Web Information Systems group at TU Delft, advised by Asterios Katsifodimos and Christoph Lofi. I obtained my MPhil at HKUST and my MEng in ECE at NTUA.

Selected projects

TU Delft

Valentine

ICDE 2021 · VLDB Demo 2021

Schema matching made accessible to everyone: an open library and benchmark for evaluating matching techniques on tabular data.

OmniMatch

VLDB 2025

Joinability discovery in data products using Graph Neural Networks. Developed during my Applied Scientist internship at AWS in 2022.

NYU

BDI-Kit

Patterns · SIGMOD Demo 2026

A programmable and conversational toolkit for biomedical data harmonization. Pairs traditional matchers with LLM-driven workflows.

BDIViz

VIS 2025 · SIGMOD Demo 2026

An interactive visualization system that helps domain experts verify and curate schema matches, with LLM-powered validation built in.

News

Valentine has been upgraded to version 1.0.0! This release includes several new features and improvements. Check out what's new!
BDI-Kit: An AI-powered toolkit for biomedical data harmonization was published in Patterns (Cell Press).
Our interactive system for biomedical schema matching with LLM-powered validation, BDIViz, was presented at IEEE VIS 2025 in Vienna.
OmniMatch: Joinability Discovery in Data Products was presented at VLDB 2025 in London.
Joined the VIDA Research Center at NYU Tandon as a Postdoctoral Research Associate.

Awards

IEEE ICDE Distinguished PC Award: for outstanding contributions in reviewing and discussions on the Program Committee.

Service

Program Committee

VLDB 2026/2027 · ICDE 2026/2027 · SIGMOD 2026 · ICDE 2024 Demonstration Track · ECML/PKDD 2024 Applied Data Science Track · DBML 2022–2024, 2026 (ICDE Workshop) · AIDB 2026 (VLDB Workshop)

External Reviewing

VLDB 2023/2024 · IEEE ICDE 2019/2023 · EDBT 2024 · ACM SIGMOD 2018/2021 · ACM SIGSPATIAL 2017 · SSTD 2017

Reviewing for Journals

Data & Knowledge Engineering (DKE) · Patterns (Cell Press) · Information Systems