Jan-Christoph Kalo

University of Amsterdam

Intelligent Data Engineering Lab (INDElab)

Amsterdam, Netherlands

Hi, I am Jan-Christoph Kalo, Assistant Professor at the University of Amsterdam in the Intelligent Data Engineering Lab (INDElab).

Research · Publications · Google Scholar · Email

My research studies what happens at the boundaries between different representations of knowledge. The same fact can live as a sentence in text, a row in a statistical table, a triple in a knowledge graph, a SQL query, a logical formula, or somewhere in the parameters of a language model — and moving between these forms is often lossy and context-dependent. Whether two representations actually encode the same knowledge depends on what population, time frame, or scope is assumed, and integrating them means making those assumptions explicit: this is what I call knowledge translation.

I work at the intersection of language models, knowledge graphs, and databases, combining semantic web, database, and NLP techniques. Expressing the same knowledge in multiple forms and studying where the versions disagree is a diagnostic — it tells us what each representation captures, what it silently drops, and where context is doing hidden work. The full portfolio of projects is on the Research page.

Beyond research, I teach databases in the Bachelor programme at UvA and supervise MSc and PhD projects. I currently supervise the PhD candidates Fabian Hoppe (deductive reasoning with LLMs; VU Amsterdam, LEMUR doctoral network), Trevor Pearce (knowledge graph construction from conversational data; UvA), and Lucas Lageweg (text-to-SQL over official statistics, with Statistics Netherlands). I have supervised more than 45 BSc and MSc theses, including winners of the ADS Thesis Award (2022) and the best AI bachelor thesis award of Lower Saxony (2019).

I co-organise the Dagstuhl Seminar “Large Language Models Meet Knowledge Graphs” (October 2026), the AKBC workshop, and the LM-KBC challenge on knowledge base construction from language models. I am part of the management team of the COST Action KGELL (CA24121), active in the COST Action GOBLIN (CA23147) on large-scale open multilingual knowledge graphs, and a contributor to the ISWS summer school.

Talks & tutorials

Hands-on tutorial “LLM4KGC”, SIKS & HI Course on Knowledge Graphs, online, June 2026
Tutorials at the International Semantic Web Summer School (ISWS), Bertinoro — 2024 and 2025
Invited talk, GOBLIN COST Action meeting, Prague, February 2025
Keynote “What do Language Models Know About the World?”, X-TAIL Workshop at EKAW 2024, Amsterdam
Invited lecture, FoMo Lectures, University of Amsterdam, June 2023

Resources

LOCuST — a multilingual benchmark for text-to-SQL over official statistics
LM-KBC challenge datasets — knowledge base construction from language models
KAMEL — a probing benchmark with multi-token entities
KnowlyBERT — hybrid query answering over language models and knowledge graphs
WILA-PopQA — a popularity-matched multilingual factual-recall benchmark

Before Amsterdam, I did my PhD at TU Braunschweig (2021) on representation heterogeneity in knowledge graphs — the same problem I now call knowledge translation — followed by a postdoc in the Knowledge Representation and Reasoning group at VU Amsterdam. A longer bio for talk introductions is here.

news

Jul 14, 2026	AKBC is back: we are organising the 10th Workshop on Automated Knowledge Base Construction, co-located with EMNLP 2026 in Budapest on October 28, 2026 — with keynotes by Alon Halevy (Google), Heng Ji (UIUC), and Mausam (IIT Delhi). The call for papers is open: research papers (also via ARR), vision papers, and a shared task on knowledge base construction from LLMs. Details: https://akbc.ws/2026/
Jul 14, 2026	Together with Angela Bonifati, Jeff Z. Pan, Simon Razniewski, and Luke Zettlemoyer, I am organising Dagstuhl Seminar 26411 “Large Language Models Meet Knowledge Graphs”, Oct 4–9, 2026.
Jun 23, 2026	Gave a hands-on tutorial on LLM-based knowledge graph construction at the SIKS & HI Course on Knowledge Graphs.
Apr 22, 2026	New paper at the KG-LLM Workshop (LREC 2026): A Wikidata-Based Framework to Measure Cross-Lingual Bias in Multilingual Large Language Models. We introduce WILA-PopQA, a popularity-matched multilingual benchmark across 9 languages, and disentangle three factors that multilingual probing benchmarks usually confound: the language of the question, the language of the entity, and entity popularity. Across 12 open-weight LLMs, the language of the question turns out to be the dominant factor, and matching it to the entity’s language does not reliably improve factual recall.
Sep 10, 2025	Our paper on the robustness of deductive reasoning with LLMs was presented at ECAI 2025. See it on the publications page: Robustness paper entry. Short description: We study how small prompt and input variations affect deductive reasoning, analyse common failure modes, and outline an evaluation setup for robustness.

selected publications

KG-LLM

A Wikidata-Based Framework to Measure Cross-Lingual Bias in Multilingual Large Language Models

Mouloud Iferroudjene, Lisa Poggel, Andrea Schimmenti, and 4 more authors

In Proceedings of the Workshop on Knowledge Graphs and Large Language Models (KG-LLM @ LREC 2026), 2026

PDF Code
ACL

ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events

Duygu Sezen Islakoglu, and Jan-Christoph Kalo

In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), 2025

DOI arXiv HTML PDF
ECAI

Investigating the Robustness of Deductive Reasoning with Large Language Models

Fabian Hoppe, Filip Ilievski, and Jan-Christoph Kalo

In Proceedings of the 27th European Conference on Artificial Intelligence (ECAI 2025), 2025

DOI arXiv HTML
TGDK

Large Language Models and Knowledge Graphs: Opportunities and Challenges

Jeff Z. Pan, Simon Razniewski, Jan-Christoph Kalo, and 8 more authors

Transactions on Graph Data and Knowledge (TGDK), 2023

DOI arXiv HTML
AKBC

KAMEL: Knowledge Analysis with Multitoken Entities in Language Models

Jan-Christoph Kalo, and Leandra Fichtel

In 4th Conference on Automated Knowledge Base Construction (AKBC 2022), 2022

HTML
ISWC

KnowlyBERT – Hybrid Query Answering over Language Models and Knowledge Graphs

Jan-Christoph Kalo, Leandra Fichtel, Philipp Ehler, and 1 more author

In International Semantic Web Conference (ISWC 2020), 2020

DOI HTML