Humboldt-Universität zu Berlin - Faculty of Mathematics and Natural Sciences - Databases and Information Systems

Mahdi Esmailoghli

Mahdi Esmailoghli
Freepik - Flaticon

I am a Postdoctoral researcher in the Database and Information Systems (DBIS) group at Humboldt-Universität zu Berlin under the supervision of Prof. Dr. Matthias Weidlich.

I completed my Ph.D. under the supervision of Prof. Dr. Ziawasch Abedjan at TU Berlin with Summa cum laude. During my doctoral studies, my research focused on data discovery in data lakes, in particular, I developed a holistic system to efficiently explore large data lakes to enhance the data at hand to train more effective machine learning (ML) models. I received my M.Sc. degree from Amirkabir University of Technology (Tehran Polytechnic).

My current research focuses on supporting scientific workflow developers by building comprehensive knowledge bases and exploring methods to effectively share insights and expertise gained from existing workflows with new developers across diverse domains.

 

Selected Publications

  • "Every Data Lake Has a Past: Analytical Exploration of Wikipedia History as a Temporal Data Lake": Mahdi Esmailoghli, Steven Purtzel, Roee Shraga, Renée J. Miller, Matthias Weidlich, DOLAP 2026
  • "Data Discovery in Data Lakes: Operations, Indexes, Systems": Ziawasch Abedjan, Mahdi Esmailoghli, Sainyam Galhorta, ICDE 2026
  • "The Past Still Matters: A Temporally-Valid Data Discovery System.": Mahdi Esmailoghli, Matthias Weidlich, arXiv preprint arXiv:2510.13662 (2025)
  • "FlowPilot: A Suggestion System for Designing Scientific Workflows": Mahdi Esmailoghli, Matthias Weidlich, SIGMOD 2026
  • "Data Discovery in Data Lakes: Operations, Indexes, Systems": Ziawasch Abedjan, Mahdi Esmailoghli, Sainyam Galhorta, VLDB 2025
  • "Blend: A Unified Data Discovery System": Mahdi Esmailoghli, Christoph Schnell, Renée J. Miller, Ziawasch Abedjan. ICDE 2025
  • "Demonstrating MATE and COCOA for Data Discovery": Jannis Becktepe, Mahdi Esmailoghli, Maximilian Koch, Ziawasch Abedjan. SIGMOD 2023
  • "MATE: multi-attribute table extraction": Mahdi Esmailoghli, Jorge-Arnulfo Quiané-Ruiz, Ziawasch Abedjan. VLDB 2022
  • "COCOA: COrrelation COefficient-Aware Data Augmentation.": Mahdi Esmailoghli, Jorge-Arnulfo Quiané-Ruiz, Ziawasch Abedjan. EDBT 2021
  • "CAFE: Constraint-Aware Feature Extraction from Large Databases": Mahdi Esmailoghli, Ziawasch Abedjan. CIDR 2020

 

Profiles and Bibliography

 

Awards & Recognitions

  • Two-year funding from DAAD Programme for Project-Related Personal Exchange (PPP) as Principal Investigator
  • CIKM 2023 - Distinguished Reviewer
  • GI Data Science Challenges at BTW 2023 - First Prize
  • GI Data Science Challenges at BTW 2019 - First Prize
  • National University Entrance Exam Exemption Award (for M.Sc. degree)
  • Top 1 student based on GPA, in the B.Sc degree
  • Ranked #4 in the 19th National Computer Olympiad Of Iran, September 2014

Academic Service

  • ACM SIGMOD 2026 - International Conference on Management of Data - PC Member
  • VLDB 2026 - 52nd International Conference on Very Large Databases - PC Member
  • ICDE 2025 - 41st IEEE International Conference on Data Engineering - PC Member/Reviewer Research Track
  • IEEE BigData 2024 - IEEE International Conference on Big Data - PC Member/Reviewer Research Track
  • CIKM 2024 - 33rd ACM International Conference on Information and Knowledge Management - PC Member/Reviewer Research Track
  • CIKM 2023 - 32nd ACM International Conference on Information and Knowledge Management - PC Member/Reviewer Research Track