Present

Hi! I am Anupam. I am currently working as an Assistant Professor at the Department of Computer Science and Engineering, IIT Hyderabad. My research interests lie in and around database systems. My current research focuses on the Testing and Benchmarking of Database Systems and AI for Data Engineering.

Experience

Assistant Professor  Oct 2025 - Present

CSE Dept., IIT Hyderabad.

Postdoctoral Researcher  May 2024 - Aug 2025

TU Darmstadt, Germany.
Host: Prof. Carsten Binnig

Research Scientist  Aug 2022 - Feb 2024

IBM Research, Bengaluru.

Technical Project Leader  Aug 2016 - Aug 2017

Huawei Technologies, Bengaluru.

Education

Ph.D. - Computer Science and Engineering  2017 - 2022

Indian Institute of Science, Bangalore.
Advisor: Prof. Jayant Haritsa
Thesis: Hydra: A Dynamic Approach to Database Regeneration.

M.E. - Computer Science and Engineering  2014 - 2016

Indian Institute of Science, Bangalore.

B.Tech. - Information Technology  2010 - 2014

Jaypee Institute of Information Technology, NOIDA.

Publications

Unveiling Challenges for LLMs in Enterprise Data Engineering

J. Bodensohn, U. Brackmann, L. Vogel, A. Sanghi, and C. Binnig
to be presented in 52nd Intl. Conf. on Very Large Data Bases (VLDB), Boston, MA, USA, 2026 [in press]

Beyond Row Counts: Enhancing Workload-Aware Data Synthesis

A. Sanghi
EDBT Workshop: 27th Intl. Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP), Barcelona, Spain, March 2025.

LLMs for Enterprise Data Engineering

J. Bodensohn, L. Vogel, A. Sanghi, and C. Binnig
ELLIS Workshop on Representation Learning and Generative Models for Structured Data, Amsterdam, Netherlands, February 2025

Automating Enterprise Data Engineering with LLMs

J. Bodensohn, U. Brackmann, L. Vogel, A. Sanghi, and C. Binnig
NeurIPS Workshop: Table Representation Learning Workshop (TRL), Vancouver, Canada, December 2024.

LLMs for Data Engineering on Enterprise Data

J. Bodensohn, U. Brackmann, L. Vogel, M. Urban, A. Sanghi, and C. Binnig
VLDB Workshop: Tabular Data Analysis Workshop (TaDA), Guangzhou, China, September 2024.

Surprise Benchmarking: The Why, What, and How

L. Benson, C. Binnig, J. Bodensohn, F. Lorenzi, J. Luo, D. Porobic, T. Rabl, A. Sanghi, R. Sears, P. Tözün, and T. Ziegler (alphabetically sorted)
SIGMOD Workshop: 10th Intl. Workshop on Testing Database Systems (DBTest), Santiago, Chile, June 2024.

Tabular Data Synthesis with GANs for Adaptive AI Models

S. Hans*, A. Sanghi*, and D. Saha
Proc. of 7th Joint Intl. Conf. on Data Science & Management of Data (CODS-COMAD), Bangalore, India, January 2024.
* (equal contribution)

Synthetic Data Generation for Enterprise DBMS   (tutorial)

A. Sanghi, and J. Haritsa
Proc. of 39th IEEE Intl. Conf. on Data Engineering (ICDE), Anaheim, California, USA, April 2023.

Semantic Automation for Data Discovery   (tutorial)

Rajmohan C, R. Chaudhuri, B. Ganesan, A. Sanghi, A. Agarwal, and S. Mehta
Proc. of 6th Joint Intl. Conf. on Data Science & Management of Data (CODS-COMAD), January 2023.

Projection-Compliant Database Generation

A. Sanghi, S. Ahmed, and J. Haritsa
PVLDB Journal, 15(5), January 2022, pgs. 998-1010
presented in 48th Intl. Conf. on Very Large Data Bases (VLDB), Sydney, Australia, September 2022

Towards Generating HiFi Databases

A. Sanghi, Rajkumar S., and J. Haritsa
Proc. of 26th Intl. Conf. on Database Systems for Advanced Applications (DASFAA), Taipei, Taiwan ROC, April 2021

HYDRA: A Dynamic Big Data Regenerator   (demo)

A. Sanghi, R. Sood, D. Singh, J. Haritsa, and S. Tirthapura
PVLDB Journal, 11(12), August 2018, pgs. 1974-1977
presented in 44th Intl. Conf. on Very Large Data Bases (VLDB), Rio de Janeiro, Brazil, August 2018

Scalable and Dynamic Regeneration of Big Data Volumes

A. Sanghi, R. Sood, J. Haritsa, and S. Tirthapura
Proc. of 21st Intl. Conf. on Extending DataBase Technology (EDBT), Vienna, Austria, March 2018

Achievements

  • Our paper received the Best Short Paper Award at VLDB Workshop on Tabular Data Analysis (TaDA), Aug. 2024.
  • Received Distinguished Reviewer Award for Applied Data Science Research Track, CODS-COMAD, 2024.
  • Awarded IBM PhD Fellowship 2019. Thanks IBM!
  • Received Microsoft Research Travel Grant and VLDB Travel Fellowship to visit VLDB 2018, Rio de Janeiro, Brazil, and VLDB 2022, Sydney, Australia.
  • Received Best Poster Award at the Young Researchers' Symposium, CoDS-COMAD 2018.
  • Received the Future Star Award 2017 at Huawei Technologies India Pvt. Ltd.

Service

  • Proceedings co-chair for SIGMOD 2026.
  • Program committee member in SIGMOD 2026, VLDB 2026, EDBT 2026 (demo track), DBTest 2024, CODS-COMAD 2024 (Jan. edition), CODS-COMAD 2024 (Dec. edition).
  • Participant of the Dagstuhl Seminar on Challenges and Opportunities of Table Representation Learning, 2025.
  • Participant of the Dagstuhl Seminar on Ensuring Reliability and Robustness of Database Management Systems, in 2021 and 2023.
  • Member of the Diversity Council at IBM Research, India, 2023-24.
  • Session Chair for Benchmarking, Performance Modeling, Tuning, and Testing at ICDE 2023.
  • Member of the Student Welfare Committee at CSA Dept, IISc during 2020-2022.