Present
Hi! I am Anupam. I am currently working as an Assistant Professor at the Department of Computer Science and Engineering, IIT Hyderabad. My research interests lie in and around database systems. My current research focuses on the Testing and Benchmarking of Database Systems and AI for Data Engineering.
Bio
Assistant Professor Oct 2025 - PresentCSE Dept., IIT Hyderabad. |
Postdoctoral Researcher May 2024 - Aug 2025TU Darmstadt, Germany.Host: Prof. Carsten Binnig |
Research Scientist Aug 2022 - Feb 2024IBM Research, Bengaluru. |
Ph.D. - Computer Science and Engineering 2017 - 2022Indian Institute of Science, Bangalore.Advisor: Prof. Jayant Haritsa Thesis: Hydra: A Dynamic Approach to Database Regeneration. |
Technical Project Leader Aug 2016 - Aug 2017Huawei Technologies, Bengaluru. |
M.E. - Computer Science and Engineering 2014 - 2016Indian Institute of Science, Bangalore. |
Publications
Boosting DBMS Test Coverage via LLM-Driven SQL Generation |
| E. Abdelkarim, C. Binnig and A. Sanghi SIGMOD Workshop: 11th Intl. Workshop on Testing Database Systems (DBTest), Bengaluru, India, June, 2026. |
Generating Databases from Natural Language Specification |
| A. Mitra, and A. Sanghi SIGMOD 2026 Workshop: Workshop on Synthetic Data Generation and Management for Building AI Systems (SynthAI), Bengaluru, India, May, 2026. |
Eleventh International Workshop on Testing Database Systems (DBTest) |
| A. Gruenheid, and A. Sanghi Companion of the SIGMOD Intl. Conf. on Management of Data, 2026. |
Unveiling Challenges for LLMs in Enterprise Data Engineering |
| J. Bodensohn, U. Brackmann, L. Vogel, A. Sanghi, and C. Binnig PVLDB Journal, 19(2), 2025, pgs. 196 - 209 to be presented in 52nd Intl. Conf. on Very Large Data Bases (VLDB), Boston, MA, USA, 2026. |
Beyond Row Counts: Enhancing Workload-Aware Data Synthesis |
| A. Sanghi EDBT Workshop: 27th Intl. Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP), Barcelona, Spain, March 2025. |
LLMs for Enterprise Data Engineering |
| J. Bodensohn, L. Vogel, A. Sanghi, and C. Binnig ELLIS Workshop on Representation Learning and Generative Models for Structured Data, Amsterdam, Netherlands, February 2025. |
Automating Enterprise Data Engineering with LLMs |
| J. Bodensohn, U. Brackmann, L. Vogel, A. Sanghi, and C. Binnig NeurIPS Workshop: Table Representation Learning Workshop (TRL), Vancouver, Canada, December 2024. |
LLMs for Data Engineering on Enterprise Data |
| J. Bodensohn, U. Brackmann, L. Vogel, M. Urban, A. Sanghi, and C. Binnig VLDB Workshop: Tabular Data Analysis Workshop (TaDA), Guangzhou, China, September 2024. |
Surprise Benchmarking: The Why, What, and How |
| L. Benson, C. Binnig, J. Bodensohn, F. Lorenzi, J. Luo, D. Porobic, T. Rabl, A. Sanghi, R. Sears, P. Tözün, and T. Ziegler (alphabetically sorted) SIGMOD Workshop: 10th Intl. Workshop on Testing Database Systems (DBTest), Santiago, Chile, June 2024. |
Tabular Data Synthesis with GANs for Adaptive AI Models |
| S. Hans*, A. Sanghi*, and D. Saha Proc. of 7th Joint Intl. Conf. on Data Science & Management of Data (CODS-COMAD), Bangalore, India, January 2024. * (equal contribution) |
Synthetic Data Generation for Enterprise DBMS (tutorial) |
| A. Sanghi, and J. Haritsa Proc. of 39th IEEE Intl. Conf. on Data Engineering (ICDE), Anaheim, California, USA, April 2023. |
Semantic Automation for Data Discovery (tutorial) |
| Rajmohan C, R. Chaudhuri, B. Ganesan, A. Sanghi, A. Agarwal, and S. Mehta Proc. of 6th Joint Intl. Conf. on Data Science & Management of Data (CODS-COMAD), January 2023. |
Projection-Compliant Database Generation |
| A. Sanghi, S. Ahmed, and J. Haritsa PVLDB Journal, 15(5), January 2022, pgs. 998-1010 presented in 48th Intl. Conf. on Very Large Data Bases (VLDB), Sydney, Australia, September 2022. |
Towards Generating HiFi Databases |
| A. Sanghi, Rajkumar S., and J. Haritsa Proc. of 26th Intl. Conf. on Database Systems for Advanced Applications (DASFAA), Taipei, Taiwan ROC, April 2021. |
HYDRA: A Dynamic Big Data Regenerator (demo) |
| A. Sanghi, R. Sood, D. Singh, J. Haritsa, and S. Tirthapura PVLDB Journal, 11(12), August 2018, pgs. 1974-1977 presented in 44th Intl. Conf. on Very Large Data Bases (VLDB), Rio de Janeiro, Brazil, August 2018. |
Scalable and Dynamic Regeneration of Big Data Volumes |
| A. Sanghi, R. Sood, J. Haritsa, and S. Tirthapura
Proc. of 21st Intl. Conf. on Extending DataBase Technology (EDBT), Vienna, Austria, March 2018. |
Patents
Dynamic semantic synopsis generation for datasets in data catalog |
| Rajmohan C, A. Sanghi, and A. Agarwal
U.S. Patent No. 12,554,932, February 2026. |
Teaching
- CS3563: Database Management Systems 2 course in Jan 2026.
Achievements
- Our paper received the Best Short Paper Award at VLDB Workshop on Tabular Data Analysis (TaDA), Aug. 2024.
- Received Distinguished Reviewer Award for Applied Data Science Research Track, CODS-COMAD, 2024.
- Awarded IBM PhD Fellowship 2019. Thanks IBM!
- Received Microsoft Research Travel Grant and VLDB Travel Fellowship to visit VLDB 2018, Rio de Janeiro, Brazil, and VLDB 2022, Sydney, Australia.
- Received Best Poster Award at the Young Researchers' Symposium, CoDS-COMAD 2018.
- Received the Future Star Award 2017 at Huawei Technologies India Pvt. Ltd.
Service
- Workshop co-chair for DBTest 2026.
- Proceedings co-chair for SIGMOD 2026.
- Program committee member for VLDB 2027, ICDE 2027, SIGMOD 2026, VLDB 2026, EDBT 2026 (demo track), DBTest 2024, CODS-COMAD 2024 (Jan. edition), CODS-COMAD 2024 (Dec. edition).
- Organizing committee member for ACM India ARCS 2026.
- Participant of the Dagstuhl Seminar on Challenges and Opportunities of Table Representation Learning, 2025.
- Participant of the Dagstuhl Seminar on Ensuring Reliability and Robustness of Database Management Systems, in 2021 and 2023.
- Session Chair for Benchmarking, Performance Modeling, Tuning, and Testing at ICDE 2023.
Students
- M.Tech RA: Mayank Dataram Yadav (Jan 2026 - )
- M.Tech: Jaideep Singh Kaler (May 2026 - )
Applications:
- PhD Students: I am looking for motivated PhD candidates. If you are interested, please drop me an email.
- Internship: Instead of sending an email, please apply for an internship here.