This website is under development. If you come accross any issues, please report them to Konstantinos Kanellis (kkanellis@cs.wisc.edu) or Yannis Chronis (chronis@google.com).

Proceedings of CIDR

Session 1: Core DB 1

A Critique of Modern SQL And A Proposal Towards A Simple and Expressive Query Language

Thomas Neumann, Viktor Leis

A Model for Query Execution Over Heterogeneous Instances

Ziheng Wang, Emanuel Adamiak, Alex Aiken

Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries

Yifei Yang, Hangdong Zhao, Xiangyao Yu, Paraschos Koutris

Chablis: Fast and General Transactions in Geo-Distributed Systems

Tamer Eldeeb, Philip A Bernstein, Asaf Cidon, Junfeng Yang

Session 2: Systems/HW

Database Kernels: Seamless Integration of Database Systems and Fast Storage via CXL

Sangjin Lee, Alberto Lerner, Philippe Bonnet, Philippe Cudré-Mauroux

Program your (custom) SIMD instruction set on FPGA in C++

Johannes Pietrzyk, Alexander Krause, Christian Färber, Dirk Habich, Wolfgang Lehner

Taking the Shortcut: Actively Incorporating the Virtual Memory Index of the OS to Hardware-Accelerate Database Indexing

Felix Schuhknecht

Session 3: Cloud 1

The Tipping Point of Edge-Cloud Data Management

Faisal Nawab, Moshe Shadmon

Cost-Intelligent Data Analytics in the Cloud

Huanchen Zhang, Yihao Liu, Jiaqi Yan

Towards Resource-adaptive Query Execution in Cloud Native Databases

Rui Liu, Jun Hyuk Chang, Riki Otaki, Zhe Heng Eng

Session 4: Core DB 2

Is Perfect Hashing Practical for OLAP Systems?

Kevin P Gaffney, Jignesh M Patel

Oligolithic Cross-task Optimizations across Isolated Workloads

Eleni Zapridou, Panagiotis Sioulas, Anastasia Ailamaki

Dear User-Defined Functions, Inlining isn't working out so great for us. Let's try batching to make our relationship work. Sincerely, SQL

Kai Franz, Samuel Arch, Denis Hirn, Torsten Grust, Todd C Mowry, Andrew Pavlo

Session 5: Benchmarking/Testing

Panda: Performance Debugging for Databases using LLM Agents

Vikramank Singh, Kapil Eknath Vaidya, Vinayshekhar Bannihatti, Sopan Khosla, Balakrishnan Narayanaswamy, Rashmi Gangadharaiah, Tim Kraska

TracEx: Understanding and Analyzing Database Traces

Dominik Durner

Towards an Objective Metric for Data Value Through Relevance

Boris Glavic, Pengyuan Li, Ziyu Liu, Dieter Gawlick, Vasudha Krishnaswamy, Danica Porobic, Zhen Hua Liu

Leopard: A General Test Suite for Isolation Level Verification

Peiyuan Liu, Siyang Weng, Keqiang Li

Session 6: GenAI 1

NL2SQL is a solved problem... Not!

Avrilia Floratou, Fotis Psallidas, Fuheng Zhao, Shaleen Deep, Gunther Hagleither, Joyce Cahoon, Rana Alotaibi, Jordan Henkel, Abhik Singla, Alex van Grootel, Kai Deng, Katherine Lin, Marcos Campos, Venkatesh Emani, Vivek Pandit, Wenjing Wang, Carlo Curino

Welding Natural Language Queries to Analytics IRs with LLMs

Kaushik Rajan, Aseem Rastogi, Akash Lal, Sampath Rajendra, Krithika Subramanian, Krut Patel

Turning Databases Into Generative AI Machines

Alekh Jindal, Shi Qiao, Sathwik Reddy Madhula, Kanupriya Raheja, Sandhya Jain

Session 7: Cloud 2

Off-the-shelf Data Analytics on Serverless

Michael Wawrzoniak, Gianluca Moro, Rodrigo Bruno, Ana Klimovic, Gustavo Alonso, U Lisboa

Serverless State Management Systems

Tianyu Li, Badrish Chandramouli, Sebastian Burckhardt, Samuel Madden

Optimizing the cloud? Don’t train models. Build oracles!

Tiemo Bang, Conor Power, Siavash Ameli, Natacha Crooks, Joseph M Hellerstein

Scalable OLTP in the Cloud: What's the BIG DEAL?

Pat Helland

Session 8: Startup Session

Yellowbrick: An Elastic Data Warehouse on Kubernetes

Mark Cusack, John Adamson, Mark Brinicombe, Neil Carson, Thomas Kejser, Jim Peterson, Arvind Vasudev, Kurt Westerfeld

MotherDuck: DuckDB in the cloud and in the client

RJ Atwal, Peter Boncz, Ryan Boyd, Antony Courtney, Till Döhmen, Florian Gerlinghoff, Jeff Huang, Joseph Hwang, Raphael Hyde, Elena Felder, Jacob Lacouture, Yves LeMaout, Boaz Leskes, Yao Liu, Dan Perkins, Tino Tereshko, Jordan Tigani, Nick Ursa, Stephanie Wang, Yannick Welsch

Session 9: GenAI 2

CAESURA: Language Models as Multi-Modal Query Planners

Matthias Urban, Carsten Binnig

The Fast and the Private: Task-based Dataset Search

Zezhou Huang, Jiaxiang Liu, Haonan Wang, Eugene Wu

Revisiting Prompt Engineering via Declarative Crowdsourcing

Aditya G Parameswaran, Shreya Shankar, Parth Asawa, Naman Jain, Yujie Wang

SMARTFEAT: Efficient Feature Construction through Feature-Level Foundation Model Interactions

Yin Lin, Bolin Ding, H V Jagadish, Jingren Zhou

VerifAI: Verified Generative AI

Nan Tang, Chenyu Yang, Ju Fan, Lei Cao, Yuyu Luo, Alon Halevy