CIDR Proceedings

This website is under development. If you come accross any issues, please report them to Konstantinos Kanellis (kkanellis@cs.wisc.edu) or Yannis Chronis (chronis@google.com).

Proceedings of CIDR

Session 1: Core DB 1

A Critique of Modern SQL And A Proposal Towards A Simple and Expressive Query Language

Thomas Neumann, Viktor Leis

PDF

A Model for Query Execution Over Heterogeneous Instances

Ziheng Wang, Emanuel Adamiak, Alex Aiken

PDF

Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries

Yifei Yang, Hangdong Zhao, Xiangyao Yu, Paraschos Koutris

PDF

Chablis: Fast and General Transactions in Geo-Distributed Systems

Tamer Eldeeb, Philip A Bernstein, Asaf Cidon, Junfeng Yang

PDF

Session 2: Systems/HW

Database Kernels: Seamless Integration of Database Systems and Fast Storage via CXL

Sangjin Lee, Alberto Lerner, Philippe Bonnet, Philippe Cudré-Mauroux

PDF

Program your (custom) SIMD instruction set on FPGA in C++

Johannes Pietrzyk, Alexander Krause, Christian Färber, Dirk Habich, Wolfgang Lehner

PDF

Taking the Shortcut: Actively Incorporating the Virtual Memory Index of the OS to Hardware-Accelerate Database Indexing

Felix Schuhknecht

PDF

Session 3: Cloud 1

The Tipping Point of Edge-Cloud Data Management

Faisal Nawab, Moshe Shadmon

PDF

Cost-Intelligent Data Analytics in the Cloud

Huanchen Zhang, Yihao Liu, Jiaqi Yan

PDF

Towards Resource-adaptive Query Execution in Cloud Native Databases

Rui Liu, Jun Hyuk Chang, Riki Otaki, Zhe Heng Eng

PDF

Session 4: Core DB 2

Is Perfect Hashing Practical for OLAP Systems?

Kevin P Gaffney, Jignesh M Patel

PDF

Oligolithic Cross-task Optimizations across Isolated Workloads

Eleni Zapridou, Panagiotis Sioulas, Anastasia Ailamaki

PDF

Dear User-Defined Functions, Inlining isn't working out so great for us. Let's try batching to make our relationship work. Sincerely, SQL

Kai Franz, Samuel Arch, Denis Hirn, Torsten Grust, Todd C Mowry, Andrew Pavlo

PDF

Session 5: Benchmarking/Testing

Panda: Performance Debugging for Databases using LLM Agents

Vikramank Singh, Kapil Eknath Vaidya, Vinayshekhar Bannihatti, Sopan Khosla, Balakrishnan Narayanaswamy, Rashmi Gangadharaiah, Tim Kraska

PDF

TracEx: Understanding and Analyzing Database Traces

Dominik Durner

PDF

Towards an Objective Metric for Data Value Through Relevance

Boris Glavic, Pengyuan Li, Ziyu Liu, Dieter Gawlick, Vasudha Krishnaswamy, Danica Porobic, Zhen Hua Liu

PDF

Leopard: A General Test Suite for Isolation Level Verification

Peiyuan Liu, Siyang Weng, Keqiang Li

PDF

Session 6: GenAI 1

NL2SQL is a solved problem... Not!

Avrilia Floratou, Fotis Psallidas, Fuheng Zhao, Shaleen Deep, Gunther Hagleither, Joyce Cahoon, Rana Alotaibi, Jordan Henkel, Abhik Singla, Alex van Grootel, Kai Deng, Katherine Lin, Marcos Campos, Venkatesh Emani, Vivek Pandit, Wenjing Wang, Carlo Curino

PDF

Welding Natural Language Queries to Analytics IRs with LLMs

Kaushik Rajan, Aseem Rastogi, Akash Lal, Sampath Rajendra, Krithika Subramanian, Krut Patel

PDF

Turning Databases Into Generative AI Machines

Alekh Jindal, Shi Qiao, Sathwik Reddy Madhula, Kanupriya Raheja, Sandhya Jain

PDF

Session 7: Cloud 2

Off-the-shelf Data Analytics on Serverless

Michael Wawrzoniak, Gianluca Moro, Rodrigo Bruno, Ana Klimovic, Gustavo Alonso, U Lisboa

PDF

Serverless State Management Systems

Tianyu Li, Badrish Chandramouli, Sebastian Burckhardt, Samuel Madden

PDF

Optimizing the cloud? Don’t train models. Build oracles!

Tiemo Bang, Conor Power, Siavash Ameli, Natacha Crooks, Joseph M Hellerstein

PDF

Scalable OLTP in the Cloud: What's the BIG DEAL?

Pat Helland

PDF

Session 8: Startup Session

Yellowbrick: An Elastic Data Warehouse on Kubernetes

Mark Cusack, John Adamson, Mark Brinicombe, Neil Carson, Thomas Kejser, Jim Peterson, Arvind Vasudev, Kurt Westerfeld

PDF

MotherDuck: DuckDB in the cloud and in the client

RJ Atwal, Peter Boncz, Ryan Boyd, Antony Courtney, Till Döhmen, Florian Gerlinghoff, Jeff Huang, Joseph Hwang, Raphael Hyde, Elena Felder, Jacob Lacouture, Yves LeMaout, Boaz Leskes, Yao Liu, Dan Perkins, Tino Tereshko, Jordan Tigani, Nick Ursa, Stephanie Wang, Yannick Welsch

PDF

Session 9: GenAI 2

CAESURA: Language Models as Multi-Modal Query Planners

Matthias Urban, Carsten Binnig

PDF

The Fast and the Private: Task-based Dataset Search

Zezhou Huang, Jiaxiang Liu, Haonan Wang, Eugene Wu

PDF

Revisiting Prompt Engineering via Declarative Crowdsourcing

Aditya G Parameswaran, Shreya Shankar, Parth Asawa, Naman Jain, Yujie Wang

PDF

SMARTFEAT: Efficient Feature Construction through Feature-Level Foundation Model Interactions

Yin Lin, Bolin Ding, H V Jagadish, Jingren Zhou

PDF

VerifAI: Verified Generative AI

Nan Tang, Chenyu Yang, Ju Fan, Lei Cao, Yuyu Luo, Alon Halevy

PDF