This website is under development. If you come accross any issues, please report them to Konstantinos Kanellis (kkanellis@cs.wisc.edu) or Yannis Chronis (chronis@google.com).

Proceedings of CIDR

Session 1: Foundations

A Fix for the Fixation on Fixpoints

Denis Hirn, Torsten Grust

mutable: A Modern DBMS for Research and Fast Prototyping

Immanuel Haffner, Jens Dittrich

Zed: Leveraging Data Types to Process Eclectic Data

Amy Ousterhout, Steve McCanne, Henri Dubois-Ferriere, Silvery Fu, Sylvia Ratnasamy, Noah Treuhaft

Towards Unifying Query Interpretation and Compilation

Philipp M Grulich, Aljoscha Lepping, Dwi Prasetyo Adi Nugroho, Varun Pandey, Steffen Zeuch, Volker Markl

Making Data Engineering Declarative

Michael Armbrust, Bilal Aslam, Yingyi Bu, Sourav Chatterji, Yuhong Chen, Yijia Cui, Vuk Ercegovac, Ali Ghodsi, Rahul Govind, Aakash Japi, Kiavash Kianfar, Eun-Gyu Kim, Xi Liang, Paul Lappas, Mukul Murthy, Supun Nakandala, Andreas Neumann, Yannis Papakonstantinou, Nitin Sharma, Yannis Sismanis, Justin Tang, Joseph Torres, Reynold Xin, Min Yang, Li Zhang

Session 2: Models

KÙZU Graph Database Management System

Xiyang Feng, Guodong Jin, Ziyi Chen, Chang Liu, Semih Salihoğlu

DuckPGQ: Efficient Property Graph Queries in an analytical RDBMS

Daniel ten Wolde, Tavneet Singh, Gábor Szárnyas, Peter Boncz

Two is Better Than One: The Case for 2-Tree for Skewed Data Sets

Xinjing Zhou, Xiangyao Yu, Goetz Graefe, Michael Stonebraker

Session 3: Transactions

Transactions Make Debugging Easy

Qian Li, Peter Kraft, Michael Cafarella, Çağatay Demiralp, Goetz Graefe, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Matei Zaharia

Developer's Responsibility or Database's Responsibility? Rethinking Concurrency Control in Databases

Chaoyi Cheng, Mingzhe Han, Nuo Xu, Spyros Blanas

Git is for Data

Yucheng Low, Rajat Arya, Ajit Banerjee, Ann Huang, Brian Ronan, Hoyt Koepke, Joseph Godlewski, Zach Nation

Session 4: Machine Learning

Database Gyms

Wan Shen Lim, Matthew Butrovich, William Zhang, Andrew Crotty, Lin Ma, Peijing Xu, Johannes Gehrke, Andrew Pavlo

The Tensor Data Platform: Towards an AI-centric Database System

Apurva Gandhi, Yuki Asada, Victor Fu, Advitya Gemawat, Lihao Zhang, Rathijit Sen, Carlo Curino, Jesús Camacho-Rodríguez, Matteo Interlandi

Deep Lake: a Lakehouse for Deep Learning

Sasun Hambardzumyan, Abhinav Tuli, Levon Ghukasyan, Fariz Rahman, Hrant Topchyan, Mark McQuade, Mikayel Harutyunyan, Tatevik Hakobyan, Ivo Stranic, Davit Buniatyan

Session 5: Abstractions

Raising the Level of Abstraction for Time-State Analytics With the Timeline Framework

Henry Milner, Yihua Cheng, Jibin Zhan, Hui Zhang, Vyas Sekar, Junchen Jiang, Ion Stoica

WarpGate: A Semantic Join Discovery System for Cloud Data Warehouses

Tianji Cong, James Gale, Jason Frantz, H V Jagadish, Çağatay Demiralp

Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes

Zui Chen, Zihui Gu, Lei Cao, Ju Fan, Sam Madden, Nan Tang

Towards Adaptive Storage Views in Virtual Memory

Felix Schuhknecht, Justus Henneberg

Predicting Query Execution time for JIT Compiled Database Engines

Kostas Chasialis, Srinivas Karthik, Bikash Chandra, Anastasia Ailamaki

Session 6: Cloud Systems

Is Scalable OLTP in the Cloud a Solved Problem?

Tobias Ziegler, Philip A Bernstein, Viktor Leis, Carsten Binnig

Shared Foundations: Modernizing Meta’s Data Lakehouse

Biswapesh Chattopadhyay, Pedro Pedreira, Sameer Agarwal, Suketu Vakharia, Peng Li, Weiran Liu, Sundaram Narayanan

Analyzing and Comparing Lakehouse Storage Systems

Paras Jain, Peter Kraft, Conor Power, Tathagata Das, Ion Stoica, Matei Zaharia

Session 7: Data Movement

Templating Shuffles

Qizhen Zhang, Jiacheng Wu, Ang Chen, Vincent Liu, Boon Thau Loo

Data Pipes: Declarative Control over Data Movement

Lukas Vogel, Daniel Ritter, Danica Porobic, Pınar Tözün, Tianzheng Wang, Alberto Lerner

Pipeline Group Optimization on Disaggregated Systems

Andreas Geyer, Alexander Krause, Dirk Habich, Wolfgang Lehner

Stateful Entities: Object-oriented Cloud Applications as Distributed Dataflows

Kyriakos Psarakis, Wouter Zorgdrager, Marios Fragkoulis, Guido Salvaneschi, Asterios Katsifodimos

Session 8: Consensus and Query Processing

FlexiRaft: Flexible Quorums with Raft

Ritwik Yadav, Anirban Rahut

Chemistry behind Agreement

Suyash Gupta, Mohammad Javad Amiri, Mohammad Sadoghi

DASH: Asynchronous Hardware Data Processing Services

Norman May, Daniel Ritter, Andre Dossinger, Christian Färber, Suleyman S Demirsoy

HetCache: Synergising NVMe Storage and GPU acceleration for Memory-Efficient Analytics

Hamish Nicholson, Aunn Raza

Far-and-Near: Co-Designed Storage Reliability Between Database and SSDs

Jinwoo Jeong, Sangjin Lee, Alberto Lerner

Reconstructing and Querying ML Pipeline Intermediates

Sebastian Schelter