This website is under development. If you come across any issues, please report them to Konstantinos Kanellis (kkanellis@cs.wisc.edu) or Yannis Chronis (chronis@google.com).

Proceedings of CIDR

Session 1: Modern Data Storage

Append is Near: Log-based Data Management on ZNS SSDs

Devashish R Purandare, Peter Wilcox, Heiner Litz

SSDs Striking Back: The Storage Jungle and Its Implications on Persistent Indexes

Kaisong Huang, Darien Imai, Tianzheng Wang, Dong Xie

Are You Sure You Want to Use MMAP in Your Database Management System?

Andrew Crotty, Viktor Leis, Andrew Pavlo

D-RDMA: Bringing Zero-Copy RDMA to Database Systems

André Ryser, Alberto Lerner, Alex Forencich, Philippe Cudré-Mauroux

Micro-architectural Analysis of OLAP Systems on Persistent Memory

Jie Liang Ang, Jefferson Chu, Jiajun Liu, Bingsheng He, Hieu Le Trung, Jiong He, Qian Lin

Runtime Encoding Execution in AnalyticDB: Efficient Query Executor for Cloud Database

Qiaoyi Ding

Session 2: New Systems

A Progress Report on DBOS: A Database-oriented Operating System

Qian Li, Peter Kraft, Kostis Kaffes, Athinagoras Skiadopoulos, Deeptaanshu Kumar, Jason Li, Michael Cafarella, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Matei Zaharia

Mach: A Pluggable Metrics Storage Engine for the Age of Observability

Franco Solleza, Andrew Crotty, Suman Karumuri, Nesime Tatbul, Stan Zdonik

Darwin: Scale-In Stream Processing

Lawrence Benson, Tilmann Rabl

GRainDB: A Relational-core Graph-Relational DBMS

Guodong Jin, Nafisa Anzum, Semih Salihoglu

Amalur: Next-generation Data Integration in Data Lakes

Rihan Hai, Christos Koutras, Andra Ionescu, Asterios Katsifodimos

Photon: A High-Performance Query Engine for the Lakehouse

Alexander Behm

Session 3: Video Analytics

VOCAL: Video Organization and Interactive Compositional AnaLytics

Maureen Daum, Enhao Zhang, Dong He, Magdalena Balazinska, Brandon Haynes, Ranjay Krishna, Apryle Craig, Aaron Wirsing

VIVA: An End-to-End System for Interactive Video Analytics

Daniel Kang, Francisco Romero, Peter Bailis, Christos Kozyrakis, Matei Zaharia

Session 4: Cloud Computing

CompuCache: Remote Computable Caching using Spot VMs

Qizhen Zhang, Philip A Bernstein, Daniel S Berger, Badrish Chandramouli, Vincent Liu, Boon Thau Loo

Self-Organizing Data Containers

Samuel Madden, Jialin Ding, Tim Kraska, Sivaprasad Sudhir, David Cohen, Timothy Mattson, Nesime Tatbul

Farview: Disaggregated Memory with Operator Off-loading for Database Engines

Dario Korolija

Decoupled Transactions: Low Tail Latency Online Transactions Atop Jittery Servers

Pat Helland

A Network Use for Incomplete Knowledge Management

Anduo Wang

Correctness in Stream Processing: Challenges and Opportunities

Caleb Stanford, Konstantinos Kallas, Rajeev Alur

Session 5: Data Exploration

Kyrix-J: Visual Discovery of Connected Datasets in a Data Lake

Wenbo Tao, Adam Sah, Leilani Battle, Remco Chang, Michael Stonebraker

Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration

Michael R Anderson, Yuze Lou, Jiayun Zou, Michael Cafarella, Sarah Chasins, Doug Downey, Tian Gao, Kexin Huang, Dinghao Shen, Jenny Vo-Phamhi, Yitong Wang, Yuning Wang, Anna Zeng

Knowledge Graph Exploration Systems: are we lost?

Matteo Lissandrini, Davide Mottin

Data Management Opportunities for Foundation Models

Laurel Orr, Karan Goel, Christopher Ré

Towards NLP-Enhanced Data Profiling Tools

Immanuel Trummer

Session 6: Data Science

DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines

Patrick Damme, Marius Birkenbach, Constantinos Bitsakos, Matthias Boehm, Philippe Bonnet, Florina Ciorba, Mark Dokter, Pawel Dowgiallo, Ahmed Eleliemy, Christian Faerber, Georgios Goumas, Dirk Habich, Niclas Hedam, Marlies Hofer, Wenjun Huang, Kevin Innerebner, Vasileios Karakostas, Roman Kern, Tomaž Kosar, Alexander Krause, Daniel Krems, Andreas Laber, Wolfgang Lehner, Eric Mier, Marcus Paradies, Bernhard Peischl, Gabrielle Poerwawinata, Stratos Psomadakis, Tilmann Rabl, Piotr Ratuszniak, Pedro Silva, Nikolai Skuppin, Andreas Starzacher, Benjamin Steinwender, Ilin Tolovski, Pınar Tözün, Wojciech Ulatowski, Yuanyuan Wang, Izajasz Wrosz, Aleš Zamuda, Ce Zhang, Xiao Xiang Zhu

Augmenting Decision Making via Interactive What-If Analysis

Sneha Gathani, Madelon Hulsebos, James Gale, Peter J Haas

Towards Observability for Machine Learning Pipelines

Shreya Shankar, Aditya Parameswaran

Screening Native Machine Learning Pipelines with ArgusEyes

Sebastian Schelter, Stefan Grafberger, Shubha Guha, Olivier Sprangers, Bojan Karlaš, Ce Zhang

Making Table Understanding Work in Practice

Madelon Hulsebos, Sneha Gathani, James Gale, Isil Dillig, Paul Groth, Cagatay Demiralp

Examples are All You Need: Iterative Data Discovery by Example in Data Lakes

Allan Vanterpool, Andrew Bowne, Lindsey McEvoy, Vijay Gadepally

Session 7: Query Processing

The 3D Hash Join: Building On Non-Unique Join Attributes

Daniel Flachs, Magnus Müller, Guido Moerkotte

Memory Efficient Scheduling of Query Pipeline Execution

Lukas Landgraf, Florian Wolf, Alexander Boehm, Wolfgang Lehner

Introducing a Query Acceleration Path for Analytics in SQLite3

Martin Prammer, Suryadev Sahadevan Rajesh, Junda Chen, Jignesh M Patel

Accelerating Python UDFs in Vectorized Query Execution

Steffen Kläbe, Bobby DeSantis, Stefan Hagedorn, Kai-Uwe Sattler

Boosting Efficiency of External Pipelines by Blurring Application Boundaries

Anna Herlihy, Periklis Chrysogelos, Anastasia Ailamaki

Session 8: ML and Query Optimization

Workload-driven, Lazy Discovery of Data Dependencies for Query Optimization

Jan Kossmann, Daniel Lindner, Felix Naumann, Thorsten Papenbrock

A Unified Transferable Model for ML-Enhanced DBMS

Ziniu Wu, Pei Yu, Peilun Yang, Rong Zhu, Yuxing Han, Yaliang Li, Defu Lian, Kai Zeng, Jingren Zhou

One Model to Rule them All: Towards Zero-Shot Learning for Databases

Benjamin Hilprecht, Carsten Binnig

Machine Learning, Linear Algebra, and More: Is SQL All You Need?

Mark Blacher

DataFarm: Farm Your ML-based Query Optimizer's Food! - Human-Guided Training Data Generation -

Robin van de Water, Francesco Ventura, Zoi Kaoudi, Jorge-Arnulfo Quiané-Ruiz, Volker Markl

Can Transfer Learning be used to build a Query Optimizer?

Yunjia Zhang, Yannis Chronis, Jignesh M Patel, Theodoros Rekatsinas