MotherDuck: DuckDB in the cloud and in the client
Abstract
We describe and demo MotherDuck, a new service that connects DuckDB to the cloud. MotherDuck introduces hybrid query processing: the ability to execute a query partly on the client and partly in the cloud. We cover the motivation for MotherDuck and some of its use cases, and outline its system architecture, which relies heavily on DuckDB's extension mechanisms. MotherDuck allows existing DuckDB users who work on a laptop, such as data scientists, to start using cloud computing without changing their queries; this can provide better performance as well as scalability to larger datasets. It also lets them share DuckDB databases with others through the cloud for collaboration. Hybrid query processing opens the door to new data-intensive applications, such as low-latency analytical web apps, with DuckDB-Wasm as the client running inside a browser. It also raises research questions, some of which we describe in the paper.