DuckDB Library
arXiv
The Science Data Lake: A Unified Open Infrastructure Integrating 293 Million Papers Across Eight Scholarly Sources with Embedding-Based Ontology Alignment
The Database School Podcast by Aaron Francis
Scaling DuckDB in the Cloud with MotherDuck CEO Jordan Tigani
The Hedgineer Podcast by Michael Watson
DuckDB, Apache Arrow, & the Future of Data Engineering with Rusty Conover
DuckDB in Science
Anarchy in the Database: A Survey and Evaluation of Database Management System Extensibility
ADMS 2025
RISC-V Meets RDBMS: An Experimental Study of Database Performance on an Open Instruction Set Architecture
VLDB 2025
GooseDB: A Database Engine that Optimally Refines Top-𝑘 Queries to Satisfy Representation Constraints
VLDB 2025
Freely Moving Between the OLTP and OLAP Worlds: Hermes - A High-Performance OLAP Accelerator for MySQL
VLDB 2025
Environmental Footprints of Query Processing: A Vision for Sustainable Database Architectures
VLDB 2025
Anarchy in the Database: A Survey and Evaluation of Database Management System Extensibility
Submission Guidelines
The DuckDB library is a collection of scientific papers, podcasts, projects, talks and books that focus on DuckDB or DuckLake.
Submissions are welcome in the form of pull requests in the duckdb-web repository.
You are welcome to submit both your own work and also the work of others.
When submitting, please follow these guidelines:
- The entry's filename should start with a date in YYYY-MM-DD format. It should capture the podcast's release date, the talk's presentation day, the book's publication day or the conference's first day. If the exact release date is not easily obtainable, just use an estimated date.
- For entries describing talks and papers, please link the presentation slide deck if it's available.
- If the entry has a DuckDB implementation (in core, as a community extension or as an open-source repository), please add a link pointing to this.