DuckDB
Fast analytical queries directly in Python, R, or the CLI without a server. SQL on DataFrames, Parquet, CSV, and JSON files, integrations with Pandas and Arrow, and building lightweight data pipelines.
FundamentalsTopics 1–10
- ·Introduction to DuckDB
- ·Install, CLI, and connections
- ·Tables, INSERT, and SELECT
- ·Reading CSV files
- ·Parquet and columnar data
- ·Types, casting, and NULL
- ·Filtering and sorting
- ·JOINs
- ·Aggregations and GROUP BY
- ·Export with COPY and files
Start Fundamentals →
IntermediateTopics 11–20
- ·Extensions INSTALL and LOAD
- ·Remote Parquet over HTTP(S)
- ·Window functions
- ·Common Table Expressions
- ·Parameterized queries in Python
- ·Apache Arrow integration
- ·Pandas and Polars interchange
- ·JSON and nested types
- ·EXPLAIN and query plans
- ·ATTACH and multiple databases
Start Intermediate →
AdvancedTopics 21–30
- ·Partitioned files and globs
- ·Secrets and cloud storage
- ·Memory, threads, and settings
- ·Incremental pipeline patterns
- ·Full-text search (FTS)
- ·Macros and reusable SQL
- ·Testing DuckDB-backed code
- ·Notebooks, BI, and ODBC
- ·DuckDB vs warehouses and SQLite
- ·Production pitfalls and checklist
Start Advanced →