You can’t really compare data frames with OLAP databases! 🫠 Pandas, Polars, and DuckDB are not the same. Let me explain 👇🏻
DuckDB offers all the benefits of a full database: transactions, OLAP query scalability, indexing, caching, parallel processing, and advanced analytics functions. With just SQL, you can query CSVs, Excel files, and Parquet files, read data over HTTP, and more. ⚡
The real advantage? You don’t depend on external libraries like psycopg2 or Boto3. If one library breaks, your entire pipeline fails, and that’s risky. 😶‍🌫️ With DuckDB, you can connect to multiple sources with a single SQL query, whether it’s S3, Snowflake, Parquet files, or others.
And remember: you write code for humans, not computers. Your backend devs, data analysts, and even business analysts all understand SQL.
So why do you use this crazy syntax in your Jupyter notebooks? 🫥
df['day_of_week'] = pd.to_datetime(df['date_str']).dt.day_name()

Nov 10, 2025 · 2:00 PM UTC

Just use SQL: it’s simpler, cleaner, and everyone can read it. 🦆