Ship data pipelines with extraordinary velocity. Dagster+: dagster.io/plus GitHub: github.com/dagster-io Slack: dagster.io/slack

☁️
Joined May 2019
According to the latest Forrester TEI report, Dagster+ delivers a 432% ROI, with data engineers shifting from maintenance to high-value work. As one executive noted: “Because we now have a full test suite that ensures everything is actually running as expected, we trust our code more. And now it’s trivial to go through the process of managing deployments and pull requests and managing releases and deployments of pipelines and code.” Get the full report today! Link in thread
4
3
13
Dagster retweeted
I learned this when I was 20. It's not hard to start. There are 2 types of queries: reads and writes. Most of your queries are reads, so handle those first. Do this, in order:
1/ Reduce reads - Cache, cache, cache. Use read-thru and write-thru caching. Use Redis (sketch below).
2/ Optimize slow reads - Add indexes. Fix your N+1 queries. Add limits. Sort responsibly. Alert on slow queries.
3/ Scale hardware - Add read replicas. Upgrade memory & I/O.
4/ Split up data - Shard when you have to. Partition writes when needed (you won’t need it).
Understand “good enough”. A single write instance is fine for most people. And if it's not slow, leave it alone!
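A minimal read-through cache sketch for step 1 above, assuming redis-py and a hypothetical get_user_from_db() helper standing in for your real query:

```python
import json

import redis

r = redis.Redis(host="localhost", port=6379, decode_responses=True)
CACHE_TTL_SECONDS = 300  # tune to how much staleness your reads can tolerate


def get_user_from_db(user_id: int) -> dict:
    # Stand-in for the real query, e.g. SELECT * FROM users WHERE id = %s
    return {"id": user_id, "name": "example"}


def get_user(user_id: int) -> dict:
    """Read-through: serve from Redis if cached, otherwise hit the DB and backfill."""
    key = f"user:{user_id}"
    cached = r.get(key)
    if cached is not None:
        return json.loads(cached)
    user = get_user_from_db(user_id)  # the slow read you are protecting
    r.set(key, json.dumps(user), ex=CACHE_TTL_SECONDS)
    return user
```

The write-through half of the pattern updates the cache on the same code path as the database write, so reads never serve data staler than the TTL.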
I have no understanding of database scaling. How many queries is too many, 1k/second? 10k/s?
39
191
7
2,819
Dagster retweeted
I recently conducted a case study with @RaysBaseball on how they utilize @Dagster to significantly enhance the velocity and quality of their data pipelines. Key results:
- 50-70% faster pipeline execution: processing reduced from hours to minutes
- 15-minute data availability: game data ready within 10-15 minutes vs. 9 a.m. batch jobs
- One-week onboarding: new data sources integrated 2-3x faster than the previous system
- Zero-touch reliability: critical pipelines run unattended nightly without intervention
1
2
8
Dagster retweeted
Our girl has me thinking about what a tasteful use of AI looks like. For me it comes down to 3 things:
- No one should feel like they are getting displaced; this kills momentum and adoption.
- The implementation is well engineered. Vibe-coded slop falls apart in production. Don't be this guy.
- Meet people where they are. Don't ask your marketing girlie to use a CLI. If you can embed in your existing tools, even better.
3
1
16
Dagster retweeted
🚨 New @dagster eBook 🚨 Data teams face different challenges and growing pains as they scale. Data platforms are like fashion in the sense that they are never finished: you are constantly adding new pipelines, migrating databases, and incorporating new tools and stakeholders. Dennis Hume and @coltonpadden wrote a fantastic eBook that walks through the challenges you'll face as you grow and how to evolve your platform in place so that scaling stays easy. Check it out today! Link in thread.
2
1
19
Dagster retweeted
Today is day 1 of Small Data SF!!! We'll kick things off with a day of hands-on workshops.
- From Zero to Query: Building Your First Serverless Lakehouse with DuckLake - Jacob Matson from MotherDuck walks through creating a serverless lakehouse with DuckLake, covering ACID transactions, time travel, and schema evolution.
- Stop Measuring LLM Accuracy, Start Building Context - Tahlia DeMaio from Hex argues that context, not accuracy, is the real challenge in LLM systems and shows how to build context-aware analytical workflows.
- Keep it Simple and Scalable: Pythonic ELT using dltHub - Thierry Jean from dltHub, with Brian Douglas from Continue and elvis kahoro from Chalk, teaches Python-based data ingestion and transformation pipelines.
- Composable Data Workflows: Building Pipelines That Just Work - Dennis Hume from Dagster Labs covers practical patterns for building reliable, modular pipelines that scale from laptop to production.
- Open Data Science Agent - Zain Hasan from Together AI shows how to build an autonomous data science agent using open-source models and the ReAct framework for end-to-end analysis tasks.
- Duck, duck, deploy: Building an AI-ready app in 2 hours - Russell Garner and Rebecca Bruggman from Omni start with a MotherDuck dataset and build a production-ready analytics app using Omni's semantic model and APIs.
- From Parsing Nightmares to Production - Upal Saha from bem demonstrates how to transform any unstructured input (PDFs, images, audio, etc.) into clean JSON and load it directly into MotherDuck.
- Just-in-Time Insights with Estuary - Zulfikar Qureshi from Estuary provides hands-on experience with real-time data streaming, including a lab exercise streaming live data into MotherDuck.
4
4
2
17
Dagster retweeted
So true for data engineering and why the orchestrator is the unsung hero of the data stack
Most of my job is actually just improving observability. About 80% of perf problems have drop-dead obvious fixes, but engineers have no idea that they’re even happening.
2
1
14
Dagster retweeted
Dagster is the official Data Ops Platform for the Real American Virtuous Yeoman Farmer
2
1
1
11
Dagster retweeted
Daylight Saving Time ended last weekend for most of the US. Did your pipelines behave as planned? Dealing with unusual edge cases such as DST is the bane of the data engineer's existence: each transition happens only once a year, making it easy to forget, and it's time-consuming to react to afterward. Orchestrators like Dagster can handle it for you. For any of your schedules, if you set an execution_timezone (see the sketch below), Dagster automatically adjusts runs across the transition. Spring forward? Your 2:30 a.m. job runs at 3:30 a.m. Fall back? It waits for the second occurrence. No manual fixes.
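A minimal sketch of such a schedule; the op and job names are illustrative, the point is the execution_timezone argument:

```python
import dagster as dg


@dg.op
def refresh_nightly_tables():
    ...  # placeholder for the actual pipeline logic


@dg.job
def nightly_job():
    refresh_nightly_tables()


# The cron expression is evaluated in the given timezone, so Dagster
# handles DST shifts (spring forward / fall back) for you.
nightly_schedule = dg.ScheduleDefinition(
    job=nightly_job,
    cron_schedule="30 2 * * *",  # 2:30 a.m. local time
    execution_timezone="America/New_York",  # omit this and the schedule runs in UTC
)
```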
1
4
Dagster retweeted
Happy Friday everyone! DSPy Weekly Issue No. 9 is out: dspyweekly.com/newsletter/10…
Highlights:
🔹 Articles: DSPy vs. LlamaBot, REFRAG implementation, BAML & Butter integrations, and the dissertation that started it all.
🔹 Videos: Using DSPy with Dagster & a playlist from DSPy Boston.
🔹 Projects: DSPy in Go (dsgo), a lightweight version (udspy), and dspy-bench.
🔹 Jobs: Fellowship at Harvard's Berkman Klein Center.
#BAML #dspy @DSPyOSS @dagster
Dagster retweeted
In case you missed it, our deep dive on using both @dagster and @DSPyOSS is now on YouTube! We discuss DSPy and how its composability, evaluation, and optimization abstractions make it the best LLM development framework currently available. We also cover how the framework integrates with Dagster and how you can achieve improved observability and recoverability for your LLM workflows. Check out the full recording today!
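A hedged sketch of the pattern discussed, not the talk's actual code: wrapping a DSPy program in a Dagster asset so each run gets Dagster's logging and run history. The signature, model string, and asset name below are illustrative assumptions.

```python
import dagster as dg
import dspy

# Any model string DSPy supports will do; this one is just an example.
dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))

# A declarative DSPy program: chain-of-thought over a "document -> summary" signature.
summarize = dspy.ChainOfThought("document -> summary")


@dg.asset
def document_summaries() -> list[str]:
    # Placeholder inputs; in practice these would come from an upstream asset.
    documents = [
        "Dagster orchestrates data pipelines as software-defined assets.",
        "DSPy composes and optimizes LLM programs declaratively.",
    ]
    return [summarize(document=doc).summary for doc in documents]
```

Materializing the asset from Dagster then gives you per-run logs and retries around the LLM calls, which is the observability and recoverability angle from the recording.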
Dagster retweeted
Honored to be included on the @OpenAI developer blog where I talk about how we use Codex for educational content at @dagster! developers.openai.com/blog/c…
3
9
106
Dagster retweeted
@dagster Pipes for Go is now working! github.com/wingyplus/dagster…
1
2
4
This morning, "Dagster Compass," an AI data analyst that lives in the Dagster community Slack, was released! You describe what you want in plain language and it attempts an analysis using data available online; as a test, I asked it to "show the top 10 data analytics companies in the world." When I then asked it to "plot the top 10 by company value," it corrected my vague wording, asking "'company value' can mean several things; what exactly do you mean by value?", and then produced the chart.
Your analysts are stuck waiting on data engineers. Your data engineers are drowning in ad-hoc requests. There's a better way. We built Compass, a self-service analytics platform that lets our team scale analysis without scaling headcount and ship 2x faster. Check out the full blog!
2
6
Dagster retweeted
Does your data analyst support you speaking a little Chinese anon?
1
1
6
Dagster retweeted
We ❤️ Dagster + Colton! One of our earliest customers and incredible partners
1
4
7
Dagster retweeted
Thanks for the opportunity to talk at #allthingsopen today—everyone has been so welcoming and supportive. You can find the slides here: cmpadden.github.io/slides/at…
Consolidations in the #dataengineering market are happening fast. Tools from the MDS are being folded into unified data platforms. The latest:
- Fivetran → dbt
- Fivetran → SQLMesh
- Soda → nannyML
- Snowflake → Crunchy Data
- Databricks → Neon
- Fivetran → Census
- dbt → SDF
6
6
1
30
Dagster retweeted
I got kids and shit man