Ryan Abernathey · Mar 1, 2024 · 1:24 PM UTC

Pinned Tweet

Ryan Abernathey

@rabernat

1 Mar 2024

It was super fun talking to Max about my journey through geospatial / climate and where I think this field is heading.

Max Lenormand

@MaxLenormand

1 Mar 2024

Today's conversation is a deep dive on how we do scientific computing today, and how it could be better. Talking to @rabernat about @pangeo_data, @zarr_dev and @EarthmoverHQ. Watch: piped.video/3IWp-MuSm6w

The Infra Pod · Jan 20, 2025 · 3:04 PM UTC

Ryan Abernathey retweeted

The Infra Pod

@theinfrapod

Jan 20

🌎 Why do we need a cloud-native data lake for geospatial data? In the latest episode of The Infra Pod, @tnachen & @ianlivingstone chat with the cofounders of @EarthmoverHQ, @rabernat & @_jhamman, about the future of data in climate and earth sciences. 🎧 Link in 🧵

Earthmover · Dec 20, 2024 · 7:38 PM UTC

Ryan Abernathey retweeted

Earthmover @EarthmoverHQ

20 Dec 2024

🌤️ #AMS2025 is just around the corner! We are taking AMS by storm with an exhibitor booth (booth 353), two talks from @_jhamman and @rabernat, and a @pangeo_data Community Happy Hour (register here: lu.ma/ddtba5f5)!

Earthmover · Dec 9, 2024 · 4:32 PM UTC

Ryan Abernathey retweeted

Earthmover @EarthmoverHQ

9 Dec 2024

@rabernat is rounding out the week of #AGU24 with his talk, “How can we make cloud computing actually accessible to all scientists?” on Friday at 5:10 PM agu.confex.com/agu/agu24/mee…

Joe Hamman · Dec 5, 2024 · 6:02 AM UTC

Ryan Abernathey retweeted

Joe Hamman @_jhamman

5 Dec 2024

Monday through Thursday, I'll be hanging out with @rabernat at the @EarthmoverHQ booth in the exhibit hall. Swing by to say hello or to snag some swag/stickers/etc. We'll also be demoing #icechunk all week.

Earthmover · Dec 4, 2024 · 4:09 PM UTC

Ryan Abernathey retweeted

Earthmover @EarthmoverHQ

4 Dec 2024

Will you be at @theAGU next week? Earthmover is exhibiting! @rabernat and @_jhamman are participating in panel discussions and giving talks👇.

Ryan Abernathey · Nov 22, 2024 · 1:23 PM UTC

Ryan Abernathey

@rabernat

22 Nov 2024

Checked out the other site. 🟦 Seems much better. Gonna be over there more from now on. 👋

Ian Schuler · Nov 19, 2024 · 9:36 AM UTC

Ryan Abernathey retweeted

Ian Schuler @ianschuler

19 Nov 2024

I take it all back. At #SatSummit in Lisbon we finally discovered the holy grail of cloud optimized file formats

Ian Schuler @ianschuler

18 Nov 2024

Replying to @ianschuler @mouthofmorrison @rabernat @betolink @EarthmoverHQ

That said, it isn't 100% clear that NASA's best move is to immediately convert 10000+ data sets into cutting edge ARCO formats. Kerchunk and Virtual Zarr offer benefits of ARCO while keeping data in the native formats.

Joe Hamman · Nov 14, 2024 · 7:36 PM UTC

Ryan Abernathey retweeted

Joe Hamman @_jhamman

14 Nov 2024

Are you heading to #AGU24 next month? Consider joining us for a bonus day of hacking on @pangeo_data. I'll be there representing @EarthmoverHQ and helping folks work with #icechunk and @zarr_dev. Details and signup here: discourse.pangeo.io/t/post-a…

Post-AGU Pangeo Hack-day / Working Meeting (December 14, 2024 in Washington, DC)

🎉 Please join us for a Pangeo working meeting / hackathon on December 14 in Washington, DC 🎉 I’m excited to announce a Pangeo working meeting following the 2024 AGU Fall Meeting in Washington, DC....

discourse.pangeo.io

Ovais Tariq · Nov 12, 2024 · 7:18 PM UTC

Ryan Abernathey retweeted

Ovais Tariq

@ovaistariq

12 Nov 2024

🚀 New blog post: Nomadic Compute - The Future of Distributed AI Workloads 🌐 In a fast-paced AI world, flexibility & resilience are key. "Nomadic Compute" pattern lets workloads move dynamically across clouds for peak performance, cost, & availability: tigrisdata.com/blog/nomadic-…

Nomadic Infrastructure Design for AI Workloads | Tigris Object Storage

This AI stuff is cool, but GPU inference is not needed all of the time. Most of the time your instances stay idle, which means you're just burning investor money without any real benefit. Today we'll...

tigrisdata.com

Deepak Cherian · Nov 12, 2024 · 5:03 PM UTC

Ryan Abernathey retweeted

Deepak Cherian @cherian_deepak

12 Nov 2024

Come learn about recent @xarray_dev GroupBy improvements at tomorrow's (Wed, Nov 13) Pangeo Showcase! discourse.pangeo.io/t/pangeo…

Ryan Abernathey · Nov 4, 2024 · 4:17 PM UTC

Ryan Abernathey

@rabernat

4 Nov 2024

Most developers today wouldn't dream of not using version control for their code... However, the same principles can be applied to data! @EarthmoverHQ's new open source project--Icechunk--includes version control features built specifically for the @zarr_dev data model, bringing powerful data version control to the world of massive multidimensional arrays.

Features include:
* All updates occur atomically in isolated snapshots
* Tags - immutable pointers to snapshots
* Branches - mutable pointers to snapshots

With Icechunk, you can safely experiment with changes to your data on a "dev" branch before propagating those changes to "main." You can publish an immutable version of your dataset (tag) while continuing to evolve towards the next version. Or you can simply revert incorrect changes back to an earlier version of your data. These capabilities make life so much easier for data scientists and teams using array data in production.

I've been using data version control with Zarr for the past year via our Arraylake platform, and I'm thrilled that these capabilities are now fully open source. I can't imagine going back to the old way of working. Learn more at icechunk.io/
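The snapshot, tag, and branch ideas in the post above can be illustrated with a small toy model. This is only a sketch of the concepts in plain Python: `ToyRepo`, `Snapshot`, and every method name here are hypothetical and are not the Icechunk API. It assumes a dataset is just a mapping from chunk keys to bytes, and that a commit becomes visible by moving a single branch pointer.

```python
import itertools
from dataclasses import dataclass
from types import MappingProxyType
from typing import Dict, Mapping, Optional


@dataclass(frozen=True)
class Snapshot:
    """An immutable version of the whole dataset (hypothetical toy type)."""
    id: int
    parent: Optional[int]
    data: Mapping[str, bytes]  # chunk key -> chunk bytes
    message: str


class ToyRepo:
    """Toy model of snapshots, branches, and tags. NOT the Icechunk API."""

    def __init__(self) -> None:
        self._ids = itertools.count()
        root = Snapshot(next(self._ids), None, MappingProxyType({}), "init")
        self.snapshots: Dict[int, Snapshot] = {root.id: root}
        self.branches: Dict[str, int] = {"main": root.id}  # mutable pointers
        self.tags: Dict[str, int] = {}                     # immutable pointers

    def commit(self, branch: str, changes: Dict[str, bytes], message: str) -> int:
        parent = self.snapshots[self.branches[branch]]
        # Build the new snapshot off to the side; readers never see it half-written.
        new_data = dict(parent.data)
        new_data.update(changes)
        snap = Snapshot(next(self._ids), parent.id, MappingProxyType(new_data), message)
        self.snapshots[snap.id] = snap
        # The update becomes visible in one step: moving the branch pointer.
        self.branches[branch] = snap.id
        return snap.id

    def create_branch(self, name: str, from_branch: str = "main") -> None:
        if name in self.branches:
            raise ValueError(f"branch {name!r} already exists")
        self.branches[name] = self.branches[from_branch]

    def tag(self, name: str, snapshot_id: int) -> None:
        if name in self.tags:
            raise ValueError(f"tag {name!r} already exists; tags are immutable")
        self.tags[name] = snapshot_id

    def fast_forward(self, target: str, source: str) -> None:
        # Propagate changes by pointing one branch at another branch's snapshot.
        self.branches[target] = self.branches[source]

    def reset(self, branch: str, snapshot_id: int) -> None:
        # Revert a branch to an earlier snapshot.
        self.branches[branch] = snapshot_id

    def read(self, ref: str) -> Mapping[str, bytes]:
        if ref in self.tags:
            return self.snapshots[self.tags[ref]].data
        return self.snapshots[self.branches[ref]].data


repo = ToyRepo()
v1 = repo.commit("main", {"temp/0.0": b"\x01"}, "first version")
repo.tag("v1.0", v1)                      # publish an immutable version

repo.create_branch("dev")
repo.commit("dev", {"temp/0.0": b"\x02"}, "experiment")
assert repo.read("main")["temp/0.0"] == b"\x01"   # main is untouched by dev

repo.fast_forward("main", "dev")          # propagate dev to main
repo.reset("main", v1)                    # ...or revert to the earlier version
```

In this toy, a tag and a branch are both just names for a snapshot id; the only difference is that a branch may be re-pointed while a tag may not, which mirrors the "mutable pointer" versus "immutable pointer" wording in the post.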

Tom Nicholas · Oct 24, 2024 · 10:21 PM UTC

Ryan Abernathey retweeted

Tom Nicholas @TEGNicholasCode

24 Oct 2024

Xarray v2024.10.0 has just been released, including support for xarray.DataTree and zarr-python v3 !!! github.com/pydata/xarray/rel… @xarray_dev @zarr_dev

Release v2024.10.0 · pydata/xarray

This release brings official support for xarray.DataTree, and compatibility with zarr-python v3! Aside from these two huge features, it also improves support for vectorised interpolation and fixes ...

github.com

Ryan Abernathey · Oct 23, 2024 · 2:06 PM UTC

Ryan Abernathey

@rabernat

23 Oct 2024

It’s a real honor to be part of this amazing collection of experts. Looking forward to helping spread the word about Cloud Native Geospatial!

CNG @cloudnativegeo

23 Oct 2024

Introducing the Founding CNG Editorial Board for the Cloud-Native Geospatial Forum (CNG)! This group of leaders from our community have graciously volunteered to guide our work. cloudnativegeo.org/blog/2024…

Matthew Rocklin · Oct 22, 2024 · 1:37 PM UTC

Ryan Abernathey retweeted

Matthew Rocklin @mrocklin

22 Oct 2024

"Large GeoSpatial Benchmarks: First Pass". Last month we asked for TiB scale geo workloads to form a benchmark suite. We got a strong response. Since then we've built these out into a public suite. This post goes over what's implemented and early results: docs.coiled.io/blog/geospati…

Large Scale Geospatial Benchmarks: First Pass

Summary: We implement several large-scale geo benchmarks. Most break. Fun! This article describes those benchmarks, what they attempt, how they break, and the technical work necessary to make them ...

docs.coiled.io

Pangeo · Oct 22, 2024 · 7:51 PM UTC

Ryan Abernathey retweeted

Pangeo @pangeo_data

22 Oct 2024

Join us tomorrow Oct 23 at 4 PM EDT for a Pangeo showcase on Icechunk: An Open-Source Transactional Storage Engine for Zarr by @rabernat! discourse.pangeo.io/t/pangeo…

Pangeo Showcase: "Icechunk: An Open-Source Transactional Storage Engine for Zarr"

Title: “Icechunk: An Open-Source Transactional Storage Engine for Zarr” Invited Speaker: Ryan Abernathey (ORCID: 0000-0001-5999-4917) When: Wednesday, October 23, 2024 at 4 PM EDT Where: Launch...

discourse.pangeo.io

Deepak Cherian · Oct 21, 2024 · 3:38 PM UTC

Ryan Abernathey retweeted

Deepak Cherian @cherian_deepak

21 Oct 2024

It's been a blast learning Rust and working with the @EarthmoverHQ team on Icechunk! Come see what it's all about. Absolutely worth it, I promise.

Ryan Abernathey

@rabernat

21 Oct 2024

⚡️ Icechunk is fast! What does this mean for users? Reduced cost for all data-intensive compute jobs and enhanced productivity for the data scientists who work with data all day long.

Icechunk, @EarthmoverHQ's new transactional cloud-native storage engine for array / tensor data, works together with @zarr_dev, augmenting the Zarr core data model with features that enhance performance, collaboration, and safety in a multi-user cloud-computing context.

Reading data through Icechunk is 36x faster than trying to read HDF5 files from cloud object storage, 6x faster than regular Zarr alone, and 2.5x faster than regular Zarr + Dask. Most importantly, Icechunk can achieve throughput on par with the compute instance network bandwidth, the "hardware limit" for I/O bound workloads.

Want to learn more about this benchmark? Come to our Icechunk informational webinar tomorrow, Tuesday, October 22nd from 12 - 1 PM EST. Registration link: share.hsforms.com/1SCOFqe2kT…

Ryan Abernathey · Oct 21, 2024 · 3:36 PM UTC

Ryan Abernathey

@rabernat

21 Oct 2024

Earthmover · Oct 17, 2024 · 2:09 PM UTC

Ryan Abernathey retweeted

Earthmover @EarthmoverHQ

17 Oct 2024

We’re hosting a webinar on Tuesday, October 22 from 12 - 1 PM EST to discuss what Icechunk means for the scientific data community and answer questions from attendees. Register here: share.hsforms.com/1SCOFqe2kT…

Earthmover @EarthmoverHQ

15 Oct 2024

🚀 We are thrilled to announce the release of the Icechunk storage engine, a new open-source library and specification for the storage of multidimensional array (a.k.a. tensor) data in cloud object storage. Read our blog post about Icechunk here: earthmover.io/blog/icechunk

Aravind 🌍 🛰 · Oct 17, 2024 · 1:12 PM UTC

Ryan Abernathey retweeted

Aravind 🌍 🛰 @aravindEO

17 Oct 2024

There has been so much interest and investment in launching new EO satellites in the past 3-5 years. But, there has been relatively limited interest and investment into solving the boring problems of standardization, interoperability and analysis-ready data* in EO.