Introduction
Sail ⛵ is an open-source compute engine designed for both single-host and distributed settings. Our mission is to unify batch processing, stream processing, and compute-intensive (AI) workloads.
If you already use Apache Spark, you can switch to Sail without changing your Spark code, and benefit from a modern systems programming language (Rust) and design principles tailored to modern compute workloads.
Sail is well-positioned in the composable data stack. It is built on top of solid foundations such as the Apache Arrow in-memory format and the Apache DataFusion query engine. We are working on integrations with the Ibis DataFrame library, lakehouse formats, and additional cloud storage engines. We hope Sail can assist you in delivering value from data at scale, no matter how and where the data is stored.