Introduction

Sail is a distributed multimodal compute engine well-positioned in the composable data stack. It is built on top of solid foundations such as the Apache Arrow in-memory format and the Apache DataFusion query engine. Sail delivers value from data at scale, no matter how and where the data is stored.

If you are already using Apache Spark, you can switch to Sail without making code changes, and enjoy the benefits delivered by a modern system programming language (Rust) and design principles tailored to the modern compute world.

Spark DataFrame API

Data Types

Spark SQL

Data Types

Literals

Functions and Operators

User-Defined Functions

Data Sources

Delta Lake

Iceberg

Data Storage

Catalog

System

Integrations

Deployment

Building Docker Images

Introduction

Data Types

Data Types

Literals

Delta Lake

Iceberg

System

Building Docker Images

Introduction ​

Introduction