Introduction
LakeSail is a managed data-compute platform. You bring your AWS account and your data; LakeSail runs Spark and SQL workloads on top of them, so you do not have to operate the underlying clusters.
What you get
- Managed clusters in your cloud. LakeSail provisions Kubernetes clusters and Karpenter-driven compute inside a network it creates in your AWS account. Credentials never leave your account.
- Batch jobs. Reusable, versioned SQL or Python jobs that run on a schedule or on demand.
- Interactive sessions. Live Spark Connect endpoints that any compatible client (PySpark, Scala, Go, Rust) can attach to.
- Catalogs. Pointers to your existing data: AWS Glue, Iceberg REST, Unity, OneLake, or in-memory.
- Identity and access control. Members, teams, organization-wide roles, team-scoped roles, fine-grained policies, SSO, and MFA.
- Notifications. Channel- and rule-based delivery to email, Slack, webhooks, PagerDuty, or Rootly when jobs fail.
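
To make the interactive-sessions item concrete, here is a minimal sketch of how a PySpark client attaches to a Spark Connect endpoint. It uses the standard Spark Connect connection string (`sc://host:port/;key=value`) and PySpark's `SparkSession.builder.remote`. The helper names `spark_connect_url` and `connect`, the example host, and the use of a bearer token are assumptions for illustration only; check your session's details for the actual address and authentication method.

```python
from typing import Optional
from urllib.parse import quote


def spark_connect_url(
    host: str,
    port: int = 15002,  # Spark Connect's default port
    token: Optional[str] = None,
    use_ssl: bool = False,
) -> str:
    """Build a Spark Connect connection string: sc://host:port/;key=value;..."""
    params = []
    if use_ssl:
        params.append("use_ssl=true")
    if token is not None:
        # Percent-encode the token so it cannot break the ';'-separated parameters.
        params.append("token=" + quote(token, safe=""))
    url = f"sc://{host}:{port}"
    if params:
        url += "/;" + ";".join(params)
    return url


def connect(url: str):
    """Attach a PySpark client to the endpoint (needs pyspark with Connect support)."""
    # Imported lazily so the URL helper above works without pyspark installed.
    from pyspark.sql import SparkSession

    return SparkSession.builder.remote(url).getOrCreate()
```

With a hypothetical endpoint, `connect(spark_connect_url("session.example.com", port=443, token="my-token", use_ssl=True))` would return a `SparkSession` on which the usual `spark.sql(...)` calls run remotely.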
Built on Sail (open source)
The query engine inside every LakeSail cluster is Sail, our open-source Spark Connect-compatible engine. If you want to run the engine yourself, the docs live at docs.lakesail.com/sail. LakeSail Platform builds on top of Sail by adding cluster lifecycle management, scheduling, identity, billing, and observability.
Where to go next
- New here? Start with the Quickstart to set up your first cluster end to end in 10–15 minutes.
- Want the mental model first? Concepts walks through how organizations, infrastructure, catalogs, and workloads fit together.
- Just looking for the API? Jump to the API Reference.