Analytics
Please visit https://docs.lakekeeper.io for Documentation!
This is Lakekeeper: An Apache-Licensed, secure, fast and easy to use implementation of the Apache Iceberg REST Catalog specification based on apache/iceberg-rust. If you have questions, feature requests or just want a chat, we are hanging around in Discord!
A Docker Container is available on quay.io. We have prepared a minimal docker-compose file to demonstrate how to use the Lakekeeper catalog with common query engines.
git clone https://github.com/lakekeeper/lakekeeper.git
cd lakekeeper/examples/minimal
docker compose up
Then open your browser and head to localhost:8888 to load the example Jupyter notebooks or head to localhost:8181 for the Lakekeeper UI.
For more information on deployment, please check the Getting Started Guide.
The Iceberg Catalog REST interface has become the standard for catalogs in open Lakehouses. It natively enables multi-table commits, server-side deconflicting and much more. It is figuratively the (TIP) of the Iceberg.
LAKEKEEPER__OPENID_PROVIDER_URI
and you are good to go.ContractVerification
trait.Catalog
), SecretsStore
, Authorizer
, Events (CloudEventBackend
) and ContractVerification
as interfaces (Traits). This allows you to tap into any access management system of your company or stream change events to any system you like - simply by implementing a handful methods.spark
, pyiceberg
, trino
and starrocks
.Authorizer
trait.If you are missing something, we would love to hear about it in a Github Issue.
Operation | Status | Description |
---|---|---|
Namespace | All operations implemented | |
Table | All operations implemented - additional integration tests in development | |
Views | Remove unused files and log entries | |
Metrics | Endpoint is available but doesn't store the metrics |
Storage | Status | Comment |
---|---|---|
S3 - AWS | vended-credentials & remote-signing, assume role missing | |
S3 - Custom | vended-credentials & remote-signing, tested against Minio | |
Azure ADLS Gen2 | ||
Azure Blob | ||
Microsoft OneLake | ||
Google Cloud Storage |
Details on how to configure the storage profiles can be found in the Docs.
Backend | Status | Comment |
---|---|---|
Postgres | >=15 | |
MongoDB |
Backend | Status | Comment |
---|---|---|
Postgres | ||
kv2 (hcp-vault) | userpass auth |
Backend | Status | Comment |
---|---|---|
Nats | ||
Kafka |
Operation | Status | Description |
---|---|---|
Warehouse Management | Create / Update / Delete a Warehouse | |
AuthZ | Manage access to warehouses, namespaces and tables | |
More to come! |
Operation | Status | Description |
---|---|---|
OIDC (AuthN) | Secure access to the catalog via OIDC | |
Custom (AuthZ) | If you are willing to implement a single rust Trait, the AuthZHandler can be implement to connect to your system | |
OpenFGA (AuthZ) | Internal Authorization management |
Licensed under the Apache License, Version 2.0