Product and Technology

Polaris Catalog Is Now Open Source

In June 2024, Snowflake announced Polaris Catalog to provide organizations and the Iceberg community new levels of choice, flexibility and control over their data. It enables more open, secure lakehouse architectures with broad read-and-write interoperability and cross-engine access controls. Apache Iceberg™ has greatly improved data mobility by establishing a vast community around an open standard, and the next logical step is an open, community-driven catalog to complement Iceberg. This opens the door for truly vendor-neutral interoperability that many organizations want.

As of today, Polaris Catalog is open source under the Apache 2.0 license and is now available on GitHub. Snowflake’s new service powered by Polaris Catalog is now available in public preview for Snowflake customers.

Interoperability through community

Just as large communities have grown in support of open source projects for open file and table formats, there is a community emerging to collaborate on standards for metadata catalogs. Diversity of ideas and community contributions creates the most interoperable catalog across the widest variety of tools. 

Polaris Catalog implements Apache Iceberg’s REST catalog specification, which means it already enables interoperability with Apache Doris™, Apache Flink™, Apache Spark™, Daft, DuckDB, Presto, SingleStore, Snowflake, Starburst, Trino, Upsolver and more. In addition, Alation, ALTR, Atlan, Collibra, dbt Labs, data.world, Dremio, Confluent, Fivetran, Google Cloud, Immuta, Microsoft, Project Nessie, and Salesforce also intend to add integrations or make contributions to the Polaris Catalog open source project.

Contributing Project Nessie capabilities to Polaris Catalog

Project Nessie is an open source intelligent metastore and catalog for Apache Iceberg™ with Git-like semantics. Created by Dremio co-founders, it became an Apache-licensed project in 2020. 

The team at Dremio is excited to help bring the various functions and capabilities of Nessie into the Polaris  project. Contributing the capabilities of Project Nessie to Polaris Catalog will form an inclusive community dedicated to developing the most robust open source catalog for open lakehouse architectures. Innovating in one project reduces catalog sprawl and enables a broader group of contributors to drive rapid advancements. This partnership not only accelerates technical progress but also brings more contributors into the Nessie community, further strengthening the growing ecosystem around Polaris. To learn more about the Nessie ecosystem, read this.

Quote Icon

As co-founders of Apache Arrow™, creators of Project Nessie and significant contributors to Apache Iceberg™, openness is ingrained in Dremio's culture. We are delighted to support the launch of Polaris Catalog as open source under the Apache license and look forward to actively contributing to its success. With over four years of experience building Project Nessie as an open source Apache Iceberg™ Catalog, we're excited to share its differentiated capabilities, such as catalog-level versioning, multi-engine support, multi-table transactions and Git for data, with Polaris Catalog and the broader community.”

Tomer Shiran
Co-Founder and CPO, Dremio

Snowflake service, powered by Polaris Catalog, now in public preview

In addition to open sourcing, Snowflake’s service powered by Polaris Catalog is now available in public preview for Snowflake customers. This service is powered by the open source implementation of Polaris Catalog, and is an easy way to get started even if you don’t use Snowflake. You can use this service with the many engines listed above to both read and write to Iceberg tables with cross-engine security.

While other vendor-hosted catalogs deviate from the open source specification, which leads to lock-in, Snowflake’s service for Polaris Catalog is designed to be fully compatible with Polaris Catalog’s open source implementation both now and in the future. Snowflake handles the responsibilities of running the service like providing an endpoint, deploying bug fixes, and users get a completely portable catalog for their data, which can be used with Iceberg REST catalog-compatible tools.

Get started today

The Essential Guide to Modernizing Data Lakes for AI with Snowflake

Polaris Catalog: An Open Source Catalog for Apache Iceberg

Introducing Polaris Catalog: An open source solution for Apache Iceberg, enabling cross-engine interoperability.

Snowflake Expands Partnership with Microsoft to Improve Interoperability Through Apache Iceberg

Discover how Snowflake & Microsoft's expanded partnership improves data interoperability through Apache Iceberg, offering cost-effective solutions.

Open Storage with Iceberg Tables Now Generally Available

Open, interoperable storage with Iceberg Tables is now available, offering our customers enhanced flexibility, performance, and governance on open data.

Govern an Open Lakehouse with Snowflake Open Catalog

Snowflake Open Catalog, a managed service for Apache Polaris (incubating), enables secure interoperability for data lakes and lakehouses, simplifying data governance.

Unlock Open Lakehouse with Microsoft OneLake and Snowflake

Address data fragmentation with an open lakehouse, using Apache Iceberg to unify data silos and enable governed interoperability with Microsoft OneLake.

Expanding the Data Cloud with Apache Iceberg

Snowflake announces external table support for Apache Iceberg, which provides additional flexibility & interoperability, in private preview.

Iceberg Tables: Catalog Support Now Available

The latest Iceberg Tables release with SDK includes catalog support to ensure that engines outside of Snowflake can easily interoperate with Iceberg Tables.

TPC-DS at 100TB & 10TB Scale Now Available in Snowflake Samples

100 TB and 10TB versions of TPC-DS data, along with samples of the benchmark's 99 queries, are available now as sample data sets in Snowflake.

How FoundationDB Powers Snowflake Metadata Forward

In the future, you can expect even more cool features enabled by Snowflake’s metadata and powered by FoundationDB. Snowflake’s engineering team looks forward to contributing to FoundationdDB open source and collaborating with its community.

Subscribe to our blog newsletter

Get the best, coolest and latest delivered to your inbox each week

Where Data Does More

  • 30-day free trial
  • No credit card required
  • Cancel anytime