Skip to content

HDDS-15642. Design document for Delta Sharing support on Ozone#10582

Draft
spacemonkd wants to merge 2 commits into
apache:masterfrom
spacemonkd:HDDS-15642
Draft

HDDS-15642. Design document for Delta Sharing support on Ozone#10582
spacemonkd wants to merge 2 commits into
apache:masterfrom
spacemonkd:HDDS-15642

Conversation

@spacemonkd

Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

HDDS-15642. Design document for Delta Sharing support on Ozone

Please describe your PR in detail:
What changes were proposed in this pull request?

This PR adds a design document for Delta Sharing Protocol support in Apache Ozone. It proposes a new standalone Delta Sharing Gateway service that enables secure, real-time sharing of Ozone-stored datasets via the open Delta Sharing protocol.

Core features:

  • Implements Delta Sharing REST API for sharing Delta Lake and Parquet tables stored in Ozone
  • URL-based access mode: generates pre-signed S3 URLs for file downloads
  • Directory-based access mode: issues temporary S3 credentials for direct table access
  • Bearer token authentication and per-recipient authorization
  • Support for table versioning and change data feed (CDF)

Future work:

  • Apache Iceberg table support
  • Additional formats (ORC, Avro, CSV, JSON) with server-side Parquet conversion option
  • Raw file sharing (ML models, logs, documents)
  • Native Ozone protocol access (ofs://) as alternative to S3
  • Multi-cluster federation and cross-cluster sharing

Key design decisions:

  • Standalone service (not embedded in S3 Gateway) for independent lifecycle and scaling
  • File-based YAML config in Phase 1, migrates to OM metadata in Phase 4
  • Delta Kernel library for Delta log parsing
  • Service identity + gateway-level audit log for authentication

Why is this needed?

Organizations store large datasets in Ozone but lack a standardized way to share data with external teams (Pandas, Spark, Databricks, Tableau) without data duplication or granting direct storage credentials. Delta Sharing solves this by providing a simple REST API and pre-signed URL/credential-based access.

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-15642

How was this patch tested?

N/A

@spacemonkd spacemonkd self-assigned this Jun 22, 2026
@errose28

Copy link
Copy Markdown
Contributor

Thanks for adding this. Looks like the project is called OpenSharing now.

Building on the original Delta Sharing protocol, OpenSharing defines a cross-platform, vendor-neutral protocol for secure zero-copy sharing of Data and AI assets.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants