S3 Gateway #

Use the embedded S3 Gateway to send or receive data through the S3 protocol using object storage tooling such as Minio, boto3, or AWS s3 CLI. Operations available are similar to those officially documented for S3.

S3 Gateway Syntax #

The S3 gateway presents each branch from every HPE Machine Learning Data Management repository as an S3 bucket. Buckets are represented via [<commit>.]<branch>.<repo>.<project>, with the commit being optional.

The master.foo.bar bucket corresponds to the master branch of the repo foo within the bar project.
The be97b64f110643389f171eb64697d4e1.master.foo.bar bucket corresponds to the commit be97b64f110643389f171eb64697d4e1 on the master branch of the foo repo within the bar project.

If auth is enabled, credentials must be passed with each S3 gateway endpoint as mentioned in S3 Client configuration.

Command Examples #

The following command examples assume that you have upgraded to use the embedded proxy, which will become mandatory in future releases.

Put Data Into HPE Machine Learning Data Management Repo #

Retrieve Data From HPE Machine Learning Data Management Repo #

Port Forwarding #

You can pachctl port-forward to access the s3 gateway through the localhost:30600 endpoint, however, the Kubernetes port forwarder incurs substantial overhead and does not recover well from broken connections.

AWS CLI

Learn how to configure AWS CLI for the S3 Gateway

Boto3

Learn how to configure Boto for the S3 Gateway

Credentials

Learn how to configure an S3 client.

MinIO

Learn how to configure MinIO for the S3 Gateway

Unsupported Operations

Learn which S3 operations are not supported.