HudiTarget - AWS Glue

HudiTarget

Specifies an Apache Hudi data source.

Contents

ConnectionName

The name of the connection to use to connect to the Hudi target. If your Hudi files are stored in buckets that require VPC authorization, you can set their connection properties here.

Type: String

Required: No

Exclusions

A list of glob patterns that identify paths to exclude from the crawl. For more information, see Catalog Tables with a Crawler.

Type: Array of strings

Required: No

MaximumTraversalDepth

The maximum depth of Amazon S3 paths that the crawler can traverse to discover the Hudi metadata folder in your Amazon S3 path. Used to limit the crawler run time.

Type: Integer

Required: No
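
To make the idea of a traversal depth concrete, the sketch below counts how many folder levels a candidate location sits below a root path and compares that count to a limit. The counting rule used here (one level per folder below the root) is an assumption for illustration only; this reference does not state exactly how the crawler measures depth. The helper `depth_below_root`, the bucket name, and the folder names are hypothetical; `.hoodie` is the conventional name of the Hudi metadata folder.

```python
def depth_below_root(root, candidate):
    """Count folder levels between two S3 locations.

    Assumed counting rule, for illustration only. Returns None when
    candidate is not located under root.
    """
    # Split on "/" and drop empty pieces (from "s3://" and trailing slashes).
    root_parts = [p for p in root.rstrip("/").split("/") if p]
    cand_parts = [p for p in candidate.rstrip("/").split("/") if p]
    if cand_parts[:len(root_parts)] != root_parts:
        return None
    return len(cand_parts) - len(root_parts)


# Placeholder locations, not values from this reference.
root = "s3://amzn-s3-demo-bucket/data/"
hoodie = "s3://amzn-s3-demo-bucket/data/sales/orders/.hoodie/"
max_depth = 2  # stands in for MaximumTraversalDepth

depth = depth_below_root(root, hoodie)
within_limit = depth is not None and depth <= max_depth
print(depth, within_limit)
```

Under this assumed rule, a metadata folder nested deeper than the limit would not be reached, which is how a smaller value shortens the crawl.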

Paths

An array of Amazon S3 location strings for Hudi, each indicating the root folder in which the metadata files for a Hudi table reside. The Hudi folder may be located in a child folder of the root folder.

The crawler will scan all folders underneath a path for a Hudi folder.

Type: Array of strings

Required: No
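
As a rough illustration of how the four fields fit together, the sketch below builds a HudiTarget as a plain Python dictionary and prints it as JSON. The helper `build_hudi_target` is hypothetical and not part of any AWS SDK, and the bucket name, connection name, and exclusion pattern are placeholders rather than values from this reference.

```python
import json


def build_hudi_target(paths=None, connection_name=None, exclusions=None,
                      maximum_traversal_depth=None):
    """Return a dict shaped like a HudiTarget, omitting unset fields.

    Hypothetical helper for illustration; every field is optional,
    matching the "Required: No" entries above.
    """
    target = {}
    if paths is not None:
        target["Paths"] = list(paths)                # Array of strings
    if connection_name is not None:
        target["ConnectionName"] = connection_name   # String
    if exclusions is not None:
        target["Exclusions"] = list(exclusions)      # Array of strings
    if maximum_traversal_depth is not None:
        target["MaximumTraversalDepth"] = int(maximum_traversal_depth)  # Integer
    return target


# Placeholder values: replace with your own bucket, connection, and patterns.
hudi_target = build_hudi_target(
    paths=["s3://amzn-s3-demo-bucket/hudi/"],
    connection_name="my-vpc-connection",
    exclusions=["**/_tmp/**"],
    maximum_traversal_depth=5,
)

print(json.dumps(hudi_target, indent=2))
```

With the AWS SDK for Python (Boto3), a dictionary like this would typically be placed in the `HudiTargets` list of the `Targets` argument to the Glue client's `create_crawler` call. That call is left out here because it needs credentials and an IAM role.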

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: