Spigot - AWS Glue

Spigot

Specifies a transform that writes samples of the data to an Amazon S3 bucket.

Contents

Inputs

The data inputs identified by their node names.

Type: Array of strings

Array Members: Fixed number of 1 item.

Pattern: [A-Za-z0-9_-]*

Required: Yes

Name

The name of the transform node.

Type: String

Pattern: ([\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF]|[^\r\n])*

Required: Yes

Path

The Amazon S3 path where the transform writes a subset of records from the dataset, as a JSON file.

Type: String

Pattern: ([\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF]|[^\S\r\n"'])*

Required: Yes

Prob

The probability (a decimal value from 0 to 1) of picking any given record. A value of 1 means that every row read from the dataset is included in the sample output.

Type: Double

Valid Range: Minimum value of 0. Maximum value of 1.

Required: No
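As an illustration of what a per-record probability means, the following minimal sketch keeps each record independently with probability `prob`. It is not the AWS Glue implementation; the helper name `sample_records` and the optional `seed` parameter are assumptions made for this example only.

```python
import random
from typing import Iterable, Iterator, Optional, TypeVar

T = TypeVar("T")


def sample_records(
    records: Iterable[T], prob: float, seed: Optional[int] = None
) -> Iterator[T]:
    """Yield each record independently with probability `prob`.

    Hypothetical helper for illustration; not part of any AWS SDK.
    """
    rng = random.Random(seed)
    for record in records:
        # Mirror the documented valid range for Prob (0 to 1).
        if not 0.0 <= prob <= 1.0:
            raise ValueError("prob must be between 0 and 1")
        # random() returns a float in [0.0, 1.0), so prob=1 keeps every
        # record and prob=0 keeps none.
        if rng.random() < prob:
            yield record
```

With `prob=1.0` every record is kept, and with `prob=0.0` none are; intermediate values keep roughly that fraction of a large dataset.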

Topk

The number of records to write, starting from the beginning of the dataset.

Type: Integer

Valid Range: Minimum value of 0. Maximum value of 100.

Required: No
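The fields above can be assembled into a single structure. The sketch below builds a `Spigot` definition as a plain Python dictionary and checks the documented constraints on `Inputs`, `Prob`, and `Topk` before returning it. The function `build_spigot_node`, the node names, and the bucket path are hypothetical, and the `Name` and `Path` patterns are not checked here.

```python
import re
from typing import Any, Dict, Optional

# Pattern documented for each entry in Inputs.
NODE_ID_PATTERN = re.compile(r"[A-Za-z0-9_-]*")


def build_spigot_node(
    name: str,
    input_node: str,
    path: str,
    prob: Optional[float] = None,
    topk: Optional[int] = None,
) -> Dict[str, Any]:
    """Return a Spigot definition as a dict (hypothetical helper)."""
    if not NODE_ID_PATTERN.fullmatch(input_node):
        raise ValueError("input node name must match [A-Za-z0-9_-]*")

    # Inputs is an array with a fixed number of 1 item.
    node: Dict[str, Any] = {"Name": name, "Inputs": [input_node], "Path": path}

    if prob is not None:
        if not 0 <= prob <= 1:
            raise ValueError("Prob must be between 0 and 1")
        node["Prob"] = float(prob)

    if topk is not None:
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(topk, bool) or not isinstance(topk, int):
            raise TypeError("Topk must be an integer")
        if not 0 <= topk <= 100:
            raise ValueError("Topk must be between 0 and 100")
        node["Topk"] = topk

    return node


# Example values are placeholders, not real resources.
spigot = build_spigot_node(
    name="SampleOutput",
    input_node="node_1",
    path="s3://example-bucket/samples/",
    prob=0.25,
    topk=10,
)
```

Optional fields are left out of the dictionary when they are not supplied, which matches their `Required: No` status.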
