AWS::Glue::Crawler S3Target

Specifies a data store in Amazon Simple Storage Service (Amazon S3).

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

JSON

{
  "ConnectionName" : String,
  "DlqEventQueueArn" : String,
  "EventQueueArn" : String,
  "Exclusions" : [ String, ... ],
  "Path" : String,
  "SampleSize" : Integer
}

YAML

ConnectionName: String
DlqEventQueueArn: String
EventQueueArn: String
Exclusions:
  - String
Path: String
SampleSize: Integer

Properties

ConnectionName

The name of a connection that allows a job or crawler to access data in Amazon S3 within an Amazon Virtual Private Cloud (Amazon VPC) environment.

Required: No

Type: String

Update requires: No interruption
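
As a minimal sketch, the fragment below shows where ConnectionName sits in a single S3 target. It is meant to be placed under the Targets property of an AWS::Glue::Crawler resource. The connection name and the bucket name are hypothetical placeholders, and the sketch assumes that a connection with that name already exists in the account.

```yaml
# Fragment of an AWS::Glue::Crawler resource (goes under Properties).
Targets:
  S3Targets:
    - Path: s3://amzn-s3-demo-bucket/private-data/   # placeholder bucket and prefix
      # Hypothetical name of an existing AWS Glue connection that gives
      # the crawler a route to Amazon S3 from inside the VPC.
      ConnectionName: example-vpc-connection
```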

DlqEventQueueArn

A valid ARN of an Amazon SQS dead-letter queue. For example, arn:aws:sqs:region:account:deadLetterQueue.

Required: No

Type: String

Update requires: No interruption

EventQueueArn

A valid Amazon SQS ARN. For example, arn:aws:sqs:region:account:sqs.

Required: No

Type: String

Update requires: No interruption
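
The following fragment is a hedged sketch of how EventQueueArn and DlqEventQueueArn might be set together on one S3 target. The account ID, Region, queue names, and bucket name are placeholders. The RecrawlPolicy setting is not described in this section; it is included on the assumption that the crawler is intended to run in Amazon S3 event mode, and it can be dropped if that does not apply.

```yaml
# Fragment of an AWS::Glue::Crawler resource (goes under Properties).
Targets:
  S3Targets:
    - Path: s3://amzn-s3-demo-bucket/events/   # placeholder bucket and prefix
      # Placeholder ARN of the queue that receives the S3 event notifications.
      EventQueueArn: arn:aws:sqs:us-east-1:111122223333:example-crawler-events
      # Placeholder ARN of the dead-letter queue for that event queue.
      DlqEventQueueArn: arn:aws:sqs:us-east-1:111122223333:example-crawler-events-dlq
# Assumption: event-driven recrawls are wanted. This property belongs to the
# crawler resource itself, not to S3Target.
RecrawlPolicy:
  RecrawlBehavior: CRAWL_EVENT_MODE
```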

Exclusions

A list of glob patterns that identify the objects to exclude from the crawl. For more information, see Catalog Tables with a Crawler.

Required: No

Type: Array of String

Update requires: No interruption
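
As an illustration, the fragment below excludes two kinds of objects from one S3 target. The bucket name and both patterns are examples chosen for this sketch, and the comments assume that patterns are evaluated relative to the target's Path, as the AWS Glue crawler documentation describes.

```yaml
# Fragment of an AWS::Glue::Crawler resource (goes under Properties).
Targets:
  S3Targets:
    - Path: s3://amzn-s3-demo-bucket/sales/   # placeholder bucket and prefix
      Exclusions:
        - "**.tmp"       # skip objects ending in .tmp at any depth under Path
        - "archive/**"   # skip everything under the archive/ folder
```

Quoting each pattern keeps YAML from interpreting the leading asterisk as an alias marker.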

Path

The path to the Amazon S3 target.

Required: No

Type: String

Update requires: No interruption
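
For context, here is a minimal template sketch with a crawler that has a single S3 target identified only by Path. The parameter name, logical ID, database name, and bucket name are all hypothetical, and the template assumes that an IAM role AWS Glue can assume already exists.

```yaml
AWSTemplateFormatVersion: "2010-09-09"
Parameters:
  CrawlerRoleArn:                 # hypothetical parameter name
    Type: String
    Description: ARN of an existing IAM role that AWS Glue can assume
Resources:
  ExampleCrawler:                 # hypothetical logical ID
    Type: AWS::Glue::Crawler
    Properties:
      Role: !Ref CrawlerRoleArn
      DatabaseName: example_db    # placeholder Data Catalog database
      Targets:
        S3Targets:
          # The S3Target entry described on this page.
          - Path: s3://amzn-s3-demo-bucket/sales/
```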

SampleSize

Sets the number of files in each leaf folder to be crawled when crawling sample files in a dataset. If not set, all the files are crawled. A valid value is an integer between 1 and 249.

Required: No

Type: Integer

Update requires: No interruption
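
The fragment below sketches a target that samples files instead of crawling every object. The value 5 is arbitrary within the documented range of 1 to 249, and the bucket name is a placeholder.

```yaml
# Fragment of an AWS::Glue::Crawler resource (goes under Properties).
Targets:
  S3Targets:
    - Path: s3://amzn-s3-demo-bucket/logs/   # placeholder bucket and prefix
      # Crawl 5 files in each leaf folder rather than all of them.
      SampleSize: 5
```

Omitting SampleSize restores the default behavior, in which all files are crawled.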