AWS::Glue::Classifier CsvClassifier - AWS CloudFormation

AWS::Glue::Classifier CsvClassifier

A classifier for custom CSV content.

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

JSON

{
  "AllowSingleColumn" : Boolean,
  "ContainsCustomDatatype" : [ String, ... ],
  "ContainsHeader" : String,
  "CustomDatatypeConfigured" : Boolean,
  "Delimiter" : String,
  "DisableValueTrimming" : Boolean,
  "Header" : [ String, ... ],
  "Name" : String,
  "QuoteSymbol" : String
}

YAML

AllowSingleColumn: Boolean
ContainsCustomDatatype:
  - String
ContainsHeader: String
CustomDatatypeConfigured: Boolean
Delimiter: String
DisableValueTrimming: Boolean
Header:
  - String
Name: String
QuoteSymbol: String

Properties

AllowSingleColumn

Enables the processing of files that contain only one column.

Required: No

Type: Boolean

Update requires: No interruption

ContainsCustomDatatype

A list of the custom data types that the CSV file contains.

Required: No

Type: Array of String

Update requires: No interruption

ContainsHeader

Indicates whether the CSV file contains a header.

A value of UNKNOWN specifies that the classifier will detect whether the CSV file contains headings.

A value of PRESENT specifies that the CSV file contains headings.

A value of ABSENT specifies that the CSV file does not contain headings.

Required: No

Type: String

Allowed values: UNKNOWN | PRESENT | ABSENT

Update requires: No interruption
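
As an illustration of the allowed values, the fragment below is a minimal sketch of a classifier that leaves header detection to AWS Glue by setting ContainsHeader to UNKNOWN. The classifier name is hypothetical, and the fragment is meant to sit under the Properties of an AWS::Glue::Classifier resource.

```yaml
CsvClassifier:
  Name: autodetect-csv      # hypothetical classifier name
  ContainsHeader: UNKNOWN   # let the classifier detect whether a header row exists
  Delimiter: ','
```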

CustomDatatypeConfigured

Enables the configuration of custom data types.

Required: No

Type: Boolean

Update requires: No interruption

Delimiter

A custom single-character symbol that separates the column entries in each row.

Required: No

Type: String

Pattern: [^\r\n]

Minimum: 1

Maximum: 1

Update requires: No interruption

DisableValueTrimming

When set to true, values are not trimmed before the classifier identifies the type of column values. The default value is true.

Required: No

Type: Boolean

Update requires: No interruption
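
A minimal sketch of overriding the default, assuming you want values to be trimmed before the column type is identified. The classifier name is hypothetical.

```yaml
CsvClassifier:
  Name: trimmed-csv             # hypothetical classifier name
  Delimiter: ','
  DisableValueTrimming: false   # trim values before the column type is identified
```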

Header

A list of strings representing column names.

Required: No

Type: Array of String

Update requires: No interruption

Name

The name of the classifier.

Required: No

Type: String

Pattern: [\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\t]*

Minimum: 1

Maximum: 255

Update requires: No interruption

QuoteSymbol

A custom single-character symbol that encloses content to be treated as a single column value. It must be different from the column delimiter.

Required: No

Type: String

Pattern: [^\r\n]

Minimum: 1

Maximum: 1

Update requires: No interruption
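
To illustrate how Delimiter and QuoteSymbol fit together, here is a minimal sketch for pipe-delimited files whose values are wrapped in single quotes. Each symbol is a single character and the two differ from each other, as the constraints on these properties require. The classifier name is hypothetical.

```yaml
CsvClassifier:
  Name: pipe-delimited-csv   # hypothetical classifier name
  Delimiter: '|'             # one character, not a carriage return or line feed
  QuoteSymbol: "'"           # one character, different from the delimiter
```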

Examples

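Create a custom CSV classifier

The following example creates a custom CSV classifier named csvclassify. It expects a header row, uses a comma as the delimiter and a double quote as the quote symbol, and names the columns id and name.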

JSON

{
  "Description": "AWS Glue custom classifier test",
  "Resources": {
    "MyCSVclassifier": {
      "Type": "AWS::Glue::Classifier",
      "Properties": {
        "CsvClassifier": {
          "AllowSingleColumn": true,
          "ContainsHeader": "PRESENT",
          "Delimiter": ",",
          "Header": [
            "id",
            "name"
          ],
          "Name": "csvclassify",
          "QuoteSymbol": "\""
        }
      }
    }
  }
}

YAML

Description: AWS Glue custom classifier test
Resources:
  MyCSVclassifier:
    Type: 'AWS::Glue::Classifier'
    Properties:
      CsvClassifier:
        AllowSingleColumn: true
        ContainsHeader: PRESENT
        Delimiter: ','
        Header:
          - id
          - name
        Name: csvclassify
        QuoteSymbol: '"'