DataFormat

class aws_cdk.aws_glue.DataFormat(*, input_format, output_format, serialization_library, classification_string=None)

Bases: object

(experimental) Defines the input/output formats and ser/de for a single DataFormat.

Stability:

experimental

ExampleMetadata:

infused

Example:

# my_database: glue.Database

glue.Table(self, "MyTable",
    database=my_database,
    table_name="my_table",
    columns=[glue.Column(
        name="col1",
        type=glue.Schema.STRING
    )],
    partition_keys=[glue.Column(
        name="year",
        type=glue.Schema.SMALL_INT
    ), glue.Column(
        name="month",
        type=glue.Schema.SMALL_INT
    )],
    data_format=glue.DataFormat.JSON
)
Parameters:
  • input_format (InputFormat) – (experimental) InputFormat for this data format.

  • output_format (OutputFormat) – (experimental) OutputFormat for this data format.

  • serialization_library (SerializationLibrary) – (experimental) Serialization library for this data format.

  • classification_string (Optional[ClassificationString]) – (experimental) Classification string given to tables with this data format. Default: - No classification is specified.

Stability:

experimental

Attributes

APACHE_LOGS = <aws_cdk.aws_glue.DataFormat object>
AVRO = <aws_cdk.aws_glue.DataFormat object>
CLOUDTRAIL_LOGS = <aws_cdk.aws_glue.DataFormat object>
CSV = <aws_cdk.aws_glue.DataFormat object>
JSON = <aws_cdk.aws_glue.DataFormat object>
LOGSTASH = <aws_cdk.aws_glue.DataFormat object>
ORC = <aws_cdk.aws_glue.DataFormat object>
PARQUET = <aws_cdk.aws_glue.DataFormat object>
TSV = <aws_cdk.aws_glue.DataFormat object>
classification_string

(experimental) Classification string given to tables with this data format.

Stability:

experimental

input_format

(experimental) InputFormat for this data format.

Stability:

experimental

output_format

(experimental) OutputFormat for this data format.

Stability:

experimental

serialization_library

(experimental) Serialization library for this data format.

Stability:

experimental