Table Of Contents

Feedback

User Guide

First time using the AWS CLI? See the User Guide for help getting started.

[ aws . kinesisanalyticsv2 ]

discover-input-schema

Description

Infers a schema for an SQL-based Amazon Kinesis Data Analytics application by evaluating sample records on the specified streaming source (Kinesis data stream or Kinesis Data Firehose delivery stream) or Amazon S3 object. In the response, the operation returns the inferred schema and also the sample records that the operation used to infer the schema.

You can use the inferred schema when configuring a streaming source for your application. When you create an application using the Kinesis Data Analytics console, the console uses this operation to infer a schema and show it in the console user interface.

See also: AWS API Documentation

See 'aws help' for descriptions of global parameters.

Synopsis

  discover-input-schema
[--resource-arn <value>]
--service-execution-role <value>
[--input-starting-position-configuration <value>]
[--s3-configuration <value>]
[--input-processing-configuration <value>]
[--cli-input-json <value>]
[--generate-cli-skeleton <value>]

Options

--resource-arn (string)

The Amazon Resource Name (ARN) of the streaming source.

--service-execution-role (string)

The ARN of the role that is used to access the streaming source.

--input-starting-position-configuration (structure)

The point at which you want Kinesis Data Analytics to start reading records from the specified streaming source discovery purposes.

Shorthand Syntax:

InputStartingPosition=string

JSON Syntax:

{
  "InputStartingPosition": "NOW"|"TRIM_HORIZON"|"LAST_STOPPED_POINT"
}

--s3-configuration (structure)

Specify this parameter to discover a schema from data in an Amazon S3 object.

Shorthand Syntax:

BucketARN=string,FileKey=string

JSON Syntax:

{
  "BucketARN": "string",
  "FileKey": "string"
}

--input-processing-configuration (structure)

The InputProcessingConfiguration to use to preprocess the records before discovering the schema of the records.

Shorthand Syntax:

InputLambdaProcessor={ResourceARN=string}

JSON Syntax:

{
  "InputLambdaProcessor": {
    "ResourceARN": "string"
  }
}

--cli-input-json (string) Performs service operation based on the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, the CLI values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally.

--generate-cli-skeleton (string) Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value input, prints a sample input JSON that can be used as an argument for --cli-input-json. If provided with the value output, it validates the command inputs and returns a sample output JSON for that command.

See 'aws help' for descriptions of global parameters.

Output

InputSchema -> (structure)

The schema inferred from the streaming source. It identifies the format of the data in the streaming source and how each data element maps to corresponding columns in the in-application stream that you can create.

RecordFormat -> (structure)

Specifies the format of the records on the streaming source.

RecordFormatType -> (string)

The type of record format.

MappingParameters -> (structure)

When you configure application input at the time of creating or updating an application, provides additional mapping information specific to the record format (such as JSON, CSV, or record fields delimited by some delimiter) on the streaming source.

JSONMappingParameters -> (structure)

Provides additional mapping information when JSON is the record format on the streaming source.

RecordRowPath -> (string)

The path to the top-level parent that contains the records.

CSVMappingParameters -> (structure)

Provides additional mapping information when the record format uses delimiters (for example, CSV).

RecordRowDelimiter -> (string)

The row delimiter. For example, in a CSV format, 'n' is the typical row delimiter.

RecordColumnDelimiter -> (string)

The column delimiter. For example, in a CSV format, a comma (",") is the typical column delimiter.

RecordEncoding -> (string)

Specifies the encoding of the records in the streaming source. For example, UTF-8.

RecordColumns -> (list)

A list of RecordColumn objects.

(structure)

For an SQL-based Amazon Kinesis Data Analytics application, describes the mapping of each data element in the streaming source to the corresponding column in the in-application stream.

Also used to describe the format of the reference data source.

Name -> (string)

The name of the column that is created in the in-application input stream or reference table.

Mapping -> (string)

A reference to the data element in the streaming input or the reference data source.

SqlType -> (string)

The type of column created in the in-application input stream or reference table.

ParsedInputRecords -> (list)

An array of elements, where each element corresponds to a row in a stream record (a stream record can have more than one row).

(list)

(string)

ProcessedInputRecords -> (list)

The stream data that was modified by the processor specified in the InputProcessingConfiguration parameter.

(string)

RawInputRecords -> (list)

The raw stream data that was sampled to infer the schema.

(string)