Accessing data feeds - AWS Marketplace

Accessing data feeds

With AWS Marketplace, you can use data feeds as a mechanism to send structured, up-to-date product and customer information from AWS Marketplace systems an Amazon S3 buckets for ETL (extract, transform, and load) between seller-owned business intelligence tools. You need to configure your environment to receive data feeds to an encrypted Amazon S3 bucket. This topic shows you how to access and unsubscribe from data feeds.

Access a data feed

  1. Allocate a business intelligence or data engineer with SQL and ETL (extract, transform, load) experience. This person also needs experience setting up APIs.

  2. Set up an Amazon Simple Storage Service bucket and a subscription to the data feeds. Use the AWS seller account ID associated with your Marketplace product listings. To do so, you can watch this YouTube video or follow the steps below.

    The video and the steps explain how to use a AWS CloudFormation template that helps simplify configuration.

    1. Open a web browser and sign into the AWS Marketplace Management Portal, then go to Set up customer data storage.

    2. Choose Create resources with AWS CloudFormation template to open the template in the AWS CloudFormation console in another window.

    3. In the template, specify the following and then choose Next:

      • Stack name – The collection of resources you're creating to enable access to data feeds.

      • Amazon S3 bucket name – The bucket for storing data feeds.

      • (Optional) Amazon SNS topic name – The topic for receiving notifications when AWS delivers new data to the Amazon S3 bucket.

    4. On the Review page, confirm your entries and choose Create stack. This will open a new page with the CloudFormation status and details.

    5. From the Resources tab, copy Amazon Resource Names (ARNs) for the following resources from the CloudFormation page into the fields on the AWS Marketplace Set up customer data storage page:

      • Amazon S3 bucket for storing data feeds

      • AWS KMS key for encrypting the Amazon S3 bucket

      • (Optional) Amazon SNS topic for receiving notifications when AWS delivers new data to the Amazon S3 bucket

    6. On the Set up customer data storage page, choose Submit.

    7. (Optional) Edit the policies created by the CloudFormation template. See Data feed policies for more details.

      You are now subscribed to data feeds. The next time data feeds are generated, you can access the data.

  3. Use an ETL (extract, transform, load) operation to connect the data feeds to your data warehouse or relational database.

    Note

    Data tools have different capabilities. You must involve a business intelligence engineer or data engineer to set up the integration to match your tool’s capabilities.

  4. To run or create SQL queries, configure the data feeds to enforce primary and foreign keys in your data tool. Each data feed represents a unique table, and you must set up all data feeds in data schema with the entity relationships. For more information about the tables and entity relationships, see Data feed tables overview in this guide.

  5. Setup Amazon Simple Notification Service to automatically refresh your data warehouse or relational database. You can configure Amazon SNS notifications to send alerts when data from each unique feed is delivered to an S3 bucket. These notifications can be leveraged to automatically refresh seller data warehouse when new data is received via data feeds, if seller data tool supports this capability. See Getting started with Amazon SNS in the Amazon Simple Notification Service Developer Guide.

    Example notification:

    { "mainExecutionId": "1bc08b11-ab4b-47e1-866a-9c8f38423a98", "executionId": "52e862a9-42d2-41e0-8010-810af84d39b1", "subscriptionId": "27ae3961-b13a-44bc-a1a7-365b2dc181fd", "processedFiles": [], "executionStatus": "SKIPPED", "errors": [], "feedType": "[data feed name]" }

    Notifications can have the following executionStatus states:

    • SKIPPED – The seller has no new data for the day.

    • COMPLETED – We delivered the feed with new data.

    • FAILED – The feed delivery has an issue.

  6. Validate the setup by running SQL queries. You can use the sample queries in this guide, or the queries on GitHub, at https://github.com/aws-samples/aws-marketplace-api-samples/tree/main/seller-data-feeds/queries.

    Note

    The sample queries in this guide were written for AWS Athena. You may need to modify the queries for use with your tools.

  7. Determine where business users want to consume data. For example, you can:

    • Export .csv data from your data warehouse or SQL database.

    • Connect your data to a visualization tool such as PowerBI or Tableau.

    • Map data to your CRM, ERP, or financial tools, such as Salesforce, Infor, or Netsuite.

For more information about AWS CloudFormation templates, see Working with AWS CloudFormation templates in the AWS CloudFormation User Guide.

Data feed policies

When your Amazon S3 bucket is created by the CloudFormation template, it will create policies for access attached to that bucket, the AWS KMS key, and the Amazon SNS topic. The policies allow the AWS Marketplace reports service to write to your bucket and SNS topic with the data feed information. Each policy will have a section like the following (this example is from the Amazon S3 bucket).

{ "Sid": "AwsMarketplaceDataFeedsAccess", "Effect": "Allow", "Principal": { "Service": "reports.marketplace.amazonaws.com" }, "Action": [ "s3:ListBucket", "s3:GetObject", "s3:PutObject", "s3:GetEncryptionConfiguration", "s3:GetBucketAcl", "s3:PutObjectAcl" ], "Resource": [ "arn:aws:s3:::datafeed-bucket", "arn:aws:s3:::datafeed-bucket/*" ] },

In this policy, reports.marketplace.amazonaws.com is the service principal that AWS Marketplace uses to push data to the Amazon S3 bucket. The datafeed-bucket is the bucket that you specified in the CloudFormation template.

When the AWS Marketplace reports service calls Amazon S3, AWS KMS, or Amazon SNS, it will provide the ARN of the data it is intending to write to the bucket when it does. To ensure that the only data written to your bucket is data written on your behalf, you can specify the aws:SourceArn in the condition of the policy. In the following example, you must replace the account-id with the ID for your AWS account.

{ "Sid": "AwsMarketplaceDataFeedsAccess", "Effect": "Allow", "Principal": { "Service": "reports.marketplace.amazonaws.com" }, "Action": [ "s3:ListBucket", "s3:GetObject", "s3:PutObject", "s3:GetEncryptionConfiguration", "s3:GetBucketAcl", "s3:PutObjectAcl" ], "Resource": [ "arn:aws:s3:::datafeed-amzn-s3-demo-bucket", "arn:aws:s3:::datafeed-amzn-s3-demo-bucket/*" , "Condition": { "StringEquals": { "aws:SourceAccount": "account-id", "aws:SourceArn": ["arn:aws:marketplace::account-id:AWSMarketplace/SellerDataSubscription/DataFeeds_V1", "arn:aws:marketplace::account-id:AWSMarketplace/SellerDataSubscription/Example-Report"] } } },

Unsubscribe from data feeds

Open a web browser and sign in to the AWS Marketplace Management Portal. Then, go to the Contact us page to submit an unsubscribe request to the AWS Marketplace Seller Operations team. The unsubscribe request can take up to 10 business days to process.