Amazon DynamoDB
Developer Guide (API Version 2012-08-10)

Step 2: Load Sample Data

In this step, you populate the Movies table with sample data.

This scenario uses a sample data file that contains information about a few thousand movies from the Internet Movie Database (IMDb). The movie data is in JSON format, as shown in the following example. For each movie, there is a year, a title, and a JSON map named info.

[ { "year" : ... , "title" : ... , "info" : { ... } }, { "year" : ..., "title" : ..., "info" : { ... } }, ... ]

In the JSON data, note the following:

  • The year and title are used as the primary key attribute values for the Movies table.

  • The rest of the info values are stored in a single attribute called info. This program illustrates how you can store JSON in a DynamoDB attribute.

The following is an example of movie data:

{ "year" : 2013, "title" : "Turn It Down, Or Else!", "info" : { "directors" : [ "Alice Smith", "Bob Jones" ], "release_date" : "2013-01-18T00:00:00Z", "rating" : 6.2, "genres" : [ "Comedy", "Drama" ], "image_url" : "", "plot" : "A rock band plays their music at high volumes, annoying the neighbors.", "rank" : 11, "running_time_secs" : 5215, "actors" : [ "David Matthewman", "Ann Thomas", "Jonathan G. Neff" ] } }

Step 2.1: Download the Sample Data File

  1. Download the sample data archive:

  2. Extract the data file (moviedata.json) from the archive.

  3. Copy and paste the moviedata.json file into your current directory.

Step 2.2: Load the Sample Data into the Movies Table

After you download the sample data, you can run the following program to populate the Movies table.

  1. Copy and paste the following program into a file named

    from __future__ import print_function # Python 2/3 compatibility import boto3 import json import decimal dynamodb = boto3.resource('dynamodb', region_name='us-west-2', endpoint_url="http://localhost:8000") table = dynamodb.Table('Movies') with open("moviedata.json") as json_file: movies = json.load(json_file, parse_float = decimal.Decimal) for movie in movies: year = int(movie['year']) title = movie['title'] info = movie['info'] print("Adding movie:", year, title) table.put_item( Item={ 'year': year, 'title': title, 'info': info, } )
  2. To run the program, type the following command: