Step 2: Load Sample Data - Amazon DynamoDB

Step 2: Load Sample Data

In this step, you populate the Movies table with sample data.

This scenario uses a sample data file that contains information about a few thousand movies from the Internet Movie Database (IMDb). The movie data is in JSON format, as shown in the following example. For each movie, there is a year, a title, and a JSON map named info.

[ { "year" : ... , "title" : ... , "info" : { ... } }, { "year" : ..., "title" : ..., "info" : { ... } }, ... ]

In the JSON data, note the following:

  • The year and title are used as the primary key attribute values for the Movies table.

  • The rest of the info values are stored in a single attribute called info. This program illustrates how you can store JSON in an Amazon DynamoDB attribute.

The following is an example of movie data.

{ "year" : 2013, "title" : "Turn It Down, Or Else!", "info" : { "directors" : [ "Alice Smith", "Bob Jones" ], "release_date" : "2013-01-18T00:00:00Z", "rating" : 6.2, "genres" : [ "Comedy", "Drama" ], "image_url" : "", "plot" : "A rock band plays their music at high volumes, annoying the neighbors.", "rank" : 11, "running_time_secs" : 5215, "actors" : [ "David Matthewman", "Ann Thomas", "Jonathan G. Neff" ] } }

Step 2.1: Download the Sample Data File

  1. Download the sample data archive:

  2. Extract the data file (moviedata.json) from the archive.

  3. Copy the moviedata.json file into your current directory.

Step 2.2: Load the Sample Data into the Movies Table

After you download the sample data, you can run the following program to populate the Movies table.

  1. Copy the following program and paste it into a file named

    from decimal import Decimal import json import boto3 def load_movies(movies, dynamodb=None): if not dynamodb: dynamodb = boto3.resource('dynamodb', endpoint_url="http://localhost:8000") table = dynamodb.Table('Movies') for movie in movies: year = int(movie['year']) title = movie['title'] print("Adding movie:", year, title) table.put_item(Item=movie) if __name__ == '__main__': with open("moviedata.json") as json_file: movie_list = json.load(json_file, parse_float=Decimal) load_movies(movie_list)
  2. To run the program, enter the following command.