Amazon Personalize
Developer Guide

SIMS Recipe

The item-to-item similarities (SIMS) recipe is based on collaborative filtering. A SIMS model uses user-item interaction data to recommend items similar to a given item. When an item lacks sufficient user behavior data, the recipe recommends popular items instead.

This predefined recipe has the following properties:

  • Name: aws-sims

  • Recipe ARN: arn:aws:personalize:::recipe/aws-sims

  • Algorithm ARN: arn:aws:personalize:::algorithm/aws-sims

  • Feature Transformation ARN: arn:aws:personalize:::feature-transformation/sims

  • Recipe type: RELATED_ITEMS
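As an illustration of how the recipe ARN is used, the following minimal sketch builds the arguments for a `CreateSolution` request. It assumes the AWS SDK for Python (boto3) and its `create_solution` call on the `personalize` client. The solution name, the dataset group ARN, and both helper functions are hypothetical placeholders introduced for this example.

```python
SIMS_RECIPE_ARN = "arn:aws:personalize:::recipe/aws-sims"


def build_create_solution_request(name, dataset_group_arn):
    """Return keyword arguments for a CreateSolution call that uses SIMS."""
    return {
        "name": name,
        "datasetGroupArn": dataset_group_arn,
        "recipeArn": SIMS_RECIPE_ARN,
    }


def create_sims_solution(request):
    """Send the request to Amazon Personalize (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the request can be built without the SDK

    client = boto3.client("personalize")
    response = client.create_solution(**request)
    return response["solutionArn"]


# Hypothetical placeholder values; replace them with your own resources.
request = build_create_solution_request(
    name="my-sims-solution",
    dataset_group_arn=(
        "arn:aws:personalize:us-east-1:123456789012:dataset-group/my-group"
    ),
)
```

Building the request separately from sending it makes it possible to inspect the arguments before any AWS call is made. Calling `create_sims_solution(request)` would then submit it.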

The following table lists the hyperparameters used in the recipe. For each hyperparameter, the table gives the name, default value, and description, along with the following properties:

  • Range: [lower bound, upper bound]

  • Value type: Integer, Continuous (float), Categorical (boolean, list, string)

  • HPO tunable: Can the parameter participate in hyperparameter optimization (HPO)?

| Name | Default value | Range | Value type | HPO tunable | Description |
| Algorithm | | | | | |
| popularity_discount_factor | 0.5 | [0.0, 1.0] | float | Yes | |
| min_cointeraction_count | 3 | [0, 10] | integer | Yes | |
| Featurization | | | | | |
| min_user_history_length_percentile | 0.005 | [0.0, 1.0] | float | No | The minimum percentile of user history lengths to include in model training. The history length is the amount of available data for a user. |
| max_user_history_length_percentile | 0.995 | [0.0, 1.0] | float | No | The maximum percentile of user history lengths to include in model training. For example, setting min_user_history_length_percentile = 0.05 and max_user_history_length_percentile = 0.95 includes all users except the bottom and top 5% with respect to their history lengths. |
| min_item_interaction_count_percentile | 0.01 | [0.0, 1.0] | float | No | The minimum percentile of item interaction counts to include in model training. |
| max_item_interaction_count_percentile | 0.9 | [0.0, 1.0] | float | No | The maximum percentile of item interaction counts to include in model training. For example, setting min_item_interaction_count_percentile = 0.05 and max_item_interaction_count_percentile = 0.95 includes all items except the bottom and top 5% with respect to the number of times an item appears in the interactions dataset. |
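The ranges and value types in the table can be checked before a solution is created. The following minimal sketch validates a set of overrides and arranges them in the shape of a `solutionConfig` dictionary. It assumes boto3's `create_solution` accepts string-valued `algorithmHyperParameters` and `featureTransformationParameters` maps. Placing the Featurization parameters under `featureTransformationParameters` is also an assumption. `SPECS`, `CONFIG_KEYS`, and `build_solution_config` are hypothetical names introduced for illustration.

```python
# Group, lower bound, upper bound, and value type for each hyperparameter,
# copied from the table above.
SPECS = {
    "popularity_discount_factor": ("algorithm", 0.0, 1.0, float),
    "min_cointeraction_count": ("algorithm", 0, 10, int),
    "min_user_history_length_percentile": ("featurization", 0.0, 1.0, float),
    "max_user_history_length_percentile": ("featurization", 0.0, 1.0, float),
    "min_item_interaction_count_percentile": ("featurization", 0.0, 1.0, float),
    "max_item_interaction_count_percentile": ("featurization", 0.0, 1.0, float),
}

# Assumed mapping from the table's groups to solutionConfig keys.
CONFIG_KEYS = {
    "algorithm": "algorithmHyperParameters",
    "featurization": "featureTransformationParameters",
}


def build_solution_config(overrides):
    """Validate overrides against SPECS and return a solutionConfig dict."""
    config = {key: {} for key in CONFIG_KEYS.values()}
    for name, value in overrides.items():
        if name not in SPECS:
            raise KeyError(f"unknown SIMS hyperparameter: {name}")
        group, lower, upper, value_type = SPECS[name]
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            raise TypeError(f"{name} must be numeric")
        if value_type is int and not isinstance(value, int):
            raise TypeError(f"{name} must be an integer")
        if not lower <= value <= upper:
            raise ValueError(f"{name}={value} is outside [{lower}, {upper}]")
        # Hyperparameter values are passed as strings.
        config[CONFIG_KEYS[group]][name] = str(value_type(value))
    return config


config = build_solution_config(
    {
        "popularity_discount_factor": 0.3,
        "min_cointeraction_count": 5,
        "max_item_interaction_count_percentile": 0.95,
    }
)
```

The resulting dictionary could then be passed as the `solutionConfig` argument of `create_solution`. Hyperparameters that are not overridden are omitted, so they keep the defaults listed in the table.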