Batch enrollment in Amazon Connect Voice ID using audio data from prior calls
You can get a jump start on using biometrics by batch enrolling customers who have already consented for biometrics. Using stored audio recordings in your S3 bucket, and a JSON input file that provides the speaker identifier and a link to the audio recordings, you can invoke the Voice ID batch APIs.
To enroll customers programmatically, pass the following data to the API:
-
The domain ID to specify the domain to associate recordings to.
-
The location for the output file.
-
An input file containing a list of speakers. See Input and output file schema for the Speaker Enrollment Job in Amazon Connect Voice ID.
For each speaker the file must include:
-
A link to a call audio recording in a .wav file with 8KHz sample rate and PCM-16 encoding.
-
The corresponding
CustomerSpeakerId
for the customer. -
A channel for the caller in the audio recording. If the audio has multiple channels, you can select only one.
-
-
A KMS key to use when writing the output.
-
A role that Voice ID can assume. It must have access to the S3 bucket where the audio files are stored. This role must have access to any KMS key used to encrypt the files. It must also be able to write to the specified output location and use the KMS key requested for writing the output. Specifically, it must have the following permissions:
-
s3:GetObject
on the input bucket. -
s3:PutObject
on the output bucket. -
kms:Decrypt
on the KMS key used for input bucket’s default encryption. -
kms:Decrypt
andkms:GenerateDataKey
on the KMS key provided in the input which will be used for writing output file to the output bucket.
You must have
iam:PassRole
permissions when making the call and providing thedataAccessRole
. To enable confused deputy protection for thedataAccessRole
, see Amazon Connect Voice ID cross-service confused deputy prevention. -
-
Optionally, a fraud check skip flag in case you want to skip checks for fraud and voice spoofing on the enrollment audio.
-
Optionally, specify a fraudster watchlist that you want to perform fraud checks against. If no watchlist is specified, Voice ID uses the default fraudster watchlist for the domain.
-
Optionally, the fraud threshold in case you want to raise or lower the risk.
-
Optionally, a flag to re-enroll enrolled customers. This is useful if you want to refresh the audio recording, since the default is to ignore previously enrolled customers.
The batch enrollment returns the CustomerSpeakerId
,
GeneratedSpeakerId
, and associated status for each entry. It stores
this data in a JSON file at the output path you specify in the API.
Note
You are charged for enrolling speakers. For more information, see Amazon Connect Voice ID
Pricing