Handling errors for an SQS event source in Lambda
To handle errors related to an SQS event source, Lambda automatically uses a retry strategy with a
backoff strategy. You can also customize error handling behavior by configuring your SQS event
source mapping to return partial batch responses.
Backoff strategy for failed invocations
When an invocation fails, Lambda attempts to retry the invocation while implementing a backoff strategy.
The backoff strategy differs slightly depending on whether Lambda encountered the failure due to an error in
your function code, or due to throttling.
-
If your function code caused the error, Lambda will stop processing and retrying the invocation.
In the meantime, Lambda gradually backs off, reducing the amount of concurrency allocated to your Amazon SQS event source mapping.
After your queue's visibility timeout runs out, the message will again reappear in the queue.
-
If the invocation fails due to throttling, Lambda gradually backs off
retries by reducing the amount of concurrency allocated to your Amazon SQS event source mapping. Lambda continues
to retry the message until the message's timestamp exceeds your queue's visibility timeout, at which point
Lambda drops the message.
Implementing partial batch responses
When your Lambda function encounters an error while processing a batch, all messages in that batch become
visible in the queue again by default, including messages that Lambda processed successfully. As a result, your
function can end up processing the same message several times.
To avoid reprocessing successfully processed messages in a failed batch, you can configure your event
source mapping to make only the failed messages visible again. This is called a partial batch response.
To turn on partial batch responses, specify ReportBatchItemFailures
for the
FunctionResponseTypes
action when configuring your event source mapping. This lets your function
return a partial success, which can help reduce the number of unnecessary retries on records.
When ReportBatchItemFailures
is activated, Lambda doesn't scale down message polling when function invocations fail. If you expect some messages to fail—and you don't want those failures to impact the message processing rate—use ReportBatchItemFailures
.
Keep the following in mind when using partial batch responses:
-
If your function throws an exception, the entire batch is considered a complete failure.
-
If you're using this feature with a FIFO queue, your function should stop processing messages after the
first failure and return all failed and unprocessed messages in batchItemFailures
. This helps
preserve the ordering of messages in your queue.
To activate partial batch reporting
-
Review the Best practices for implementing partial batch responses.
-
Run the following command to activate ReportBatchItemFailures
for your function. To retrieve your event source mapping's UUID, run the list-event-source-mappings AWS CLI command.
aws lambda update-event-source-mapping \
--uuid "a1b2c3d4-5678-90ab-cdef-11111EXAMPLE"
\
--function-response-types "ReportBatchItemFailures"
-
Update your function code to catch all exceptions and return failed messages in a batchItemFailures
JSON response. The batchItemFailures
response must include a list of message IDs, as itemIdentifier
JSON values.
For example, suppose you have a batch of five messages, with message IDs id1
, id2
, id3
, id4
, and id5
. Your function successfully processes id1
, id3
, and id5
. To make messages id2
and id4
visible again in your queue, your function should return the following response:
{
"batchItemFailures": [
{
"itemIdentifier": "id2"
},
{
"itemIdentifier": "id4"
}
]
}
Here are some examples of function code that return the list of failed message IDs in the batch:
- .NET
-
- AWS SDK for .NET
-
There's more on GitHub. Find the complete example and learn how to set up and run in the
Serverless examples
repository.
Reporting SQS batch item failures with Lambda using .NET.
// Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
// SPDX-License-Identifier: Apache-2.0
using Amazon.Lambda.Core;
using Amazon.Lambda.SQSEvents;
// Assembly attribute to enable the Lambda function's JSON input to be converted into a .NET class.
[assembly: LambdaSerializer(typeof(Amazon.Lambda.Serialization.SystemTextJson.DefaultLambdaJsonSerializer))]
namespace sqsSample;
public class Function
{
public async Task<SQSBatchResponse> FunctionHandler(SQSEvent evnt, ILambdaContext context)
{
List<SQSBatchResponse.BatchItemFailure> batchItemFailures = new List<SQSBatchResponse.BatchItemFailure>();
foreach(var message in evnt.Records)
{
try
{
//process your message
await ProcessMessageAsync(message, context);
}
catch (System.Exception)
{
//Add failed message identifier to the batchItemFailures list
batchItemFailures.Add(new SQSBatchResponse.BatchItemFailure{ItemIdentifier=message.MessageId});
}
}
return new SQSBatchResponse(batchItemFailures);
}
private async Task ProcessMessageAsync(SQSEvent.SQSMessage message, ILambdaContext context)
{
if (String.IsNullOrEmpty(message.Body))
{
throw new Exception("No Body in SQS Message.");
}
context.Logger.LogInformation($"Processed message {message.Body}");
// TODO: Do interesting work based on the new message
await Task.CompletedTask;
}
}
- Go
-
- SDK for Go V2
-
There's more on GitHub. Find the complete example and learn how to set up and run in the
Serverless examples
repository.
Reporting SQS batch item failures with Lambda using Go.
// Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
// SPDX-License-Identifier: Apache-2.0
package main
import (
"context"
"encoding/json"
"fmt"
"github.com/aws/aws-lambda-go/events"
"github.com/aws/aws-lambda-go/lambda"
)
func handler(ctx context.Context, sqsEvent events.SQSEvent) (map[string]interface{}, error) {
batchItemFailures := []map[string]interface{}{}
for _, message := range sqsEvent.Records {
if /* Your message processing condition here */ {
batchItemFailures = append(batchItemFailures, map[string]interface{}{"itemIdentifier": message.MessageId})
}
}
sqsBatchResponse := map[string]interface{}{
"batchItemFailures": batchItemFailures,
}
return sqsBatchResponse, nil
}
func main() {
lambda.Start(handler)
}
- Java
-
- SDK for Java 2.x
-
There's more on GitHub. Find the complete example and learn how to set up and run in the
Serverless examples
repository.
Reporting SQS batch item failures with Lambda using Java.
// Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
// SPDX-License-Identifier: Apache-2.0
import com.amazonaws.services.lambda.runtime.Context;
import com.amazonaws.services.lambda.runtime.RequestHandler;
import com.amazonaws.services.lambda.runtime.events.SQSEvent;
import com.amazonaws.services.lambda.runtime.events.SQSBatchResponse;
import java.util.ArrayList;
import java.util.List;
public class ProcessSQSMessageBatch implements RequestHandler<SQSEvent, SQSBatchResponse> {
@Override
public SQSBatchResponse handleRequest(SQSEvent sqsEvent, Context context) {
List<SQSBatchResponse.BatchItemFailure> batchItemFailures = new ArrayList<SQSBatchResponse.BatchItemFailure>();
String messageId = "";
for (SQSEvent.SQSMessage message : sqsEvent.getRecords()) {
try {
//process your message
messageId = message.getMessageId();
} catch (Exception e) {
//Add failed message identifier to the batchItemFailures list
batchItemFailures.add(new SQSBatchResponse.BatchItemFailure(messageId));
}
}
return new SQSBatchResponse(batchItemFailures);
}
}
- JavaScript
-
- SDK for JavaScript (v3)
-
There's more on GitHub. Find the complete example and learn how to set up and run in the
Serverless examples
repository.
Reporting SQS batch item failures with Lambda using JavaScript.
// Node.js 20.x Lambda runtime, AWS SDK for Javascript V3
export const handler = async (event, context) => {
const batchItemFailures = [];
for (const record of event.Records) {
try {
await processMessageAsync(record, context);
} catch (error) {
batchItemFailures.push({ itemIdentifier: record.messageId });
}
}
return { batchItemFailures };
};
async function processMessageAsync(record, context) {
if (record.body && record.body.includes("error")) {
throw new Error("There is an error in the SQS Message.");
}
console.log(`Processed message: ${record.body}`);
}
Reporting SQS batch item failures with Lambda using TypeScript.
// Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
// SPDX-License-Identifier: Apache-2.0
import { SQSEvent, SQSBatchResponse, Context, SQSBatchItemFailure, SQSRecord } from 'aws-lambda';
export const handler = async (event: SQSEvent, context: Context): Promise<SQSBatchResponse> => {
const batchItemFailures: SQSBatchItemFailure[] = [];
for (const record of event.Records) {
try {
await processMessageAsync(record);
} catch (error) {
batchItemFailures.push({ itemIdentifier: record.messageId });
}
}
return {batchItemFailures: batchItemFailures};
};
async function processMessageAsync(record: SQSRecord): Promise<void> {
if (record.body && record.body.includes("error")) {
throw new Error('There is an error in the SQS Message.');
}
console.log(`Processed message ${record.body}`);
}
- PHP
-
- SDK for PHP
-
There's more on GitHub. Find the complete example and learn how to set up and run in the
Serverless examples
repository.
Reporting SQS batch item failures with Lambda using PHP.
// Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
// SPDX-License-Identifier: Apache-2.0
<?php
use Bref\Context\Context;
use Bref\Event\Sqs\SqsEvent;
use Bref\Event\Sqs\SqsHandler;
use Bref\Logger\StderrLogger;
require __DIR__ . '/vendor/autoload.php';
class Handler extends SqsHandler
{
private StderrLogger $logger;
public function __construct(StderrLogger $logger)
{
$this->logger = $logger;
}
/**
* @throws JsonException
* @throws \Bref\Event\InvalidLambdaEvent
*/
public function handleSqs(SqsEvent $event, Context $context): void
{
$this->logger->info("Processing SQS records");
$records = $event->getRecords();
foreach ($records as $record) {
try {
// Assuming the SQS message is in JSON format
$message = json_decode($record->getBody(), true);
$this->logger->info(json_encode($message));
// TODO: Implement your custom processing logic here
} catch (Exception $e) {
$this->logger->error($e->getMessage());
// failed processing the record
$this->markAsFailed($record);
}
}
$totalRecords = count($records);
$this->logger->info("Successfully processed $totalRecords SQS records");
}
}
$logger = new StderrLogger();
return new Handler($logger);
- Python
-
- SDK for Python (Boto3)
-
There's more on GitHub. Find the complete example and learn how to set up and run in the
Serverless examples
repository.
Reporting SQS batch item failures with Lambda using Python.
# Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
# SPDX-License-Identifier: Apache-2.0
def lambda_handler(event, context):
if event:
batch_item_failures = []
sqs_batch_response = {}
for record in event["Records"]:
try:
# process message
except Exception as e:
batch_item_failures.append({"itemIdentifier": record['messageId']})
sqs_batch_response["batchItemFailures"] = batch_item_failures
return sqs_batch_response
- Ruby
-
- SDK for Ruby
-
There's more on GitHub. Find the complete example and learn how to set up and run in the
Serverless examples
repository.
Reporting SQS batch item failures with Lambda using Ruby.
# Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
# SPDX-License-Identifier: Apache-2.0
require 'json'
def lambda_handler(event:, context:)
if event
batch_item_failures = []
sqs_batch_response = {}
event["Records"].each do |record|
begin
# process message
rescue StandardError => e
batch_item_failures << {"itemIdentifier" => record['messageId']}
end
end
sqs_batch_response["batchItemFailures"] = batch_item_failures
return sqs_batch_response
end
end
- Rust
-
- SDK for Rust
-
There's more on GitHub. Find the complete example and learn how to set up and run in the
Serverless examples
repository.
Reporting SQS batch item failures with Lambda using Rust.
// Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
// SPDX-License-Identifier: Apache-2.0
use aws_lambda_events::{
event::sqs::{SqsBatchResponse, SqsEvent},
sqs::{BatchItemFailure, SqsMessage},
};
use lambda_runtime::{run, service_fn, Error, LambdaEvent};
async fn process_record(_: &SqsMessage) -> Result<(), Error> {
Err(Error::from("Error processing message"))
}
async fn function_handler(event: LambdaEvent<SqsEvent>) -> Result<SqsBatchResponse, Error> {
let mut batch_item_failures = Vec::new();
for record in event.payload.records {
match process_record(&record).await {
Ok(_) => (),
Err(_) => batch_item_failures.push(BatchItemFailure {
item_identifier: record.message_id.unwrap(),
}),
}
}
Ok(SqsBatchResponse {
batch_item_failures,
})
}
#[tokio::main]
async fn main() -> Result<(), Error> {
run(service_fn(function_handler)).await
}
If the failed events do not return to the queue, see How do I troubleshoot Lambda function SQS ReportBatchItemFailures? in the AWS Knowledge Center.
Success and failure conditions
Lambda treats a batch as a complete success if your function returns any of the following:
Lambda treats a batch as a complete failure if your function returns any of the following:
-
An invalid JSON response
-
An empty string itemIdentifier
-
A null itemIdentifier
-
An itemIdentifier
with a bad key name
-
An itemIdentifier
value with a message ID that doesn't exist
CloudWatch metrics
To determine whether your function is correctly reporting batch item failures, you can monitor the
NumberOfMessagesDeleted
and ApproximateAgeOfOldestMessage
Amazon SQS metrics in
Amazon CloudWatch.
-
NumberOfMessagesDeleted
tracks the number of messages removed from your queue. If this
drops to 0, this is a sign that your function response is not correctly returning failed messages.
-
ApproximateAgeOfOldestMessage
tracks how long the oldest message has stayed in your queue.
A sharp increase in this metric can indicate that your function is not correctly returning failed
messages.