Configuring the X-Ray SDK for Python
The X-Ray SDK for Python has a class named xray_recorder
that provides the global recorder. You can
configure the global recorder to customize the middleware that creates segments for incoming HTTP calls.
Sections
Service plugins
Use plugins
to record information about the service hosting
your application.
Plugins
Amazon EC2 –
EC2Plugin
adds the instance ID, Availability Zone, and the CloudWatch Logs Group.Elastic Beanstalk –
ElasticBeanstalkPlugin
adds the environment name, version label, and deployment ID.Amazon ECS –
ECSPlugin
adds the container ID.
To use a plugin, call configure
on the xray_recorder
.
from aws_xray_sdk.core import xray_recorder
from aws_xray_sdk.core import patch_all
xray_recorder.configure(service='My app')
plugins = ('ElasticBeanstalkPlugin', 'EC2Plugin')
xray_recorder.configure(plugins=plugins)
patch_all()
Note
Since plugins
are passed in as a tuple, be sure to include a trailing ,
when specifying a single plugin. For example, plugins = ('EC2Plugin',)
You can also use environment variables, which take precedence over values set in code, to configure the recorder.
Configure plugins before patching libraries to record downstream calls.
The SDK also uses plugin settings to set the origin
field on the segment. This indicates the type of AWS resource that runs your application.
When you use multiple plugins, the SDK uses the following resolution order to determine the origin: ElasticBeanstalk > EKS > ECS > EC2.
Sampling rules
The SDK uses the sampling rules you define in the X-Ray console to determine which requests to record. The default rule traces the first request each second, and five percent of any additional requests across all services sending traces to X-Ray. Create additional rules in the X-Ray console to customize the amount of data recorded for each of your applications.
The SDK applies custom rules in the order in which they are defined. If a request matches multiple custom rules, the SDK applies only the first rule.
Note
If the SDK can't reach X-Ray to get sampling rules, it reverts to a default local rule of the first request each second, and five percent of any additional requests per host. This can occur if the host doesn't have permission to call sampling APIs, or can't connect to the X-Ray daemon, which acts as a TCP proxy for API calls made by the SDK.
You can also configure the SDK to load sampling rules from a JSON document. The SDK can use local rules as a backup for cases where X-Ray sampling is unavailable, or use local rules exclusively.
Example sampling-rules.json
{
"version": 2,
"rules": [
{
"description": "Player moves.",
"host": "*",
"http_method": "*",
"url_path": "/api/move/*",
"fixed_target": 0,
"rate": 0.05
}
],
"default": {
"fixed_target": 1,
"rate": 0.1
}
}
This example defines one custom rule and a default rule. The custom rule applies a five-percent
sampling rate with no minimum number of requests to trace for paths under /api/move/
.
The default rule traces the first request each second and 10 percent of additional requests.
The disadvantage of defining rules locally is that the fixed target is applied by each instance of the recorder independently, instead of being managed by the X-Ray service. As you deploy more hosts, the fixed rate is multiplied, making it harder to control the amount of data recorded.
On AWS Lambda, you cannot modify the sampling rate. If your function is called by an instrumented service, calls that generated requests that were sampled by that service will be recorded by Lambda. If active tracing is enabled and no tracing header is present, Lambda makes the sampling decision.
To configure backup sampling rules, call xray_recorder.configure
, as shown in the following
example, where rules
is either a dictionary of rules or the absolute path to a JSON
file containing sampling rules.
xray_recorder.configure(sampling_rules=
rules
)
To use only local rules, configure the recorder with a LocalSampler
.
from aws_xray_sdk.core.sampling.local.sampler import LocalSampler xray_recorder.configure(sampler=LocalSampler())
You can also configure the global recorder to disable sampling and instrument all incoming requests.
Example main.py – Disable sampling
xray_recorder.configure(sampling=False)
Logging
The SDK uses Python’s built-in logging
module with a default WARNING
logging level. Get a
reference to the logger for the aws_xray_sdk
class and call setLevel
on it to configure
the different log level for the library and the rest of your application.
Example app.py – Logging
logging.basicConfig(level='WARNING')
logging.getLogger('aws_xray_sdk').setLevel(logging.ERROR)
Use debug logs to identify issues, such as unclosed subsegments, when you generate subsegments manually.
Recorder configuration in code
Additional settings are available from the configure
method on xray_recorder
.
-
context_missing
– Set toLOG_ERROR
to avoid throwing exceptions when your instrumented code attempts to record data when no segment is open. -
daemon_address
– Set the host and port of the X-Ray daemon listener. -
service
– Set a service name that the SDK uses for segments. -
plugins
– Record information about your application's AWS resources. -
sampling
– Set toFalse
to disable sampling. -
sampling_rules
– Set the path of the JSON file containing your sampling rules.
Example main.py – Disable context missing exceptions
from aws_xray_sdk.core import xray_recorder
xray_recorder.configure(context_missing='LOG_ERROR')
Recorder configuration with Django
If you use the Django framework, you can use the Django settings.py
file to configure
options on the global recorder.
-
AUTO_INSTRUMENT
(Django only) – Record subsegments for built-in database and template rendering operations. -
AWS_XRAY_CONTEXT_MISSING
– Set toLOG_ERROR
to avoid throwing exceptions when your instrumented code attempts to record data when no segment is open. -
AWS_XRAY_DAEMON_ADDRESS
– Set the host and port of the X-Ray daemon listener. -
AWS_XRAY_TRACING_NAME
– Set a service name that the SDK uses for segments. -
PLUGINS
– Record information about your application's AWS resources. -
SAMPLING
– Set toFalse
to disable sampling. -
SAMPLING_RULES
– Set the path of the JSON file containing your sampling rules.
To enable recorder configuration in settings.py
, add the Django middleware to the list of
installed apps.
Example settings.py – Installed apps
INSTALLED_APPS = [
...
'django.contrib.sessions',
'aws_xray_sdk.ext.django',
]
Configure the available settings in a dict named XRAY_RECORDER
.
Example settings.py – Installed apps
XRAY_RECORDER = {
'AUTO_INSTRUMENT': True,
'AWS_XRAY_CONTEXT_MISSING': 'LOG_ERROR',
'AWS_XRAY_DAEMON_ADDRESS': '127.0.0.1:5000',
'AWS_XRAY_TRACING_NAME': 'My application',
'PLUGINS': ('ElasticBeanstalkPlugin', 'EC2Plugin', 'ECSPlugin'),
'SAMPLING': False,
}
Environment variables
You can use environment variables to configure the X-Ray SDK for Python. The SDK supports the following variables:
-
AWS_XRAY_TRACING_NAME
– Set a service name that the SDK uses for segments. Overrides the service name that you set programmatically. -
AWS_XRAY_SDK_ENABLED
– When set tofalse
, disables the SDK. By default, the SDK is enabled unless the environment variable is set to false.-
When disabled, the global recorder automatically generates dummy segments and subsegments that are not sent to the daemon, and automatic patching is disabled. Middlewares are written as a wrapper over the global recorder. All segment and subsegment generation through the middleware also become dummy segment and dummy subsegments.
-
Set the value of
AWS_XRAY_SDK_ENABLED
through the environment variable or through direct interaction with theglobal_sdk_config
object from theaws_xray_sdk
library. Settings to the environment variable override these interactions.
-
AWS_XRAY_DAEMON_ADDRESS
– Set the host and port of the X-Ray daemon listener. By default, the SDK uses127.0.0.1:2000
for both trace data (UDP) and sampling (TCP). Use this variable if you have configured the daemon to listen on a different port or if it is running on a different host.Format
-
Same port –
address
:port
-
Different ports –
tcp:
address
:port
udp:address
:port
-
AWS_XRAY_CONTEXT_MISSING
– Set toRUNTIME_ERROR
to throw exceptions when your instrumented code attempts to record data when no segment is open.Valid Values
-
RUNTIME_ERROR
– Throw a runtime exception. -
LOG_ERROR
– Log an error and continue (default). -
IGNORE_ERROR
– Ignore error and continue.
Errors related to missing segments or subsegments can occur when you attempt to use an instrumented client in startup code that runs when no request is open, or in code that spawns a new thread.
-
Environment variables override values set in code.