Performing automated sensitive data discovery with Amazon Macie
For broad visibility into where sensitive data might reside in your Amazon Simple Storage Service (Amazon S3) data estate, configure Amazon Macie to perform automated sensitive data discovery for your account or organization. With automated sensitive data discovery, Macie continually evaluates your S3 bucket inventory and uses sampling techniques to identify and select representative S3 objects in your buckets. Macie then retrieves and analyzes the selected objects, inspecting them for sensitive data.
By default, Macie analyzes objects by using the set of managed data identifiers that we recommend for automated sensitive data discovery. You can tailor the analyses by configuring Macie to use specific managed data identifiers, custom data identifiers, and allow lists when it performs automated sensitive data discovery for your account or organization. Also by default, Macie selects and analyzes objects from all of your S3 buckets. If you're the Macie administrator for an organization, this includes objects in S3 buckets that your member accounts own. You can adjust the scope of the analyses by excluding specific buckets, such as S3 buckets that typically store AWS logging data.
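The configuration described above can also be applied programmatically. The following is a minimal sketch, not a definitive implementation. It assumes that you use the AWS SDK for Python (Boto3), that `macie` is a client created with `boto3.client("macie2")`, and that the response from `get_automated_discovery_configuration` includes a `classificationScopeId` value. The function name `enable_automated_discovery` and the bucket name in the usage note are hypothetical.

```python
def enable_automated_discovery(macie, excluded_buckets=()):
    """Enable automated sensitive data discovery and optionally exclude buckets.

    `macie` is assumed to be a Boto3 Macie client, for example
    boto3.client("macie2"), created by the caller.
    """
    # Turn on automated sensitive data discovery for the account.
    macie.update_automated_discovery_configuration(status="ENABLED")

    # Look up the classification scope that stores the bucket exclusions.
    config = macie.get_automated_discovery_configuration()
    scope_id = config["classificationScopeId"]

    # Add the given buckets to the exclusion list, if any were supplied.
    if excluded_buckets:
        macie.update_classification_scope(
            id=scope_id,
            s3={
                "excludes": {
                    "bucketNames": list(excluded_buckets),
                    "operation": "ADD",
                }
            },
        )
    return scope_id
```

For example, calling `enable_automated_discovery(boto3.client("macie2"), ["amzn-s3-demo-logging-bucket"])` would enable the feature and exclude one bucket that stores logging data. Passing the client as a parameter keeps the function easy to exercise with a stub before you run it against a real account.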
As the analysis progresses each day, Macie produces records of the sensitive data that it finds and the analysis that it performs: sensitive data findings, which report sensitive data that Macie finds in individual S3 objects, and sensitive data discovery results, which log details about the analysis of individual S3 objects. Macie also updates statistics, inventory data, and other information that it provides about your Amazon S3 data.
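To retrieve the sensitive data findings that automated sensitive data discovery produces, you can filter findings by how they originated. The following is a hedged sketch under stated assumptions: `macie` is a Boto3 Macie client that the caller creates with `boto3.client("macie2")`, and the filter field `classificationDetails.originType` with the value `AUTOMATED_SENSITIVE_DATA_DISCOVERY` identifies these findings. The function name and the default page size are illustrative.

```python
def automated_discovery_finding_ids(macie, page_size=25):
    """Return the IDs of sensitive data findings from automated discovery.

    `macie` is assumed to be a Boto3 Macie client, for example
    boto3.client("macie2"), created by the caller.
    """
    # Assumed filter: sensitive data findings whose origin is automated discovery.
    criteria = {
        "criterion": {
            "category": {"eq": ["CLASSIFICATION"]},
            "classificationDetails.originType": {
                "eq": ["AUTOMATED_SENSITIVE_DATA_DISCOVERY"]
            },
        }
    }

    finding_ids = []
    next_token = None
    while True:
        kwargs = {"findingCriteria": criteria, "maxResults": page_size}
        if next_token:
            kwargs["nextToken"] = next_token
        page = macie.list_findings(**kwargs)
        finding_ids.extend(page.get("findingIds", []))
        # Keep requesting pages until the service stops returning a token.
        next_token = page.get("nextToken")
        if not next_token:
            break
    return finding_ids
```

You can pass the returned IDs to the `get_findings` operation, in batches, to retrieve the details of each finding.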
For example, an interactive heat map on the console provides a visual representation of data sensitivity across your data estate.
These features are designed to help you evaluate data sensitivity across your data estate, and drill down to investigate and assess individual accounts, buckets, and objects. They can also help you determine where to perform deeper, more immediate analysis by running sensitive data discovery jobs. Combined with information that Macie provides about the security and privacy of your Amazon S3 data, these features can also help you identify cases where immediate remediation might be necessary, such as a publicly accessible bucket in which Macie found sensitive data.
To configure and use automated sensitive data discovery, your account must be a standalone Macie account or the Macie administrator account for an organization.
Topics
- How automated sensitive data discovery works
- Configuring automated sensitive data discovery for your account
- Managing automated sensitive data discovery for individual S3 buckets
- Assessing automated sensitive data discovery coverage
- Reviewing automated sensitive data discovery statistics and results
- Sensitivity scoring for S3 buckets
- Default settings for automated sensitive data discovery