AWS DataSync
User Guide

What Is AWS DataSync?

AWS DataSync is a data transfer service that simplifies, automates, and accelerates moving and replicating data between on-premises storage systems and AWS storage services over the internet or AWS Direct Connect. As a fully managed service, DataSync removes the need to modify applications, develop scripts, or manage infrastructure.

DataSync currently supports data transfer between Network File System (NFS) and Amazon Elastic File System (Amazon EFS), or Amazon Simple Storage Service (Amazon S3).

In this guide, you can find a description of the components of DataSync, detailed instructions on how to get started, and the API reference.

Use Cases

These are some of the main use cases for AWS DataSync:

  • Data migration – move active data sets rapidly over the network into Amazon S3 or Amazon EFS. DataSync includes automatic encryption and data integrity validation to make sure your data arrives securely, intact, and ready to use.

  • Data movement for timely in-cloud processing – move data into or out of AWS for processing when working with systems that generate data on-premises. This approach speeds up critical hybrid cloud workflows across many industries. These include video production in media and entertainment, seismic research in oil and gas, machine learning in life science, and big data analytics in finance.

  • Data protection – replicate and backup data to Amazon S3 for online copies that can be archived to Amazon S3 Glacier or sent to Amazon EFS for a standby file system. DataSync transfer tasks are always incremental and only transfer data that has changed.

Benefits

Using AWS DataSync provides the following benefits:

  • Simplify and automate data movement. Using DataSync, you can easily transfer data between on-premises sources and AWS storage over the network. AWS DataSync automates management of the infrastructure and the transfer processes for you. DataSync also includes encryption and data validation. This minimizes the time for in-house development and management that is otherwise needed for fast, reliable, and secure transfers.

  • Transfer data fast rapidly over the network into AWS, at a rate up to 10 Gbps. This approach speeds up migrations, hybrid workflows for analytics and machine learning, and data protection processes.

  • Reduce data transfer costs and move data cost-effectively with the flat, per-gigabyte pricing in DataSync. You also save on script development and management costs, and avoid the need for costly commercial transfer tools.

Additional AWS DataSync Resources

We recommend that you read the following sections.

AWS DataSync supports Terraform— To learn more about DataSync deployment automation with Terraform, see aws_datasync_location_efs in the Terraform documentation.