Prerequisite: Prepare project inputs in S3 Sign-in and create a job Tracking transformation progress Set up a connector Assess and reimagine job plan Reimagine job plan Custom job plan

Transformation of mainframe applications

AWS Transform accelerates the transformation of your mainframe modernization applications. This topic describes the available capabilities.

Topics

Prerequisite: Prepare project inputs in S3
Mainframe source artifacts collection
Sign-in and create a job
Tracking transformation progress
Set up a connector
Assess and reimagine job plan
Reimagine job plan
Custom job plan

Prerequisite: Prepare project inputs in S3

AWS Transform is capable of handling complex mainframe codebases. To use codebase, make sure you have all the assets in your S3 location.

Key project inputs:

Source code: You must upload your mainframe source code files to S3. This includes COBOL programs, JCL scripts, copybooks, and any other relevant source files.
Data files: If you have any VSAM files or other data files that your mainframe applications use, these need to be uploaded to S3.
Configuration files: Include configuration files specific to your mainframe environment.

Other project inputs

System Management Facility (SMF) records: If applicable, upload SMF records to a new folder in the S3 bucket where the source code is stored, these records should be in a .zip file format.
Formatting: For technical documentation generation, you can leverage an optional configuration file to generate PDF documents which aligns with your required formats and standards, including headers, footers, logos, and customized information.
Glossary: AWS Transform AWS Transform leverages automation with Generative AI for documentation generation and business rule extraction. Including a glossary CSV file with information about important abbreviations and terminologies in the root directory of your zip file will help improve the generated documentation quality.
Test data: If available, upload test data sets that can be used to validate the modernized application. This data should be stored in a new folder in the S3 bucket where the source code is stored.

Details on project inputs can be found here.

To sign into the AWS Transform web experience, follow all the instructions in the Getting started with AWS Transform section of the documentation.

To create and start a job: Follow the steps in Start your project.

Create workspace: Name and describe your workspace where jobs, collaborators, and associated artifacts will be stored.

Create job: Create a job by selecting from preconfigured job plans, or customize the job plan based on your objective by selecting from the list of supported capabilities.

Important

AWS Transform will refuse operations from you if you don’t have the proper permissions. For example, a contributor cannot cancel a job transformation of mainframe applications or delete a job. Only an administrator can perform these functions.

When you create your job you can select from the capabilities below, but the Kickoff step is always required as it is where the location of the source code for the project is located. In the first job set up in a workspace you are required to set up a connector to your Amazon S3 bucket.

Tracking transformation progress

You can track the progress of the transformation throughout the process in two ways:

Worklog – This provides a detailed log of the actions AWS Transform takes, along with human input requests, and your responses to those requests.
Dashboard – This provides high-level summary of the mainframe application transformation. It shows metrics on number of jobs transformed, transformation applied, and estimated time to complete the transformation of mainframe applications. You can also see details of each step including, lines of code by file types, generated documentation by each file type, the decomposed code, and the migration plan.

Set up a connector

AWS Transform uses a connector to access resources in your account required for mainframe modernization functions. Your connector is automatically configured with the first job you run in the workspace. Depending on the job plan you choose, AWS Transform guides you to create your connector.

Mainframe reimagine connector

The mainframe reimagine connector is for jobs running the assess and reimagine workflows. It uses an S3 bucket to access and store transformation resources, as well as an Amazon Neptune cluster to store extracted artifacts. The Neptune cluster serves as the unified knowledge graph that stores all extracted artifacts about your job, and the knowledge graph answers all questions in the AWS Transform chat interface.

Deploy the Amazon Neptune cluster in a VPC where AWS Transform can create elastic network interfaces (ENIs). These are the network connections that allow AWS Transform to securely communicate with Neptune in your VPC to load data and query your knowledge graph.

AWS CloudFormation

We recommend using the following CloudFormation template to create the resources required for your mainframe reimagine connector. Use the same S3 bucket that you plan to use for the connector when running the CloudFormation template. For more information about using CloudFormation templates, see Working with templates.

To download the CloudFormation template, choose neptune-kg-setup.yaml.

The template accepts the following parameters:

BucketName – Required. S3 bucket name for Neptune bulk loader access. You must use the same S3 bucket that you plan to use in your AWS Transform connector.
KmsKeyId – Optional. KMS key for Neptune encryption. Accepts key ID, ARN, alias name, or alias ARN. Defaults to the AWS managed key if omitted.

Provide the following details when you configure your connector:

S3 bucket ARN
Neptune cluster ARN
Neptune cluster resource ID
Application subnet IDs
Application security group ID
Neptune S3 bulk loader role ARN

Important

The Outputs tab of the CloudFormation template includes the details you need to provide to AWS Transform when configuring the connector.

Custom configuration

If you prefer to use an existing VPC or other tools to create the required infrastructure, your environment must meet the following requirements:

Neptune cluster

Amazon Neptune Serverless with engine version 1.4.5.1 or later
- IAM authentication enabled
- Storage encryption enabled (AWS managed key or customer-managed KMS key)
Serverless scaling configuration (recommended: 1–128 NCUs)
(Recommended) Read replica in a second Availability Zone for high availability and improved read performance

Networking

A VPC with DNS support and DNS hostnames enabled
At least two subnets in separate Availability Zones. AWS Transform creates ENIs in these subnets to access the Neptune cluster, which can reside in the same subnets or in dedicated subnets. Size subnets at /20 to accommodate VPC endpoint and application ENIs.
Network connectivity from the subnets to the following AWS services through VPC interface endpoints (AWS PrivateLink):
- AWS Transform Agents API – Coordinates the mainframe modernization jobs
- Amazon Bedrock Runtime – Invokes foundation models for analysis and reasoning
- Amazon Relational Database Service (Amazon RDS) – Neptune cluster management
- Amazon Elastic Compute Cloud (Amazon EC2) – Management of ENIs
- Amazon CloudWatch – Metrics for AWS Transform observability
An S3 gateway endpoint attached to the route tables associated with your subnets. AWS Transform uses this endpoint to load data from the S3 bucket to the Neptune cluster.

Security groups

Security groups are attached to ENIs, not subnets. Even when AWS Transform ENIs and Neptune reside in the same subnet, separate security groups control access between them.

A security group (attached to ENIs created by AWS Transform) that allows:
- Outbound TCP 8182 to the Neptune security group
- Outbound TCP 443 to reach VPC endpoints
A security group (attached to the Neptune cluster) that allows:
- Inbound TCP 8182 from the application security group
A VPC endpoint security group (attached to the interface endpoint ENIs) that allows:
- Inbound TCP 443 from the application security group

IAM

An IAM role with a trust policy for rds.amazonaws.com that grants s3:GetObject, s3:ListBucket, and s3:GetBucketLocation on your S3 bucket
This role must be associated with the Neptune cluster for bulk loading knowledge graph data

The following diagram shows the recommended architecture.

Architecture diagram showing the customer VPC with Neptune subnet, application subnet, AWS PrivateLink connections, and AWS service integrations for the mainframe reimagine connector.

S3 connector

For jobs with a custom plan, AWS Transform can use an S3 connector. The S3 connector uses an S3 bucket to access and store transformation resources, and an S3 vector bucket for indexed output.

Important

Your data is stored and persisted in the AWS Transform artifact store in your workspace and is used only for running the job.

Optional configuration of S3 vector bucket.

In Regions where S3 vector buckets are available, AWS Transform will store searchable vector encodings of job output in this S3 vector bucket in your account to provide an AI powered search and chat experience. Data is not used outside of this job, and is not used to train models. To enable this, you must create and provide S3 vector bucket. AWS Transform automatically creates and attaches a role with required permissions to write to this bucket.

S3 bucket CORS permissions

After your connector is configured, add the following CORS policy to your S3 bucket so you can view and compare artifacts directly in the AWS Transform console. If this policy is not set up correctly, you might not be able to use the inline viewing or file comparison functionalities of AWS Transform.


[
    {
        "AllowedHeaders": [],
        "AllowedMethods": [
            "GET"
        ],
        "AllowedOrigins": [
              "https://*.transform.eu-central-1.on.aws",
              "https://*.transform.ap-south-1.on.aws",
              "https://*.transform.ap-northeast-1.on.aws",
              "https://*.transform.ap-northeast-2.on.aws",
              "https://*.transform.ap-southeast-2.on.aws",
              "https://*.transform.ca-central-1.on.aws",
              "https://*.transform.eu-west-2.on.aws",
              "https://*.transform.us-east-1.on.aws",
              "https://*.transform.sa-east-1.on.aws"
        ],
        "ExposeHeaders": [],
        "MaxAgeSeconds": 0
    }
]

Assess and reimagine job plan

The Assess and reimagine job plan is a predefined workflow for modernizing mainframe applications. It guides you through two phases: assessing your codebase to identify business functions, and reimagining the functions you select. To choose individual capabilities instead, use the Custom job plan.

Assess

Mainframe modernization journeys typically start with evaluating the codebase to identify contents and understand relationships and dependencies. After this evaluation, you can start to decompose and select portions of the source code to modernize. The boundaries of decomposition are determined based on business functions.

The business function catalog decomposes your codebase into discrete business functions, which are built on a foundation of deterministic data paths that map each business function from start to finish. A data path is the route that data takes, starting with a business trigger through to the final write within the mainframe code. Rather than limiting analysis to a single entry point, the system spans batch jobs, CICS transactions, and their shared data stores to identify coherent units of business work.

A business function is a body of work that starts from a business trigger and ends at a measurable business outcome. AWS Transform provides a catalog of business functions that a line of business can act on, enabling informed decisions about what to modernize first.

Benefits of assessing your codebase to modernize using business functions:

Full-estate visibility – Understand how your batch jobs, online transactions, and data stores connect into meaningful functions for your business.
Actionable boundaries – Each identified business function is a self-contained unit that your line-of-business stakeholders can review and prioritize for modernization effort.
Consistency from code to specification – The same code blocks that define a business function are used to generate its specifications, with no translation layer and no reconciliation required.
Reduced discovery time – Automated detection replaces months of manual tribal-knowledge interviews with systematic, repeatable analysis.

Results of assessment provided through the chat

The results of assessment provided through the chat include a list of business functions, including the number of data paths, what each function spans in terms of batch and CICS transactions, and a business description. You are provided with a link to the artifacts generated in the Amazon S3 bucket, and you can also access them through the chat. Two artifacts are provided: business function details and business function summary.

Business function summary – Provides a list of the business functions and a natural-language description of what functions the elements within the code are performing.
Business function details – Provides an interactive graph that depicts the relationships across the source code.

To interact with the graph, select an element to show additional details about it, or open an element to interrogate its details. Within each page, there is a summary of all elements at that level and a graphical interface to interact with and learn more about each component, including the following:

Business function – Depicts which business functions are connected to each other and provides an overview of each business function. When you select a business function, the following information is provided:
- Number of data paths
- Number of lines of code
- Business description: an overview of the business function that describes, in natural language, what functions the elements within the code are performing.
- Interfaces: details how data elements within the business function relate (for example, exchange or write).
Business function details – Depicts the connections across the data paths within a single business function, allowing you to dive deeper into the composition of the business function. When you select an element, you see how that element connects across the business function, along with the following details:
- Datastore: writers and readers for the datastore.
- Other elements: an overview of the related data path that describes, in natural language, the actions and outcomes handled within the data path, a list of programs, and readers and writers.
Data paths – Depicts the details of one data path and how the elements within the data path relate to each other.
- Description: an overview of the data path that describes, in natural language, the actions and outcomes handled within the data path.
- Contents of the data path:
  - Entry point
  - Reads
  - Writes
  - Programs

Through the chat, you can interrogate the outputs to understand the elements contained within each business function, and explore the boundaries through the business function graph.

After you identify the boundaries of your modernization, the chat prompts you to select the business functions that you want to reimagine. You can select a single business function or multiple business functions. AWS Transform sends the selected boundaries and the required assessment outputs to the reimagine flow to continue the modernization journey.

After you complete your first modernization, you can use the chat to select an additional set of business functions to modernize.

Reimagine

In the reimagine phase, you can select one or all of the business functions that assessment identified and prepare them for modernization. This triggers business logic extraction followed by requirement generation for the selected business functions. The generated requirements are the primary input for forward engineering, translating legacy system understanding into a precise specification of what the modernized application must deliver.

Extract business logic

After you select one or more business functions, AWS Transform begins generating business logic for the selected business functions. After extraction is complete, AWS Transform stores the results in an Amazon S3 bucket in JSON format for downstream use. You can monitor the progress of the extraction and review any issues directly within the AWS Transform console.

To review business logic extraction results

In the left navigation pane, choose the Extract business logic step to expand it.
View the following from the step details:
- S3 bucket link: The location where the extracted business logic results are stored in JSON format.
- Issues: A list of any issues encountered during the business logic extraction process.

If any files encountered issues during extraction, the following details are displayed for each affected file:

File name: The name of the file that encountered an issue.
File type: The type of the file (for example, COBOL or JCL).
File path: The location of the file within your application.
Status: The current status of the file's extraction.
Details: A description of the issue encountered.

Note

After the extraction step completes, AWS Transform automatically begins generating requirements. You are not prompted for any additional inputs before this process starts.

Generate requirements

After business logic extraction completes, the Generate Requirements agent consumes extracted artifacts (code analysis, data analysis, business rules) and produces modernization requirements. These requirements are formal functional specifications that are technology-agnostic and include testable acceptance criteria. AWS Transform produces one requirements.md file for each business function you select and stores the output in an Amazon S3 bucket. To view the Amazon S3 location of the output, choose the Generate requirements step.

Each requirements.md file is organized into the following parts:

Title – Names the business function and the workflow it represents.
Global preconditions – Conditions that must hold true across the entire workflow before and during processing. Configurable values appear as parameter placeholders.
Numbered workflow sections – Each section represents a discrete stage of the business function. Every section contains the following:
- A user story that states the goal as a role, the action that role wants to take, and the resulting outcome.
- A list of Requirements in EARS (Easy Approach to Requirements Syntax) format, each with a unique identifier.

Note

Selecting a second set of business functions restarts the reimagine step for the newly selected functions. The console then shows results only for the new selection, so any issues with the business rules from your previous business functions, and the Amazon S3 links that store the outputs for the business logic and requirements, are no longer displayed. Your earlier outputs are not lost. You can still access them from the Artifacts tab or in your Amazon S3 bucket.

Traceability between transformation artifacts

AWS Transform maintains traceability between the artifacts it generates throughout the assess and reimagine workflow. Traceability lets you trace each requirement to your extracted business rules, or directly to the original source code that produced the requirement.

You can view all the traceability details for each business function using the traceability.yaml file. It describes how each business rule maps to a requirement.

For interactive traceability, you can use the IDE plugin in VS Code and Open VSX compatible editors. For more information, see Developer tools.

Reimagine job plan

For applications that you have scoped for modernization and are ready to reimagine, you can run a standalone reimagine job. The job analyzes your code and data, extracts business logic, identifies business domains, and generates modernization requirements for each business domain.

Custom job plan

A custom job plan lets you build your own modernization workflow by selecting the capabilities you want to run. It gives you full control over which capabilities are included. Choose from the following capabilities based on your modernization objective. Some capabilities depend on others being run first. For example, code analysis is required for most other capabilities.

Analyze code

After you share the Amazon S3 bucket path with AWS Transform, it will analyze the code for each file with details such as file name, file type, lines of code, and their paths.

Note

You can download the Analyze code results through the artifacts tab at the job or workspace level. At the job level go to the ‘Artifacts’ option in the left navigation menu, and open the ‘results’ folder, or at the workspace level find the name of the job, and open the ‘results’ folder. This will download a zip file that contains the classification file for manual classification workflow, assets list, dependencies JSON file, and list of missing files.

In the job plan select Analyze code in the left navigation pane to view your results. You can view your code analysis results in several ways:

List view – All files in the Amazon S3 bucket you want to transform for mainframe
File type view – All files in the Amazon S3 bucket displayed per file type. For a list of supported file types, see Supported files.
Folder view – All files in the Amazon S3 bucket displayed in folder structure.

Within the file results, AWS Transform provides the following information depending on what file view you choose:

Name
File type
Total lines of code
File path
Comment lines
Empty lines
Effective lines of code
Number of files
Cyclomatic Complexity - Cyclomatic complexity represents the number of linearly independent paths through a program’s source code. AWS Transform will show a cyclomatic complexity for each of the files.

Missing files– Missing files from the mainframe modernization code analysis. These files ideally, should be added as a part of the source input in Amazon S3 bucket, and the analysis step should be re-run for better and cohesive results.

Identically named – AWS Transform gives you a list of files that share the same name, and possibly the same characteristics (e.g., number of lines of code). It will not have the ability to compare the difference between the contents of any two files at one time.

Duplicated IDs – With Cobol program, the Program ID field serves as the unique identifier of the file. This ID must be unique because it’s used to call the program throughout your project. However, some projects might have COBOL files with different names but the same Program ID. Getting the list of those files during the assessment can help understand the dependencies among all programs.

Note

This is specific to COBOL code and files.

When you have programs with duplicated IDs, it’s suggested to change the Program IDs of these files to have a unique identifier for each of these in the COBOL code. You can then re-run your job to get more accurate and comprehensive code analysis results.

By resolving duplicate Program IDs, you can:

Improve code clarity and maintainability
Reduce potential conflicts in program calls
Enhance the accuracy of dependency mapping
Simplify future modernization efforts

Codebase issues – Potential issues detected within the codebase that you should resolved before continuing with the modernization project. These issues could include missing references with associated statements or unsupported links in the code.

Update classification – With manual reclassification, you can reclassify files using the bulk update feature by uploading the JSON file with the new classification.

Important

This is only available for UNKNOWN and TXT files.

After reclassification, AWS Transform will:

Updates the classification results
Re-runs dependency analysis with the new file types
Refreshes all affected analysis results

Note

You can reclassify files only after the initial analysis loop completes.

Inline viewer and file comparison

The Inline viewer is a feature in the AWS Transform for mainframe capabilities that provides two key visualization capabilities:

File view: View content of selected legacy files from jobs
File comparison: Compare content of two legacy files side-by-side

Input file viewing

To view your files in the Analyze code step

Under View code analysis results, select a file using the check box in the list.

Choose the View action button (enabled when 1 item is selected).

File content will be rendered on screen in the File View component.

File comparison

To compare files in the Analyze code step

Under View code analysis results, select two files using the check boxes in the list.
Choose the Compare action button (enabled only when 2 items are selected).
Files will be displayed side-by-side in the File comparison component.

Note

You can't select more than two files to compare files.

Important

If you're having issues with inline viewer or file comparison make sure that the S3 bucket is set up correctly. For more information on S3 bucket's CORS policy, see S3 bucket CORS permissions.

Data analysis

AWS Transform provides data analysis to help understand the impact of data relationships and elements to the mainframe modernization project. The two outputs provided are:

Data lineage: Traces the lifecycle of data by mapping relationships between data sources, jobs, and programs
Data dictionary: Serves as a repository documenting the structural metadata of legacy data elements

On completion of the analysis, there will be two tabs present for each output, and then multiple views available described below for both data lineage and dictionary.

Note

When you request data anlysis. AWS Transform performs code analysis, which is required in order to perform data analysis.

Data lineage

AWS Transform provides multiple views based on the data relationships that need to be understood across the code base being modernized. The four table views available in data lineage include:

Data sets: This view provides a comprehensive impact analysis, including operation tracking to help distinguish between read, write, update, and delete
Db2 tables: Provide the impact analysis related to Db2 tables and the operations
Program-to-data: Identify which COBOL programs reference each dataset
JCL-to-data relationships: Identify which JCL scripts reference each dataset

Summary provides an overview of data sources, and their relationships to programs and JCLs, as well as summary information about how the data sources are leveraged and total operations found in the codebase.

Data dictionary

Understanding the data elements within the data sources present in the codebase is the next step to realize how the various data sources are dependent. Data dictionary is a data catalog providing field level metadata with business language descriptions for accurate transformation mapping.

COBOL data structure: Provides field information across COBOL copybooks and INLINE references present in the codebase, including field properties and business definition
Db2 tables: Provides column and table properties across the Db2 tables present in the codebase, including primary and foreign keys, schema and index information, and data types

Relationship between data lineage and dictionary is available by selecting the data source, and then utilizing the data lineage or data dictionary button in order to dive deeper on the relationship between the data source and the data elements. The navigation between data lineage and dictionary, provides integrated data visibility with data lineage providing the "what" - structure and meaning, alongside the "where" - usage and relationship.

Activity metrics analysis

Activity metrics analysis lets you analyze System Management Facility (SMF) records of type 14, 15, 30, 64, 102, and 110. The analysis provides insights into how elements are used within your mainframe application. It can help with retiring unused code or with decisions about the target architecture for the modernized application. If you include activity metrics analysis in your job, completing the code analysis step first provides richer SMF outputs, although it is not required.

Note

When you negotiate the job plan for mainframe modernization, completing the code analysis and decomposition steps first provides richer outputs.

Onboard SMF records for analysis

You must provide the SMF record location within your Amazon S3 bucket as the initial step for SMF analysis. Provide a minimum of 13 months of records to capture annual occurrences that shorter time frames might miss. Although 13 months is recommended, any time frame with detectable SMF records produces analysis results.

Your SMF extract must meet these format requirements:

Include types 14, 15, 30 (sub-type 5), 64, 102, and 110.
Use a raw binary file in EBCDIC format.
Include RDW bytes.

Provide a link to the SMF records within your Amazon S3 bucket, ensuring the records are in a folder separate from your source code. Format options include .zip (up to 600 MB) or .tar.gz (up to 5 GB) compressed. If you provide a link to a folder where SMF records are not detected, you receive an error message and no analysis is completed.

Analysis output

The analysis output contains two components, depending on the type of record: a tabular view and a .csv output (available in your Amazon S3 bucket). Within AWS Transform, outputs are displayed for type 30 and 110. Through the artifacts in your Amazon S3 bucket, you can explore the analysis on other types of records. In the user interface, the header displays the time range of your provided records so that you can identify any missing key dates. For example, if your records span May 1 to October 31 but exclude the busy day after Thanksgiving, you can see that key records are missing. Timestamps in the web application reflect your system's time zone and the SMF records.

Artifacts available in Amazon S3 provide analysis for any record type found in the records provided.

Tabular view

The tabular view, available only for SMF 30 and 110, contains up to three main components for batch jobs and CICS transactions:

Summary – Provides key jobs and transactions.
Batch job/CICS transactions analysis – Provides aggregated analysis across jobs and transactions.
Code analysis comparison (batch only) – Provides comparison data when the code analysis step runs prior to SMF analysis.

The discovery summary provides three groups of jobs and transactions that help you quickly identify items for deeper analysis.

Batch job and CICS transaction analysis provides aggregated analysis across jobs and transactions. Some columns in the analysis results are hidden by default. Choose the gear icon to display additional fields.

The transaction key generates unique data aggregation for CICS transactions, combining four fields into a single key value:

Transaction ID
Program name
SysPlex ID
SysID

You can search by the complete key or by any component of the key in the analysis output.

Code analysis comparison highlights jobs found in either the SMF records or the analyze code step, but not present in both. This shows jobs that have not run during your SMF record time frame or were not present in the analyze code step. This output is available only when you run the code analysis step before SMF analysis in your job.

Best practices

To get comprehensive outputs for batch jobs (type 30), ensure that your JCL file name matches the job name to get meaningful outputs in the code analysis comparison.

Generate technical documentation

You can generate technical documentation for your mainframe applications undergoing modernization. By analyzing your code, AWS Transform can automatically create detailed documentation of your application programs, including descriptions of the program logic, flows, integrations, and dependencies present in your legacy systems. This documentation capability helps bridge the knowledge gap, enabling you to make informed decisions as you transition your applications to modern cloud architectures.

Note

When you request generation of technical documentation, AWS Transform performs code analysis, including code dependency analysis, which are required in order to generate the documentation.

To generate technical documentation

In the left navigation pane, under Generate technical documentation, choose Select files and configure settings.
Select the files in the Amazon S3 bucket that you want to generate documentation for, and configure the settings in the Collaboration tab.

Note
Selected files should have the same encoding type (that is, all in the same CCSID - UTF8 or ASCII). Otherwise, generated technical documentation might have empty fields or sections.
Choose the documentation detail level:
- Summary – Provides a high-level overview of each file in the scope. Also, gives a one-line summary of each file.
- Detailed functional specification – Provides comprehensive details for each file in the mainframe application transformation scope. Some details include logic and flow, dependencies, input and output processing, and various transaction details.
Note
Documentation can be generated only for COBOL and JCL files.
Choose Continue.
Once AWS Transform generates documentation, review the documentation results by following the Amazon S3 bucket path in the console, where the results are generated and stored.
Once the documentation is generated, you can also use AWS Transform chat to ask questions about the generated documentation and decide the next steps.

Add user information into the documentation


ARTIFACT_ID.zip
└── app/
    ├── File1.CBL
    ├── File2.JCL
    ├── subFolder/
    │   └ File3.CBL
└── glossary.csv
└── pdf_config.json     
├── header-logo.png
├── footer-logo.png    
└ ...

Optional files can be added in the zip file to help improve the generated documentation quality and provide customized PDF cover page. Some of these can be:

glossary.csv file: You can choose to provide and upload an optional glossary in the zip file in the S3 bucket. The glossary is in CSV format. This glossary helps create documentation with relevant descriptions in line with the customer vocabulary. A sample glossary.csv file looks like:
```
LOL,Laugh out loud
ASAP,As soon as possible
WIP,Work in progress
SWOT,"Strengths, Weaknesses, Opportunities and Threats"
```
pdf_config.json: You can leverage this optional configuration file to generate PDF documents which align with their company’s formats and standards, including headers, footers, logos, and customized information. A sample pdf_config.json looks like:
```
{
    "header": {
    "text": "Acme Corporation Documentation",
    "logo": "header-logo.png"
  },
  "customSection": {
    "variables": [
      {
        "key": "business Unit",
        "value": "XYZ"
      },
      {
        "key": "application Name",
        "value": "ABC"
      },
      {
        "key": "xxxxxxxxxx",
        "value": "yyyyyyyyyyyy"
      },
      {
        "key": "urls",
        "value": [
          {
            "text": "Product Intranet Site",
            "url": "https://example.com/intranet"
          },
          {
            "text": "Compliance Policies",
            "url": "https://example.com/policies"
          }
        ]
      }
    ]
  },
  "footer": {
    "text": "This document is intended for internal use only. Do not distribute without permission.",
    "logo": "footer-logo.png",
    "pageNumber": true
  }
}
```
- Header:
  - For the cover page PDF file, the default text will be the project name.
  - For each program PDF file, the default text will be the program name.
  - There is no default logo. If a header logo is not configured, no logo will be displayed.
  - The font size and logo size shall be dynamically changed based on the number of words or logo file size.
- Custom section:
  - If the custom section is not configured, it will be omitted from the PDF.
  - The link has to be click able.
- Footer:
  - There is no default text or logo for the footer.
  - The page number will be displayed in the footer by default, unless explicitly configured otherwise.
  - The font size and logo size shall be dynamically changed based on the number of words or logo file size.

Generate documentation inline viewer

You can view the PDF files in the generate technical documentation step.

To view the PDF files

Navigate to the Review documentation results tab.
Locate the PDF in the table listing generated PDFs.
Either select the rile and then select View or select the link element overlaid on the file name.

The PDF opens in AWS Transform, with the option to expand the screen in the top right.

Note

AWS Transform also gives you the ability to download either an XML of PDF version of the generated technical documentation.

Important

If you're having issues with documentation inline viewer, make sure that the S3 bucket is set up correctly. For more information on S3 bucket's CORS policy, see S3 bucket CORS permissions.

Extract business logic

You can extract essential business logic from your mainframe applications that are undergoing modernization. AWS Transform automatically analyzes your code to identify and document critical business elements, including detailed process flows, and business logic that is embedded in your applications. This capability serves multiple stakeholders in your modernization journey. Business analysts can leverage extracted logic to create precise business requirements and identify gaps or inconsistencies in current implementations. Developers gain the ability to quickly comprehend complex legacy system functionality without extensive mainframe expertise.

Note

When you request business logic extraction. AWS Transform performs code analysis, including code dependency and entry point analysis, which are required in order to perform business logic extraction.

To extract business logic

In the left navigation pane, under Extract business logic, choose Configure settings.
In the Collaboration tab select how you want to extract business logic:
- Application level: Generates business documents for all business functions, transactions, batch jobs, and files. This selects all of the files in the application.
- File level: Generates business documents only for files you select from the file table.
Note
- For either option you can select Include detailed functional specification so that AWS Transform includes control flow and comprehensive business rules for the selected files.
- Selected files should have the same encoding type (that is, all in the same CCSID - UTF8 or ASCII). Otherwise, generated documentation might have empty fields or sections.
- Documentation can be generated only for COBOL and JCL files.
- For application level, programs used by CICS transactions and batch jobs are grouped together, while all other programs are categorized as Unassigned.
Choose Continue.
Once AWS Transform extracts business logic it stores the results in an Amazon S3 bucket in JSON format so that you can view them online.

Note

The number of generated business rule files might be larger than your initial selection. Some selected files may trigger business rule extraction to include additional dependent files, which will also appear in the results table.

View the extracted business documentation inline

You can view the business logic in the Extract business rule step. To do this,

Navigate to Review business logic extraction results.
Select the document you want to view from the table, and then click the View result button.

The business documentation page opens in a new browser tab.

Decomposition

You can decompose your code into domains that account for dependencies between programs and components. This helps the related files and programs to be grouped appropriately within the same domain. It also helps maintain the integrity of the application logic during the decomposition process.

Note

When you request decomposition, AWS Transform performs code analysis, including code dependency analysis, which are required in order to perform decomposition. We also recommend that you perform business logic extraction before decomposition for better results.

To get started with decomposing your application:

Select Decompose code from the left navigation pane.

Note
One domain, Unassigned, is automatically created for all files not associated with a domain. On initial navigation, all files should be associated to Unassigned, unless domains were proposed from application level business logic extraction.
Create a new domain through the Actions menu, and then choose Create domain.
In Create domain, provide domain name, optional description, and mark some files as seeds.
- CICS configured files (CSD) and scheduler configured files (SCL) can be used for automatic seed detection.
- You can also set one domain only as a common component. The files in this domain are common to multiple domains.
Choose Create.

Note
You can create multiple domains with different files as seeds.
After confirming all domains and seeds, choose Decompose.
AWS Transform will check the source code files and then decompose into domains with programs and data sets with similar use cases and high programming dependencies.

AWS Transform gives you a tabular and graph view of decomposed domains as dependencies. Graph view has three options:
- Domain view – Can view how different domains are related to each other in visual format.
- Dependency view – Can view all files in each domain as a complex dependency graph. If a node that was added to a domain didn't receive information from a seed in the same domain, then this node will either be predicted into unassigned (node didn't receive any information), disconnected (in a sub graph that didn't receive seed information) or into another domain (node received information from at least that domain).
- Subgraph view - User can create subgraphs to visualize a subset of nodes to more clearly understand the relational impact and boundaries of that collection of nodes.
  - To create a subgraph select a group of nodes and select the Extract subgraph option from the tool bar
  - Merging subgraphs is available, when merging a new third subgraph is created in addition to the two subgraphs you are merging.
Note
Repeat these steps to add more domains or to reconfigure your already created domains with a different set of seeds if you don't like current domain structure.
When completed, choose Continue.

Seeds

Seeds are the foundational inputs for the decompose code phase. Each component or file (e.g., JCL, COBOL, Db2 tables, CSD, and scheduler files) can be assigned as a seed to only one domain, ensuring clear boundaries and alignment during the decomposition process.

The identification of the seeds depends on the structure of the application or portfolio. In the case of a typical mainframe legacy application, seeds can often be determined by adhering to established naming conventions, batch-level grouping in the scheduler, and transaction-level grouping defined in the CICS system. Additionally, database tables can also serve as seeds, providing another layer of structure for decomposition.

Import and/or update dependencies files

During decomposition, you can upload a JSON file for the dependencies that replaces the existing files generated by the dependencies analysis AWS Transform performs.

Export dependencies function allows you to download the dependencies json file generated in the decomposition step. After downloading, you can modify the file per your requirement. Then, you can import dependencies using the AWS Transform’s upload functionality which allows you to upload the new JSON file of the dependencies that replaces the file generated by the dependencies analysis. After that, the graph in the decomposition step will be updated.

To export, modify, and import dependencies

On the View decomposition results page, choose Actions.
In the dropdown list, choose Update dependencies file option under Other actions.
In the Update dependencies file modal,
1. Download the dependency file AWS Transform created from the existing analysis results.
2. In the downloaded file, modify the dependencies based on what you want to achieve.
3. After modifying, save and upload this file using the Upload dependency file button.
  
  Note
  The only accepted file format is JSON file.
Next, choose Import.

AWS Transform will import the dependency file and create a new dependencies graph based on your input.

Import and/or Update Domains

For customers that have domains, seed, and/or file relationships mapped prior to the decomposition step, you can upload this domain definition through the Import domains file function available through the Actions menu. Some examples of when this function may be utilized:

Bring forward decomposition from another job
Subject matter experts providing this mapping

Once the domains file has been imported, the user can either run decomposition against the domain definition, or if satisfied can Save and then Submit the domain definition.

Parent/child/neighbor files

In a dependencies graph, programs relate to each other through different types of connections. Understanding these relationships helps you analyze program dependencies during transformation of your mainframe applications. It also helps with understanding the boundaries of a domain. For example, if you select a domain, and then select parent one level, it will show you the connected nodes.

Parent relationships – A parent file calls or controls other programs. Parents sit above their dependent programs in the hierarchy. You can select parent at one level or at all levels.

Children relationships – A child file is called or controlled by the parent program. Children sit below their parent in the file hierarchy.

Neighbor relationships – Neighbors are files at the same hierarchical level. They share the same parent program and might interact with each other directly.

Reforge code

Reforge uses Large Language Models (LLMs) to improve the quality of refactored code. The initial COBOL-to-Java transformation preserves functional equivalence while retaining COBOL-influenced data structures and variable names from the legacy system. Reforge restructures this code to follow modern Java practices and idioms, replacing COBOL-style constructs with native Java collections and naming conventions. This makes the code more readable and maintainable for Java developers.

Note

Quotas for reforge are:

3,000,000 lines of code per job
50,000,000 lines of code per user per month

Reforge your code after refactoring by following these steps:

Choose Reforge java code in the left navigation pane and then select Configure code reforge.
Provide the S3 location to your zipped buildable source project and choose Continue. Use this zip structure:
```
input.zip
        └── PROJECT-pom
        ├── PROJECT-entities
        ├── PROJECT-service
        ├── PROJECT-tools
        ├── PROJECT-web (optional)
        └── pom.xml
```
AWS Transform analyzes your zip package to locate files within the PROJECT-service directory so that it can provide a selectable list of classes that you can reforge. These classes have the suffix ProcessImpl.java.
Complete the Select classes to reforge page and choose Continue. Track the reforge status on the Worklog tab.
View the results of your completed reforge on the View results page, which displays the reforge status per class. It also specifies where to find the Reforge result in your S3 bucket.

Once AWS Transform gets this input from you it gives you a downloadable file with the Reforge results.

This is the zip structure resulting from a successful reforge:


reforge.zip
└── maven_project
├── reforge.log
└──tokenizer_map.json

maven_project contains the reforged source code.
- Files that have been refactored but whose compilation was not successfully finalized are located at /src/main/resources/reforge/originalClassName.java.incomplete and are named originalClassName.java.incomplete. Compare these to the original versions of the files to choose reforged functions you want to save.
- Source files provided to AWS Transform that were refactored successfully are backed up to src/main/resources/reforge/originalClassName.java.original and are named originalClassName.java.original. The refactored versions of the files replace the source files provided to AWS Transform.
Note
The originalClassName.java files are replaced with the reforged files only if the reforging process is successful. Otherwise, they retain the original content.
reforge.log contains logs that you can use to diagnose job failures or provide to AWS support in case of an issue.
tokenizer_map.json contains a mapping of token IDs to your data, such as file paths and class/method names, that are tokenized in the logs for privacy protection. You can provide this file to AWS support in case of an issue.

Plan your modernized applications testing

You can create and manage test plans for your mainframe modernized applications based on extracted code attributes, job complexity, and scheduler paths. AWS Transform helps prioritize which jobs to test and identifies the specific artifacts needed for each test case. The test planning process is divided into three main phases: configuration, scoping, and review.

To create a test plan

Configure test plan settings
1. In the left navigation pane, under Plan your testing, choose Configure settings.
2. (Optional) Provide S3 paths for your Business Logic Extraction (BLE). These artifacts enhance test plan quality. Without BLE artifacts some fields in the test plan may remain incomplete.
Define test plan scope
1. Select entry points, such as batch jobs, to include in your test plan.
2. Filter and sort jobs based on multiple attributes:
  - Business functions (extracted from BRE BLE)
  - Domains (from decomposition phase)
  - File paths and locations
  - Custom search criteria
3. Select individual jobs or entire groups for testing.
4. Review job relationships and dependencies.
Review and adjust the test plan

The generated test plan provides comprehensive information including:
- Preferred execution order based on dependencies
- Job group assignments from scheduler
- Complexity scores, which are aggregate scores for test cases
- Business domain associations
- Cyclomatic complexity metrics
- Dataset and table dependencies
- Lines of code metrics
- Business function mappings

Test plan customization options

Your test plan can be customized to address specific needs. For example, you can:

Create new test cases by selecting multiple entry points
Merge existing test cases to combine related functionality
Split test cases for more granular testing
Remove unnecessary test cases
Add or remove entry points for existing test cases
Modify test case descriptions and attributes
Adjust execution order

Detailed test case information

Each test case provides details that describe its content and the related dataset or datafiles:

Comprehensive description of test scope
Complete list of entry points included
Aggregate metrics showing complexity and size
Business rules and automated test case guidance
Dataset and table dependencies with direction (input/output)
Interactive dependency graph visualization
Execution prerequisites and requirements

Data management features

These details help you understand the data related to the test case. You can:

Filter data sets by input/output direction
Identify required artifacts for test execution
Track data dependencies between test cases
Monitor data set usage across test plans

Note

AWS Transform automatically analyzes scheduler dependencies and assigns complexity scores to help prioritize testing efforts. Higher complexity scores indicate jobs that may require more thorough testing or isolation during the testing process.

Business rules and test case guidance

The AWS Transform test plan provides recommendations for your test case plan based on Business Rule Extraction:

Automatic processing of business rules using LLM
Generation of synthetic test cases
Testing guidance for specific business scenarios
Traceability between rules and test cases

The final test plan is stored in the specified S3 location and includes all of the necessary information to execute your testing strategy effectively. You can export the test plan for integration with other testing tools or documentation systems.

Note

While synthetic test case guidance is provided, the actual test artifacts must be created separately based on the guidance. The test plan serves as a comprehensive blueprint for your testing strategy but does not generate the test data or execution scripts.

Test case creation rules - Summary

General rules

A test case contains 1 to many JCLs in schedule execution order
Test cases are created from valid supported schedulers (CA7 and Control-M)
The test executes JCLs, not the scheduler itself or scheduled tasks
A scheduled task is unique and executes only one JCL
One JCL can be executed by multiple scheduled tasks
One JCL can exist without being in a schedule

Default test case creation rules

If a single JCL is in a test case:
- JCL is not involved with a schedule
- JCL is executed by a scheduled task that diverges into multiple branches
- A branch in the schedule contains only this JCL
If multiple JCL in a test case: represents a linear execution path sequence within a schedule branch
If JCL is on a divergent scheduled task: JCL becomes its own separate test case
If JCL is on a convergent scheduled task: JCL can start a new test case but won't be included in previous branches
Missing JCLs are skipped and execution continues (with planned future enhancement to cut test case at missing JCL point)

User operations rules

Create Test Case: user selects from available JCLs
- Succeeds if JCLs exist in schedule execution branch sequence
- Follows schedule execution order
Add a JCL to Test Case: user selects from available JCLs
- Succeeds if JCL can exist in the test case's schedule execution branch/path
Remove JCL from Test Case: user can remove any JCL from a test case
- Allowed even if it causes execution path gaps
Merge Test Cases: user selects two test cases to combine
- Succeeds if JCLs can exist together in same schedule execution branch
- Maintains schedule execution order
Split Test Case: user selects one JCL in a test case for split
- Creates new test case from split point forward
- Original test case modified to exclude JCLs past split point
Delete Test Case: user can delete any created test case

Note

You cannot create, add, or merge test cases from different scheduler branches/path. A future enhancement is planned to allow operations beyond divergent/convergent tasks in the scheduler.

Generate test data collection scripts

You can generate JCL scripts to collect test data from your mainframe systems based on the test plan created in the previous step. AWS Transform automatically creates data collection scripts for datasets, database tables, and sequential files needed for comprehensive testing. The data collection process is divided into four main phases: input configuration, test case selection, script configuration, and script generation.

To generate test data collection scripts

Provide test plan input
1. In the left navigation pane, under Test data collection, choose Provide test plan input.
2. Specify the S3 path to your test plan JSON file from Plan your modernized applications testing.
3. The input field is pre-populated if the test plan was generated in a previous job step.
4. You can also select test plan from other jobs by specifying the appropriate S3 location.
Select test cases for data collection
1. Review the complete list of test cases from your test plan.
2. Filter and sort test cases based on multiple attributes:
  - Business functions and domains
  - Database table dependencies
  - Data set requirements
  - Complexity metrics
  - Custom search criteria
3. Select individual test cases or use bulk selection options.
4. Review test case details including entry points, metrics, and business rules by clicking on one Test Case to see details.
Configure data collection scripts
1. Download sample templates and configuration files for reference.
  - AWS Transform provides sample templates for Db2 database unloads, VSAM file REPRO and sequential dataset processing to be used as guidance on the kinds of template expected by the process.
  - Standards may vary from site to site so the expectation is that customers will modify or replace these templates that conform to their own standards.
  - These modified templates need to be uploaded to a S3 bucket where the test data collection can process them.
2. Provide variable configuration file (JSON format) containing:
  - User prefixes and environment-specific constants
  - Database configuration parameters
  - Destination endpoint settings and data transfer parameters
  - Other required parameters defined by user and to be used in JCL templates
3. Upload JCL templates for different data collection methods:
  - Db2 template: For database table unloading (customize for BMC, IBM DSN, or other unload utilities)
  - VSAM template: For VSAM file processing (typically uses REPRO utility)
  - Sequential datasets template: For processing sequential datasets, partitioned datasets, GDGs etc.
Review and manage generated scripts

The generated scripts provide comprehensive data collection capabilities including:
- Separate scripts for "before" and "after" test execution data collection
- Organized script structure by test case and data type
- Scripts are automatically stored into S3 bucket for easy access and transfer
- Generated JCL scripts are ready for mainframe execution
- Variable substitution is based on user configuration defined in the templates

Script generation features

Generated scripts are automatically customized based on your templates and configuration:

Template-based generation: Uses your provided JCL templates with variable substitution
Environment: Incorporates your specific mainframe configuration
Data type handling: Creates appropriate scripts for sequential datasets, VSAM files, and database tables
Collection for test cases: Generates both "before" and "after" data collection scripts
Sequential dataset processing: AWS provided sample provides for a file transfer function, but this can be customized to compression utilities available at your site or utilities such as Connect Direct or managed file transfer etc.

Data collection strategy

The generated scripts support comprehensive data collection strategies:

Sequential dataset collection: REPRO and copy utilities for VSAM and flat files
Database table unloading: Customizable Db2 unload processes
Sequential dataset processing: Customizable post processing of sequential datasets such as compression, managed file transfer services etc.
Dependency management: Coordinated collection based on test case definition

Note

AWS Transform generates scripts based on your templates and configuration. Review all generated JCL before executing on your mainframe environment to ensure compatibility with your specific system configuration and security requirements.

Template customization and best practices

ATX Test Data Collection provides flexible template customization capabilities:

Multi-utility support: Adapt templates for different mainframe utilities (BMC, IBM, DSN)
Variable-driven configuration: Use constants for environment-specific parameters
Reusable templates: Create standardized templates for consistent script generation
Data handling: Incorporate organization-specific data handling requirements
Security integration: Include appropriate security and access controls
Performance optimization: Configure for efficient data collection and transfer

Generated output structure

The generated scripts are organized in your S3 bucket with the following structure:

Test case organization: Scripts grouped by associated test cases
Collection timing: Separate folders for "before" and "after" data collection
Data type classification: Scripts organized by sequential datasets, database tables, and transfers
Metadata files: Summary information and execution guidance
Ready-to-transfer format: JCL formatted for direct mainframe deployment

The final script collection is stored in the specified S3 location and includes all necessary JCL to execute your data collection strategy effectively. You can download the scripts for transfer to your mainframe environment or integrate with automated deployment processes.

Note

While comprehensive JCL scripts are generated, actual execution must be performed on your mainframe environment. The scripts serve as ready-to-use data collection tools but require appropriate mainframe access and execution permissions.

Test automation script generation

You can generate test automation scripts to execute test cases on your modernized application based on the test plan created in the previous step. AWS Transform automatically creates comprehensive test scripts that utilize the data collected from the test data collection process. The test automation script generation process consists of three main phases: input configuration, test case selection, and script generation results.

To generate test scripts

Provide test plan input
1. In the left navigation pane, under Test automation script generation, choose Provide test plan input.
2. Specify the S3 path to your test plan JSON file from Plan your modernized applications testing.
3. The input field is pre-populated if the test plan was generated in a previous job step.
4. You can also select test plan from other jobs by specifying the appropriate S3 location.
5. The system uses this test plan as the foundation for generating automation scripts.
Select test cases for script generation
1. Review the complete list of test cases from your test plan.
2. Filter and sort test cases based on multiple attributes:
  - Business functions and domains
  - Database table dependencies
  - Data set requirements
  - Complexity metrics
  - Custom search criteria
3. Select individual test cases or use bulk selection options with the Check All and Uncheck All buttons.
4. Review test case details including entry points, metrics, and business rules by clicking on individual test cases.
  
  The selected test cases will have automation scripts generated for execution on the modernized application.
Review and manage generated test automation scripts:
- The system displays a success message confirming script generation completion.
- Generated test scripts are automatically stored in your specified S3 bucket location.
- Access the complete list of generated test scripts with their respective S3 locations.
- Each test case has its corresponding automation script stored in individual S3 locations.
- Scripts are ready for deployment and execution on your modernized application environment.

Test automation script features

The generated automation scripts provide comprehensive testing capabilities:

Modernized application testing: Scripts are specifically designed to execute test cases on your transformed application
Data integration: Utilizes the test data collected from the previous test data collection step, these data need to be copied to folders for each test cases
Automated execution: Scripts can be used to set up data sinks, run test cases, and compare results, some parameters will have to be set according to your deployment environment
Organized structure: Scripts are systematically organized by test case in your S3 bucket
Ready-to-deploy format: Scripts are formatted for direct deployment to your testing environment

Generated output structure

The generated test automation scripts are organized in your S3 bucket with the following structure:

Test case organization: Each test case has its dedicated script stored in individual S3 folder
Execution-ready format: Scripts are formatted for immediate deployment and execution after setting up some variable depending from your environment
Centralized access: All scripts are accessible from a single S3 bucket location for easy management

Test execution strategy

The generated scripts support comprehensive test execution workflows:

Environment setup: Scripts include capabilities for setting up initial data to execute the test
Data preparation: Integration with collected test data from test case data collection step
Test case execution: Automated execution of individual test cases on the modernized application
Result comparison: Built-in capabilities for comparing test results and validating application behavior

Note

AWS Transform generates test automation scripts based on your test plan and selected test cases. The scripts are designed for execution on your modernized application environment and utilize the test data collected in the previous step. Review all generated scripts before deployment to ensure compatibility with your specific application configuration and testing requirements.

Best practices for test automation

Environment validation: Ensure your modernized application environment is properly configured before script execution
Data verification: Validate that the required test data from the collection phase is available and accessible
Script customization: Review and customize generated scripts as needed for your specific testing requirements
Execution monitoring: Implement appropriate monitoring and logging during test script execution
Result analysis: Establish processes for analyzing test results and identifying application issues

The final collection of test automation scripts provides a complete testing framework for validating your modernized application functionality. The scripts can be integrated into your continuous testing processes or executed as part of your application validation workflow.

Deployment capabilities in AWS Transform

AWS Transform helps you set up cloud environments for modernized mainframe applications by providing ready-to-use Infrastructure as Code (IaC) templates. Through the AWS Transform chat interface, you can access pre-built templates that create essential components like compute resources, databases, storage, and security controls. The templates are available in popular formats including CloudFormation (CFN), AWS Cloud Development Kit (AWS CDK), and Terraform, giving you flexibility to deploy your infrastructure.

These templates serve as building blocks that reduce the time and expertise needed to configure environments for your modernized mainframe applications. You can customize these templates to fit your needs, giving you a foundation to build your deployment environment.

To retrieve the IaC templates, ask in the AWS Transform chat for the Infrastructure-as-Code templates clarifying your preferred modernization pattern (such as AWS Transform for mainframe refactor), your preferred topology (standalone vs high availability), and your preferred format (CloudFormation vs Cloud Development Kit vs Terraform).

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Modernization journey

Mainframe source artifacts collection