CloudtoRepo

Reverse-engineer your AWS estate into Terraform — using scripts, not click-ops.

cloudtorepo scans your AWS account(s) and generates ready-to-use Terraform import {} blocks (Terraform >= 1.5) together with resource skeletons and S3 remote-state backends. After running the script you can execute terraform plan -generate-config-out=generated.tf in any service directory to capture the full live configuration automatically.

Explained in detail on the blog: https://andrewbaker.ninja/2026/03/21/reverse-engineering-your-aws-estate-into-terraform-using-cloudtorepo-org/

Requirements

Tool	Minimum version
AWS CLI	2.x
Terraform	1.5
jq	1.6
Bash	4.x

aws sts get-caller-identity   # verify credentials
terraform version             # must be >= 1.5
jq --version

Quick start

git clone https://github.com/andrewbakercloudscale/cloudtorepo.git
cd cloudtorepo
chmod +x cloudtorepo.sh reconcile.sh drift.sh examples/*.sh

1. Dry-run — preview resource counts without writing files

./cloudtorepo.sh \
  --regions "us-east-1" \
  --services "ec2,vpc,rds" \
  --dry-run

2. Single account with S3 remote state

./cloudtorepo.sh \
  --regions "us-east-1,eu-west-1" \
  --services "ec2,eks,rds,s3,vpc" \
  --state-bucket my-tf-state-prod \
  --state-region us-east-1 \
  --output ./tf-output

3. Multi-account organisation sweep

./cloudtorepo.sh \
  --accounts "123456789012,234567890123,345678901234" \
  --role OrganizationAccountAccessRole \
  --regions "us-east-1,eu-west-1,ap-southeast-2" \
  --state-bucket my-tf-state-org \
  --output ./tf-output \
  --debug

4. Named AWS profile

./cloudtorepo.sh \
  --profile prod-readonly \
  --regions "eu-west-1" \
  --services "ec2,vpc,rds,eks" \
  --output ./tf-output

Options

Flag	Description	Default
`--accounts`	Comma-separated account IDs	Current account
`--regions`	Comma-separated regions	`us-east-1`
`--services`	Comma-separated services, or `list` to print all supported services	All supported services
`--profile`	AWS named profile (`AWS_PROFILE`)	—
`--role`	IAM role name to assume in each account	—
`--state-bucket`	S3 bucket for remote state `backend "s3"`	— (local state)
`--state-region`	Region of the state S3 bucket	Same as resource region
`--output`	Root output directory	`./tf-output`
`--parallel`	Max concurrent service scans	`5`
`--exclude-services`	Comma-separated services to skip	—
`--tags`	Only import resources with these tags e.g. `"Env=prod,Team=sre"`	—
`--since`	Only include resources created after this date (`YYYY-MM-DD`)	—
`--resume`	Skip account/region/service combos already written	`false`
`--dry-run`	Print resource counts; do not write files	`false`
`--debug`	Verbose logging	`false`
`--version`	Print version and exit	—

Supported services

Category	Services
Compute	`ec2`, `ebs`, `ecs`, `eks` (clusters, node groups, addons, Fargate profiles), `lambda`
Networking	`vpc` (VPCs, subnets, security groups, route tables, IGWs, NAT gateways), `elb`, `cloudfront`, `route53`, `acm`, `transitgateway`, `vpcendpoints`
Data	`rds`, `dynamodb`, `elasticache`, `msk`, `s3`, `efs`, `opensearch`, `redshift`, `documentdb`
Streaming	`kinesis` (Data Streams + Firehose)
Integration	`sqs`, `sns`, `apigateway`, `eventbridge`, `stepfunctions`, `ses`
Security & Compliance	`iam` (roles, instance profiles, OIDC providers), `kms`, `secretsmanager`, `wafv2`, `config`, `cloudtrail`, `guardduty`
Platform	`ecr`, `ssm`, `cloudwatch`, `backup`, `codepipeline`, `codebuild`
Auth	`cognito` (user pools, clients, identity pools — fully paginated)
ETL	`glue` (jobs, crawlers, databases, connections)
Storage	`fsx` (Windows, Lustre, ONTAP, OpenZFS), `transfer` (SFTP/FTPS servers + users)
App Platform	`elasticbeanstalk`, `apprunner`, `lightsail`
Analytics	`athena` (workgroups + data catalogs), `lakeformation`, `memorydb`
Governance	`servicecatalog` (portfolios + products)

Output structure

tf-output/
├── summary.txt
└── 123456789012/
    ├── us-east-1/
    │   ├── ec2/
    │   │   ├── backend.tf
    │   │   ├── imports.tf
    │   │   └── resources.tf
    │   ├── eks/
    │   ├── lambda/
    │   │   └── _packages/
    │   └── rds/
    └── eu-west-1/

Generated files

backend.tf — S3 remote state configuration + provider block.

imports.tf — One import {} block per discovered resource, e.g.:

import {
  to = aws_eks_cluster.cluster_production
  id = "production"
}

resources.tf — Empty resource skeletons matching the import blocks.

Populating configuration from live state

For each service directory:

cd tf-output/123456789012/us-east-1/eks
terraform init
terraform plan -generate-config-out=generated.tf

Terraform reads live state and writes a fully-populated generated.tf. Review it, remove any computed / read-only attributes that would cause a diff, then commit as your Terraform baseline.

Automating the plan step with run.sh

Instead of running terraform init + terraform plan manually in every service directory, use run.sh to process the entire output tree in one command:

# Process every service directory under tf-output
./run.sh --output ./tf-output

# Limit to specific accounts, regions, or services
./run.sh --output ./tf-output --regions "us-east-1" --services "ec2,eks,rds"

# Only run terraform init (skip the plan)
./run.sh --output ./tf-output --init-only

# Preview which directories would be processed
./run.sh --output ./tf-output --dry-run

# Run up to 5 terraform processes in parallel (default: 3)
./run.sh --output ./tf-output --parallel 5

Each service directory gets a .run.log file. A summary at the end shows which directories succeeded, had no changes, or failed.

run.sh options

Flag	Description	Default
`--output`	Output directory from `cloudtorepo.sh`	`./tf-output`
`--services`	Limit to specific services	All
`--regions`	Limit to specific regions	All
`--accounts`	Limit to specific accounts	All
`--parallel`	Max concurrent terraform runs	`3`
`--init-only`	Only run `terraform init`, skip plan	`false`
`--dry-run`	Print directories; do not run terraform	`false`
`--debug`	Verbose logging	`false`

Running terraform import with import.sh

After reviewing generated.tf, use import.sh to run terraform import for every resource block in the output tree. It checks terraform state list first and skips anything already managed, so it is safe to re-run at any time.

# Preview what would be imported (no state changes)
./import.sh --output ./tf-output --dry-run

# Import everything sequentially
./import.sh --output ./tf-output

# Import with parallel workers and auto terraform init
./import.sh --output ./tf-output --parallel 4 --init

# Limit to specific accounts, regions, or services
./import.sh --output ./tf-output \
  --accounts "123456789012" \
  --regions  "us-east-1" \
  --services "ec2,eks,rds"

Each service directory gets a .import.log file. A summary at the end shows resources imported, skipped (already in state), and failed.

import.sh options

Flag	Description	Default
`--output`	Output directory from `cloudtorepo.sh`	`./tf-output`
`--services`	Limit to specific services	All
`--regions`	Limit to specific regions	All
`--accounts`	Limit to specific accounts	All
`--parallel`	Max concurrent terraform import runs	`1`
`--init`	Run `terraform init` before importing	`false`
`--dry-run`	Print what would be imported; do not run terraform	`false`
`--debug`	Verbose logging	`false`

Checking coverage with reconcile.sh

After exporting, verify you haven't missed any resources by comparing the output against AWS Resource Explorer:

# Dry run — preview without querying Resource Explorer
./reconcile.sh --output ./tf-output --dry-run

# Full reconciliation
./reconcile.sh --output ./tf-output --index-region us-east-1

# With a named AWS profile
./reconcile.sh --output ./tf-output --index-region us-east-1 --profile prod-readonly

Sample output:

Summary
-------
Total resources (Resource Explorer):  847
Matched to exported import blocks:    801
Potentially missed:                    46
Coverage:                              94%

Resource Explorer must be enabled with an aggregator index in --index-region.

reconcile.sh options

Flag	Description	Default
`--output`	Output directory from `cloudtorepo.sh`	`./tf-output`
`--index-region`	Region containing the Resource Explorer aggregator index	`us-east-1`
`--accounts`	Comma-separated account IDs to filter results	All in index
`--profile`	AWS named profile (`AWS_PROFILE`)	—
`--local`	Skip Resource Explorer; show per-service import block counts from output dir	`false`
`--dry-run`	Show what would be checked; do not query Resource Explorer	`false`
`--debug`	Verbose logging	`false`

Detecting drift with drift.sh

After establishing a Terraform baseline, use drift.sh to detect resources that have been created or deleted outside of Terraform. Unlike reconcile.sh, drift detection requires no additional AWS services — it uses only the AWS CLI.

# Report only — see what has changed without touching any files
./drift.sh --output ./tf-output --regions "us-east-1"

# Scope to specific services
./drift.sh --output ./tf-output --regions "us-east-1" --services "ec2,rds,eks"

# Apply — update imports.tf in place (adds new blocks, comments out deleted ones)
./drift.sh --output ./tf-output --regions "us-east-1" --apply

# Save report to file
./drift.sh --output ./tf-output --apply --report ./drift-report.txt

Sample output:

CloudtoRepo Drift Report
Generated: 2026-03-24T10:00:00Z
=======================================================

  123456789012 / us-east-1 / ec2
  -------------------------------------------------------
  NEW  (2 resource(s) found in AWS, not in imports.tf)
    + aws_instance.web_server_new  (id: i-0abc123def456)
    + aws_instance.batch_worker    (id: i-0def789abc012)
  REMOVED  (1 resource(s) in imports.tf, no longer in AWS)
    - aws_instance.old_bastion     (id: i-0111222333444)

=======================================================
Summary
-------
Unchanged:               22
New (not yet imported):   2
Removed (stale):          1

Run with --apply to update imports.tf files automatically.

drift.sh options

Flag	Description	Default
`--output`	Output directory from `cloudtorepo.sh`	`./tf-output`
`--accounts`	Comma-separated account IDs	Current account
`--regions`	Comma-separated regions	`us-east-1`
`--services`	Comma-separated services, or `list` to print all supported services	All supported services
`--profile`	AWS named profile (`AWS_PROFILE`)	—
`--role`	IAM role to assume in each account	—
`--apply`	Update `imports.tf` in place	`false`
`--dry-run`	Preview drift without writing files or requiring output dir	`false`
`--report`	Write report to file in addition to stdout	—
`--parallel`	Max concurrent service scans	`5`
`--exclude-services`	Comma-separated services to skip	—
`--debug`	Verbose logging	`false`

Recommended workflow

Run with --dry-run to verify resource counts and permissions.
Export a single region with your highest-priority services (use --parallel 5 for speed).
Run run.sh --output ./tf-output to execute terraform init + terraform plan across all service directories automatically.
Review generated.tf in each directory; remove computed / read-only attributes.
Run import.sh --output ./tf-output --dry-run to preview, then import.sh --output ./tf-output to execute terraform import for all resources not yet in state.
Run reconcile.sh to identify gaps (requires AWS Resource Explorer, or use --local for a quick count without it).
Commit the baseline on a baseline-import branch.
Refactor incrementally via pull requests.
Run drift.sh regularly (or in CI) to catch resources created outside Terraform.
Use --exclude-services to skip services managed by a different team or tool.

IAM permissions

The principal running the scripts needs read-only access to the services you want to scan. No write permissions are required for cloudtorepo.sh, drift.sh, or report.sh. reconcile.sh additionally requires AWS Resource Explorer read access.

Minimum IAM policy

Attach this managed policy (or an inline equivalent) to the IAM role or user:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "CloudtorepoReadOnly",
      "Effect": "Allow",
      "Action": [
        "acm:ListCertificates",
        "appconfig:ListApplications", "appconfig:ListEnvironments",
        "athena:ListWorkGroups", "athena:ListDataCatalogs",
        "backup:ListBackupPlans", "backup:ListBackupVaults",
        "bedrock-agent:ListAgents", "bedrock-agent:ListKnowledgeBases",
        "cloudfront:ListDistributions",
        "cloudtrail:DescribeTrails",
        "cloudwatch:DescribeAlarms", "logs:DescribeLogGroups",
        "codebuild:ListProjects",
        "codepipeline:ListPipelines",
        "cognito-idp:ListUserPools", "cognito-idp:ListUserPoolClients",
        "cognito-identity:ListIdentityPools",
        "config:DescribeConfigRules", "config:DescribeConfigurationRecorders",
        "docdb:DescribeDBClusters",
        "dynamodb:ListTables",
        "ec2:Describe*",
        "ecr:DescribeRepositories", "ecr:GetLifecyclePolicy",
        "ecs:ListClusters", "ecs:ListServices", "ecs:ListTaskDefinitions",
        "efs:DescribeFileSystems",
        "eks:ListClusters", "eks:DescribeCluster",
        "eks:ListNodegroups", "eks:ListAddons", "eks:ListFargateProfiles",
        "elasticache:DescribeReplicationGroups", "elasticache:DescribeCacheClusters",
        "elasticbeanstalk:DescribeApplications", "elasticbeanstalk:DescribeEnvironments",
        "elasticloadbalancing:DescribeLoadBalancers",
        "emr:ListClusters",
        "events:ListEventBuses", "events:ListRules",
        "fsx:DescribeFileSystems",
        "glue:GetDatabases", "glue:GetJobs", "glue:GetCrawlers",
        "guardduty:ListDetectors",
        "iam:ListRoles", "iam:ListPolicies", "iam:ListGroups",
        "iam:ListInstanceProfiles", "iam:ListOpenIDConnectProviders",
        "kafka:ListClusters",
        "kinesis:ListStreams",
        "kms:ListKeys",
        "lambda:ListFunctions",
        "lightsail:GetInstances", "lightsail:GetRelationalDatabases",
        "memorydb:ListClusters",
        "opensearch:ListDomainNames",
        "organizations:ListAccounts", "organizations:ListRoots",
        "organizations:ListOrganizationalUnitsForParent",
        "rds:DescribeDBInstances", "rds:DescribeDBClusters",
        "redshift:DescribeClusters",
        "route53:ListHostedZones",
        "s3:ListAllMyBuckets", "s3:GetBucketLocation",
        "sagemaker:ListDomains", "sagemaker:ListEndpoints",
        "secretsmanager:ListSecrets",
        "servicecatalog:ListPortfolios", "servicecatalog:SearchProductsAsAdmin",
        "ses:ListEmailIdentities",
        "sns:ListTopics",
        "sqs:ListQueues",
        "ssm:DescribeParameters",
        "stepfunctions:ListStateMachines",
        "transfer:ListServers",
        "wafv2:ListWebACLs",
        "xray:GetGroups", "xray:GetSamplingRules"
      ],
      "Resource": "*"
    },
    {
      "Sid": "CloudtorepoTagFilter",
      "Effect": "Allow",
      "Action": [
        "tag:GetResources"
      ],
      "Resource": "*"
    },
    {
      "Sid": "CloudtorepoMultiAccount",
      "Effect": "Allow",
      "Action": [
        "sts:AssumeRole"
      ],
      "Resource": "arn:aws:iam::*:role/*"
    },
    {
      "Sid": "CloudtorepoReconcile",
      "Effect": "Allow",
      "Action": [
        "resource-explorer-2:Search",
        "resource-explorer-2:GetIndex",
        "resource-explorer-2:GetView",
        "resource-explorer-2:ListViews"
      ],
      "Resource": "*"
    }
  ]
}

Note: The CloudtorepoMultiAccount statement is only needed when using --role for cross-account sweeps. The CloudtorepoReconcile statement is only needed for reconcile.sh. You can omit either block if you don't use those features.

Testing

The tests/ directory contains a bats-core test suite (112 tests across 7 suites) that exercises all scripts using a mock AWS CLI and mock Terraform binary — no real AWS credentials or Terraform state needed.

File	Tests	What it tests
`tests/cloudtorepo.bats`	26	Main scanner: flags, service exporters, slug dedup, `--resume`, `--output-format`, `--since`, `--exclude-services`, `--services list`
`tests/drift.bats`	26	Drift detection: NEW/REMOVED reporting, `--apply` mutations, `--dry-run`, `--services list`, 13 scan functions
`tests/import.bats`	13	Terraform import runner: flags, filters, `--dry-run`, `--parallel`, state-skipping, import call verification
`tests/reconcile.bats`	8	Coverage calculation: simple IDs, ARN-as-ID, composite IDs, missed resources, `--local`
`tests/report.bats`	13	Markdown report: title, summary table, per-service counts, sort order, `--out`, drift section
`tests/run.bats`	11	Terraform plan runner: flags, filters, `--dry-run`, `--init-only`, no-match exit
`tests/common.bats`	15	Shared library: `slugify`, `tag_match`, `log`/`debug`/`die`

Install bats-core:

# macOS
brew install bats-core

# Linux / CI
git clone https://github.com/bats-core/bats-core.git
sudo bats-core/install.sh /usr/local

Run tests:

# Individual suites
bats tests/cloudtorepo.bats
bats tests/drift.bats
bats tests/import.bats
bats tests/reconcile.bats
bats tests/report.bats
bats tests/run.bats
bats tests/common.bats

# Run all at once
bats tests/

The mock AWS CLI (tests/helpers/mock_aws.bash) intercepts every aws subcommand and returns fixture data set per-test via mock_response. Real jq is required.

Shared helper functions (log, debug, slugify, AWS retry wrapper, tag filtering, cross-account role assumption) live in lib/common.sh and are sourced by both cloudtorepo.sh and drift.sh.

Contributing

See CONTRIBUTING.md.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.github		.github
examples		examples
lib		lib
scripts		scripts
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
cloudtorepo.png		cloudtorepo.png
cloudtorepo.sh		cloudtorepo.sh
drift.sh		drift.sh
favicon-tc.png		favicon-tc.png
favicon.png		favicon.png
favicon.svg		favicon.svg
import.sh		import.sh
index.html		index.html
reconcile.sh		reconcile.sh
report.sh		report.sh
run.sh		run.sh
sync.sh		sync.sh
terraclaim.jpg		terraclaim.jpg
terraclaim.png		terraclaim.png
terraclaimv2.png		terraclaimv2.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CloudtoRepo

Requirements

Quick start

1. Dry-run — preview resource counts without writing files

2. Single account with S3 remote state

3. Multi-account organisation sweep

4. Named AWS profile

Options

Supported services

Output structure

Generated files

Populating configuration from live state

Automating the plan step with run.sh

run.sh options

Running terraform import with import.sh

import.sh options

Checking coverage with reconcile.sh

reconcile.sh options

Detecting drift with drift.sh

drift.sh options

Recommended workflow

IAM permissions

Minimum IAM policy

Testing

Contributing

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

CloudtoRepo

Requirements

Quick start

1. Dry-run — preview resource counts without writing files

2. Single account with S3 remote state

3. Multi-account organisation sweep

4. Named AWS profile

Options

Supported services

Output structure

Generated files

Populating configuration from live state

Automating the plan step with run.sh

run.sh options

Running terraform import with import.sh

import.sh options

Checking coverage with reconcile.sh

reconcile.sh options

Detecting drift with drift.sh

drift.sh options

Recommended workflow

IAM permissions

Minimum IAM policy

Testing

Contributing

License

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors 1

Languages

Packages