Senior Cloud Infrastructure Engineer
Job Description:
Key Responsibilities
- Lead the inventory of AWS resources across us-east-1 and us-west-2, classifying assets by environment and function (Compute, Networking, Databases).
- Import manually provisioned resources (CloudFront, ALBs, VPC components, S3, EC2) into Terraform state without re-provisioning to ensure zero downtime.
- Identify production workloads currently residing in Development VPCs and implement architectural changes to strictly isolate Dev, Staging, and Production environments.
- Configure AWS security services (GuardDuty, AWS Config) to support continuous compliance validation and align with SOC 2 expectations for access control and logging.
- Review the existing RDS footprint (approx. 32 instances) and document ETL flows from MongoDB Atlas to Postgres; ensure databases are properly represented in Terraform.
- Define and implement standard Terraform structure, modules, and tagging conventions (e.g., ck-terraform, ck-service) for future scalability.
- Implement infrastructure changes to support change visibility and traceability as required for SOC 2 audits.
Detailed Scope of Work & Deliverables
1. Infrastructure Discovery & Classification
Inventory all AWS resources and identify which are managed by Terraform vs. manually provisioned.
Apply resource tags to determine provisioning source and ownership.
Classify resources by environment (Dev, Staging, Prod) and function.
2. Environment Separation & Alignment
Document and remediate cross-VPC communication paths.
Move production workloads out of the Development VPC.
Implement Terraform changes to enforce logical and architectural separation of environments.
3. Terraform Coverage Expansion
Migrate selected resources (CloudFront, Load Balancers, S3, EC2) into Terraform.
Import existing resources into the state file to prevent destructive changes.
Standardize module usage across the organization.
4. Security & Compliance Controls
Review and enhance existing GuardDuty and AWS Config setups.
Assist in defining workflows for security findings and configuration drift.
Ensure all infrastructure changes prioritize non-disruptive remediation and data integrity.
Required Qualifications & Experience
Minimum Qualifications (Pass/Fail):
- At least 5 years of hands-on experience with Terraform in a production AWS environment.
- Demonstrated experience importing existing, manually created AWS resources into Terraform state (terraform import) without service interruption.
Required Experience:
- Experience bringing wild or unmanaged infrastructure under Infrastructure-as-Code (IaC) management.
- Experience implementing infrastructure controls specifically to meet SOC 2 or similar compliance frameworks (ISO 27001, HIPAA).
- Deep expertise in AWS Networking, including VPC Peering, Transit Gateways, Security Groups, and NACLs.
- Experience managing RDS instances and understanding database connectivity/security (Postgres, MongoDB Atlas).
Skills & Competencies:
- Terraform (module development, state management, refactoring).
- AWS services (EC2, RDS, S3, CloudFront, VPC, IAM).
- Python or Bash for automation tasks.
Technical Skill Set
- Terraform (Expert level), Terragrunt (Preferred).
- AWS (Expert level).
- Python, Bash.
- RDS (Postgres), MongoDB Atlas.
- AWS Config, GuardDuty, IAM, Security Hub.
- Git, GitHub/GitLab.