Senior DevOps AWS Engineer

Buenos Aires, Argentina
Full-Time
Remote

Job Description:

About the Role

We are looking for a highly motivated Senior DevOps Engineer to join a mature and collaborative engineering organization responsible for delivering a mission-critical SaaS platform used by thousands of professionals.

This position offers the opportunity to work within an established cloud environment while actively contributing to its evolution. You will partner closely with software engineers, architects, and technology leaders to improve infrastructure reliability, optimize deployment processes, enhance observability, and drive cloud modernization initiatives.

The ideal candidate combines strong AWS expertise with a proactive mindset, enjoys solving complex infrastructure challenges, and thrives in environments where operational excellence, automation, and continuous improvement are highly valued.

What You'll Do

Cloud Infrastructure & Operations

Manage and optimize cloud infrastructure hosted primarily on AWS, including services such as EC2, VPC, S3, IAM, Lambda, ECS/EKS, and related components.
Maintain infrastructure through Infrastructure as Code (Terraform), ensuring scalability, security, and operational consistency.
Partner with engineering teams to support deployments, troubleshoot environment issues, and improve developer productivity.

DevOps & Automation

Design, improve, and maintain CI/CD pipelines to support reliable and efficient software delivery.
Automate operational processes, infrastructure provisioning, monitoring, and deployment workflows.
Identify opportunities to reduce manual effort and improve system reliability through automation.

Platform Modernization

Participate in strategic infrastructure initiatives, including cloud-native transformations, server migrations, containerization efforts, and serverless adoption.
Evaluate and implement architectural improvements that increase scalability, performance, and resilience.
Contribute to technical discussions and provide recommendations on infrastructure best practices.

Monitoring & Reliability

Maintain and enhance observability platforms, logging systems, and monitoring solutions.
Establish proactive monitoring, alerting, and incident response processes.
Support capacity planning, backup validation, disaster recovery readiness, and overall platform reliability.

Incident Management & Support

Act as an escalation point for complex production issues.
Participate in incident response activities and root cause analysis.
Document operational procedures and continuously improve troubleshooting playbooks.

Required Qualifications

5+ years of experience in DevOps, Cloud Engineering, Site Reliability Engineering, or related infrastructure roles.
Strong hands-on experience with AWS cloud services in production environments.
Proven experience managing infrastructure using Terraform or similar Infrastructure-as-Code tools.
Experience supporting Linux-based environments and working knowledge of Windows Server administration.
Hands-on experience with CI/CD platforms such as Azure DevOps, GitLab CI, GitHub Actions, Jenkins, or similar.
Scripting experience with Bash, PowerShell, Python, or equivalent automation languages.
Experience troubleshooting distributed applications and cloud infrastructure.
Strong understanding of networking fundamentals, security best practices, and cloud architecture principles.
Excellent communication and collaboration skills within cross-functional engineering teams.
Ability to work independently in a fully remote environment.

Preferred Qualifications

AWS Certifications (Solutions Architect, SysOps Administrator, DevOps Engineer, or equivalent).
Experience working in regulated industries such as Healthcare, FinTech, or SaaS platforms with high compliance requirements.
Experience with Docker, Kubernetes, ECS, or containerized environments.
Familiarity with observability platforms such as Grafana, Prometheus, CloudWatch, Datadog, or similar.
Working knowledge of SQL Server administration, database maintenance, and backup validation.
Experience supporting high-availability production systems serving large user bases.

What We Offer

Fully remote work environment.
Long-term career growth opportunities.
Exposure to large-scale cloud infrastructure and modernization initiatives.
Collaborative engineering culture focused on learning, innovation, and continuous improvement.
Competitive compensation package.
Generous PTO policy, including vacation, personal, and sick leave.
Paid parental leave.
Flexible work environment that supports work-life balance and professional development.

Success Profile

The successful candidate is someone who:

Takes ownership and proactively solves problems.
Thinks beyond day-to-day operations and contributes to long-term platform strategy.
Values automation, reliability, and operational excellence.
Enjoys partnering with software engineers to build scalable systems.
Thrives in collaborative, fast-moving, and continuously evolving environments.