We are seeking a highly skilled AWS Cloud Engineer to design, implement, and maintain scalable, secure, and cost-effective cloud infrastructure. The ideal candidate will have strong expertise in AWS services, infrastructure automation, monitoring, security, and incident management. This role will play a critical part in ensuring the reliability, performance, and resilience of our cloud environments while supporting continuous improvement and operational excellence.
Key ResponsibilitiesInfrastructure & Automation
- Build, deploy, and manage scalable AWS cloud environments using Infrastructure as Code (IaC) tools such as Terraform and AWS CloudFormation.
- Automate infrastructure provisioning, configuration management, and operational workflows.
- Maintain and optimize cloud architecture to ensure high availability, scalability, and reliability.
Monitoring & Observability
- Implement and manage monitoring, logging, and alerting solutions using tools such as Amazon CloudWatch, Datadog, and OpenTelemetry.
- Track system health, infrastructure performance, and application availability.
- Develop dashboards and reporting mechanisms to provide operational visibility and actionable insights.
Security & Compliance
- Manage and govern AWS Identity and Access Management (IAM) policies, roles, and permissions.
- Implement security best practices, including network segmentation, security groups, and least-privilege access controls.
- Coordinate vulnerability management, patching, and remediation activities across AWS workloads.
- Support compliance and governance initiatives in alignment with organizational standards.
Cost Optimization
- Monitor cloud spending and resource utilization using AWS Cost Explorer and related tools.
- Identify opportunities to reduce costs through resource right-sizing, automation, and elimination of unused assets.
- Collaborate with stakeholders to optimize cloud resource consumption while maintaining performance and reliability.
Incident Management & Support
- Provide Level 2 and Level 3 operational support for cloud infrastructure and platform services.
- Lead troubleshooting efforts during major incidents and service disruptions.
- Conduct root cause analysis (RCA) and post-incident reviews, implementing corrective and preventive actions.
Business Continuity & Disaster Recovery
- Develop, maintain, and test disaster recovery (DR) strategies and business continuity plans.
- Manage backup and recovery solutions across cloud environments.
- Design and implement automated failover mechanisms to ensure service resilience and availability.
Required Qualifications
- Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field, or equivalent practical experience.
- Strong hands-on experience with AWS cloud services, including:
- Amazon EC2
- Amazon S3
- Amazon VPC
- AWS IAM
- AWS Organizations and AWS Control Tower
- Experience implementing Infrastructure as Code (IaC) using Terraform and/or AWS CloudFormation.
- Proficiency in one or more scripting languages such as Bash, Python, or PowerShell.
- Experience building and managing CI/CD pipelines.
- Hands-on experience with containerization and orchestration technologies, including Docker and Amazon EKS.
- Strong understanding of cloud networking, security, and operational best practices.
- Experience supporting production environments and participating in incident response activities.
Preferred Qualifications
- AWS certifications such as AWS Certified Solutions Architect, AWS Certified SysOps Administrator, or AWS Certified DevOps Engineer.
- Experience with observability platforms such as Datadog and OpenTelemetry.
- Familiarity with ITIL service management processes.
- Understanding of Site Reliability Engineering (SRE) principles and practices.
- Experience with multi-account AWS environments and cloud governance frameworks.
- Strong analytical, troubleshooting, and problem-solving skills.
- Excellent communication and stakeholder management abilities.
Key Competencies
- Cloud Infrastructure Management
- Infrastructure Automation
- DevOps & CI/CD
- Security & Compliance
- Monitoring & Observability
- Incident Response & Root Cause Analysis
- Disaster Recovery & Business Continuity
- Cost Optimization
- Continuous Improvement
Pay: Up to $9,000.00 per month
Work Location: In person