وصف الوظيفة
We seek a highly skilled and experienced SRE Engineer to join our dynamic team. The ideal candidate will have a robust background in software engineering, with extensive experience in using Terraform for infrastructure as code (IaC) to manage and provision cloud resources on AWS.
- Design, implement, and manage cloud infrastructure using Terraform and other IaC tools.
- Proactively monitor, troubleshoot, and optimize the performance of cloud environments to guarantee high availability and efficiency.
- Implement and maintain CI/CD pipelines for automated code deployment and infrastructure changes.
- Develop and manage the data stack, encompassing infrastructure resources, implementation, and data lake setup.
- Respond to and manage critical alerts and incidents, coordinating swift response efforts to mitigate impact and downtime.
- Perform thorough root cause analysis (RCA) for incidents to identify and address underlying issues, developing solutions that prevent future occurrences.
- Document and maintain comprehensive guides for system configurations and operational procedures, promoting knowledge sharing and operational excellence.