Site Reliability Engineer
Job Description
· 3-12 years of experience in code management, deployment processes, procedures, and tools in a DevOps or SRE role.
· Experience with monitoring tools (preferred: Dynatrace, Splunk, Datadog, Grafana, and New Relic).
· Proficiency in observability trends, tools, products, and technologies.
· Ability to identify gaps in the SRE practice and implement solutions.
· Experience supporting a cloud production environment (preference for Azure).
· Competency in one or more programming languages for automation (Python preferred).
· Knowledge of cloud deployment tools and methodologies (e.g., Ansible, Terraform, Azure DevOps).
· Understanding of Kubernetes and Docker architecture and associated tools.
· Experience with configuration management solutions (e.g., Chef, Ansible, AWS CodeDeploy).
· Proficiency with repository and pipeline-related tools (e.g., GitLab, Jenkins, Bamboo, Travis, CircleCI).
· Strong troubleshooting skills.
· Ability to take ownership and deliver solutions autonomously.