NTNTT DATA Global Delivery Services Limited
Senior Site Reliability Engineer (REMOTE)
Pune ₹5-8 LPA Posted 4 Apr 2025
FULL TIME
Scripting
Google Cloud
AI ML
Job Description
Senior Site Reliability Engineer (Remote)
We are looking for a talented Senior Site Reliability Engineer (SRE) with a strong background in Google Cloud Platform (GCP), Google BI and AI/ML tools (Looker, BigQuery ML, Vertex AI, etc.) and RedHat OpenShift administration. The ideal candidate will be responsible for ensuring the reliability, performance, and scalability of our on-premises and cloud-based systems along with focus on reducing costs for Google Cloud.
Duties and responsibilities:
- System Reliability: Ensure the reliability and uptime of critical services and infrastructure.
- Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud services.
- Automation: Develop and maintain automation scripts and tools to improve system efficiency and reduce manual intervention.
- Monitoring and Incident Response: Implement monitoring solutions and respond to incidents to minimize downtime and ensure quick recovery.
- Collaboration: Work closely with development and operations teams to improve system reliability and performance.
- Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle future growth.
- Documentation: Create and maintain comprehensive documentation for system configurations, processes, and procedures.
Qualifications:
- 5+ years of experience in site reliability engineering or a similar role.
- 5+ years with proficiency in Google Cloud services (Compute Engine, Kubernetes Engine, Cloud Storage, BigQuery, Pub/Sub, etc.).
- 5+ years of experience with Google BI and AI/ML tools (Looker, BigQuery ML, Vertex AI, etc.)
- 5+ years of experience with automation tools (Terraform, Ansible, Puppet).
- 5+ years of experience with CI/CD pipelines and tools (Azure pipelines Jenkins, GitLab CI, etc.).
- 5+ years of strong scripting skills (Python, Bash, etc.).
- 3+ years of previous experience with networking concepts and protocols.
- 3+ years of experience with monitoring tools (Prometheus, Grafana, etc.).
Preferred:
- Google Cloud Professional DevOps Engineer
- Google Cloud Professional Cloud Architect
- Red Hat Certified Engineer (RHCE) or similar Linux certification
- Bachelor's degree in computer science, Engineering, or a related field.
Please note Shift Timing Requirement: 1:30pm IST -10:30 pm IST