SASaviynt
Engineer-Cloud Operations
Bangalore ₹2-3 LPA Posted 25 Jun 2025
FULL TIME
Information Security
Monitoring
Dns
Account Management
Disaster Recovery
+3 more
Job Description
What You Will Be Doing
Monitoring and Alerting
- Proactively monitor the health and performance of cloud infrastructure and applications.
- Implement robust alerting systems to identify and address issues quickly.
- Analyze system logs and metrics to identify trends and anomalies.
Incident Response
- Respond to incidents and outages in a timely and effective manner.
- Troubleshoot and resolve complex technical problems, escalating as necessary.
- Conduct root cause analysis to prevent recurrence.
Deployment and Configuration
- Deploy and configure infrastructure components on AWS and Azure (virtual machines, networks, storage).
- Manage Kubernetes clusters and deploy application containers.
- Automate routine tasks using scripting and configuration management tools.
Performance Optimization
- Identify and implement strategies to improve application response times and resource usage.
- Conduct load testing and capacity planning to ensure scalability.
Collaboration
- Collaborate with development teams to understand application requirements and deployment processes.
- Work closely with infrastructure teams to ensure high availability and reliability.
What You Bring
- 3+ years of experience in cloud operations or DevOps engineering.
- Strong proficiency in AWS and Azure, including services like EC2, VPC, S3, RDS, Azure Virtual Machines, Virtual Networks, and Storage Accounts.
- In-depth knowledge of Kubernetes and container orchestration.
- Experience with Java-based applications deployed on cloud platforms.
- Proficiency in scripting languages such as Bash, Python, or PowerShell.
- Experience with configuration management tools like Ansible or Puppet.
- Strong understanding of networking concepts including TCP/IP, DNS, and load balancing.
- Excellent problem-solving and troubleshooting skills.
- Ability to work independently and in a team environment.
- Strong written and verbal communication skills.
Additional Considerations
- This position requires working a night shift to ensure 24/7 global coverage.
- On-call responsibilities may include responding to urgent issues outside regular business hours.
Security and Compliance
- Complete security and privacy awareness training during onboarding and annually.
- Review, understand, and comply with all relevant policies and procedures, including:
- Data Classification, Retention & Handling
- Incident Response
- Business Continuity & Disaster Recovery
- Mobile Device and Access Control
- Personnel Security and Privacy Policies