HUHuntingcube Recruitment Solutions Private Limited
SRE L3
Hyderabad ₹3-7 LPA Posted 19 Jun 2025
FULL TIME
Ec2
S3
Bash
Aws
Job Description
Responsibilities:
Database Administration (DBA) Skills
- Relational Databases: MySQL, PostgreSQL, Oracle, MS SQL Server.
- Database Backup & Recovery: Tools and strategies for database backups and disaster recovery.
- Performance Tuning: Query optimization, indexing strategies, and database performance troubleshooting.
- Database Security: User management, roles, access control, and auditing.
Infrastructure as a Service Knowledge
- Infrastructure as Code (IaC): Terraform, CloudFormation, Kubernetes.
- Kubernetes & Containers: Good Knowledge and Understanding of Kubernetes and usage of Containers.
- Observability Tools: ELK stack (Elasticsearch, Logstash, Kibana)
- Database Migration: Migrating databases across different platforms or cloud environments.
- Infrastructure Scaling: Vertical and horizontal scaling techniques in cloud environments.
SRE Principles and knowledge (Site Reliability Engineering)
- Strong hands-on experience in AWS and Azure cloud, and a fair understanding of Google Cloud would be required.
- Experience in handling APIs, troubleshooting API calls, and ensuring seamless integration and performance.
- Incident Management: Handling database outages, incident response, and on-call rotations.
- Monitoring and Alerting: Tools like Prometheus, Grafana, Datadog, CloudWatch , suggest proactive monitoring for the application stack
- Understanding on core SRE principles: SLA, SLI, SLO, Error budgets etc
- Disaster Recovery Planning: Ensuring high availability (HA) and disaster recovery (DR) solutions.
- Performance Optimisation :- Track latency, slow performance , high utilisation issues and recommend optimisation as required.
Scripting and Automation
- Scripting Languages: Python, Shell scripting, Bash, PowerShell.
- Automation Tools: Ansible, Puppet, Chef.
- Infrastructure Automation: Automating database deployment, patching, and scaling.
Networking and Infrastructure
- Networking Basics: TCP/IP, DNS, Firewall, Load Balancers.
- Database Connectivity: Connection pooling, failover strategies, and multi- region deployment.
- Storage and Disk Management: Understanding IOPS, latency, and throughput.
- Infrastructure: Familiarity with AWS services like EC2, S3, VPC, Security
- Groups, Private and Public subnets,IAM, CloudWatch, Cloudtrail etc and Azure services like Virtual Machines, Azure functions, Virtual Network, Resource Manager, etc.
OS Skills
- Expertise in Linux OS ( RHEL, Ubuntu, Centos)
- Understanding of file systems (ext4, XFS, etc.), permissions, and ownerships
- Knowledge of process monitoring, management, and troubleshooting
- Proficiency with tools like top, htop, vmstat, iostat, sar, and dstat to monitor CPU, memory, disk I/O, and network usage.
- Ability to analyze system logs (/var/log/, journalctl, dmesg) for troubleshooting.
- Understanding of resource limits (CPU, memory, disk, network) and how they impact database performance.
- Knowledge of partitioning tools (fdisk, parted) and file system management (mkfs, mount, umount).
- Understanding of RAID configurations and Logical Volume Management (LVM) for storage scalability.
Troubleshooting and Debugging
- Log Analysis: Reading and analysing database and system logs.
- Root Cause Analysis (RCA): Performing in-depth analysis after major incidents and sharing RCA with customers.
- Query Performance: Analysing slow queries, deadlocks, and resource contention.
Soft Skills
- Communication Skills: Clear written and verbal communication with internal and external stakeholders.
- Problem-Solving: Ability to prioritise, troubleshoot critical issues and bring them to closure..
- Collaboration: Working closely with DevOps, Infrastructure, and Engineering teams.