SRE L3

Huntingcube Recruitment Solutions Private Limited
Hyderabad | 3-7 LPA | Posted 19 Jun 2025
FULL TIME
EC2
S3
Bash
AWS

Job Description

Responsibilities:

Database Administration (DBA) Skills

  • Relational Databases: MySQL, PostgreSQL, Oracle, MS SQL Server.
  • Database Backup & Recovery: Tools and strategies for database backups and disaster recovery.
  • Performance Tuning: Query optimization, indexing strategies, and database performance troubleshooting.
  • Database Security: User management, roles, access control, and auditing.

Infrastructure as a Service Knowledge

  • Infrastructure as Code (IaC): Terraform, CloudFormation, Kubernetes.
  • Kubernetes & Containers: Good knowledge and understanding of Kubernetes and container usage.
  • Observability Tools: ELK stack (Elasticsearch, Logstash, Kibana).
  • Database Migration: Migrating databases across different platforms or cloud environments.
  • Infrastructure Scaling: Vertical and horizontal scaling techniques in cloud environments.

SRE (Site Reliability Engineering) Principles and Knowledge

  • Strong hands-on experience with AWS and Azure is required, along with a fair understanding of Google Cloud.
  • Experience in handling APIs, troubleshooting API calls, and ensuring seamless integration and performance.
  • Incident Management: Handling database outages, incident response, and on-call rotations.
  • Monitoring and Alerting: Tools like Prometheus, Grafana, Datadog, and CloudWatch; recommending proactive monitoring for the application stack.
  • Understanding of core SRE principles: SLAs, SLIs, SLOs, error budgets, etc.
  • Disaster Recovery Planning: Ensuring high availability (HA) and disaster recovery (DR) solutions.
  • Performance Optimisation: Tracking latency, slow performance, and high-utilisation issues, and recommending optimisations as required.

Scripting and Automation

  • Scripting Languages: Python, Shell scripting, Bash, PowerShell.
  • Automation Tools: Ansible, Puppet, Chef.
  • Infrastructure Automation: Automating database deployment, patching, and scaling.

Networking and Infrastructure

  • Networking Basics: TCP/IP, DNS, Firewall, Load Balancers.
  • Database Connectivity: Connection pooling, failover strategies, and multi-region deployment.
  • Storage and Disk Management: Understanding IOPS, latency, and throughput.
  • Infrastructure: Familiarity with AWS services like EC2, S3, VPC, Security Groups, private and public subnets, IAM, CloudWatch, CloudTrail, etc., and Azure services like Virtual Machines, Azure Functions, Virtual Network, Resource Manager, etc.

OS Skills

  • Expertise in Linux (RHEL, Ubuntu, CentOS).
  • Understanding of file systems (ext4, XFS, etc.), permissions, and ownership.
  • Knowledge of process monitoring, management, and troubleshooting.
  • Proficiency with tools like top, htop, vmstat, iostat, sar, and dstat to monitor CPU, memory, disk I/O, and network usage.
  • Ability to analyze system logs (/var/log/, journalctl, dmesg) for troubleshooting.
  • Understanding of resource limits (CPU, memory, disk, network) and how they impact database performance.
  • Knowledge of partitioning tools (fdisk, parted) and file system management (mkfs, mount, umount).
  • Understanding of RAID configurations and Logical Volume Management (LVM) for storage scalability.

Troubleshooting and Debugging

  • Log Analysis: Reading and analysing database and system logs.
  • Root Cause Analysis (RCA): Performing in-depth analysis after major incidents and sharing RCA with customers.
  • Query Performance: Analysing slow queries, deadlocks, and resource contention.

Soft Skills

  • Communication Skills: Clear written and verbal communication with internal and external stakeholders.
  • Problem-Solving: Ability to prioritise and troubleshoot critical issues and bring them to closure.
  • Collaboration: Working closely with DevOps, Infrastructure, and Engineering teams.
