IB

Software Engineer-Quality and Performance-Distributed Systems

IBM
Bangalore5-10 LPA Posted 13 Nov 2025
FULL TIME
Performance Tuning
Docker
Load Testing
Kubernetes
Distributed Systems
+1 more

Job Description

  • We're building Astra Serverless, the next generation of distributed, scalable, fault-tolerant, serverless NoSQL data services — powered by Apache Cassandra and extended with native Vector and AI capabilities across multi-cloud environments.
  • Our customers depend on our platform to serve real-time, mission-critical workloads on a global scale. Ensuring reliability, performance, and correctness under unpredictable workloads is a non-trivial challenge — and that's where you come in.
  • As an engineer on theQuality Engineering and Performance team, you'll develop and evolve the system-level testing frameworks that validate a distributed database-as-a-service at massive AI-driven workload scale. You'll help ensure that new features, performance improvements, and AI-driven extensions meet the highest standards of scalability and resilience.
  • Why this role
  • You'll work at the intersection of distributed systems engineeringandtest architecture — hands on designing and building automation and frameworks that simulate complex multi-cloud deployments, chaos scenarios, and performance stress conditions.
  • This is not QA-as-usualyou'll engineer the test systems that validate an elastic database platform capable of scaling thousands of non-uniform nodes, self-healing under failure, and integrating real-time vector search and analytics.
  • If you thrive on deep technical challenges, curiosity, analytical and systems thinking, and building tools other engineers rely on, this role will feel like home.
  • What You'll Help Doing
  • Design and develop frameworksfor end-to-end and chaos testing of distributed, serverless Cassandra-based systems.
  • Engineer automation that validates data correctness, fault tolerance, and performance under complex multi-region and multi-cloud topologies.
  • Collaborate closelywith feature development teams to model real-world scenarios and integrate automated validation into the delivery pipeline.
  • Continuously evolve the test infrastructure for scale, speed, and observability — leveraging Kubernetes, Docker, and cloud-native toolchains.
  • Profile and tunedistributed workloads to uncover systemic bottlenecks and verify that service-level goals are consistently met.
  • Contribute code to shared testing frameworks and participate in design and code reviews across teams.
  • Own the full cycleof quality engineering — from test design and execution to insights and continuous improvement.
  • Required education
  • Bachelor's Degree
  • Preferred education
  • Master's Degree
  • Required technical and professional expertise
  • Exposure to system level Java and Python development in testing for distributed or cloud systems — replication, partitioning, consistency, and eventual convergence.
  • Familiarity with Linux,Kubernetes,Docker, andCI/CDpipelines (Jenkins, GitHub Actions, etc.).
  • Knowledge of analyzing complex logs and metrics to isolate performance and reliability issues.
  • Eagerness to learn more about and using chaos testing,fault injection, orresilience validation.
  • Preferred technical and professional experience
  • Familiarity withNoSQL technologies (Cassandra, DynamoDB, ScyllaDB, etc.) and cloud platforms(AWS, GCP, Azure) and multi-cloud topologies.
  • Understanding of vector search,AI embeddings, ordata-intensive workloads.
  • Curiosity-driven mindset, strong communication skills, and a focus on collaboration and craftsmanship.
Join WhatsApp Channel