PRPrimetrace Technologies
Platform Operations, Aladdin Data, Associate
Mumbai ₹2-6 LPA Posted 12 Jun 2025
FULL TIME
Devops
Splunk
Kubernetes
Grafana
Prometheus
+1 more
Job Description
Role Description
- 3-5 years of hand-on experience working as part of Platform Operations, Site Reliability Engineering, DevOps or related engineering teams .
- Building your skills as a domain expert on the functionality and capabilities of the platform.
- Triaging and timely resolution of client inquiries.
- Enable user best practice execution on the platform including training and adoption of new platform features.
- Understanding and acting on platform telemetry alerts including invocation of our Incident Management response plays.
- Look for opportunities to automate our workflows to improve our team s effectiveness and efficiency.
- Reporting and metrics generation on platform reliability as well as user inquiry trends.
- Contribute to building out our observability framework to enhance our platform.
Desirable Skills
- Experience building, managing and supporting large-scale platforms.
- Understanding of the K8s Operator Pattern -- comfort and courage to wade into (predominantly golang based) operator implementation code bases
- Hands -on experience deploying log management and observability platform tooling: SPLUNK / Prometheus / Grafana, AlertManager .
- Strong attention to details and focus on high quality delivery.
- Comfortable reading and writing Python code.
- Comfortable working with clients and partners at all levels of the business.