Опис
We are looking for an experienced Site Reliability Engineer (SRE) to support and enhance the reliability, scalability, and performance of Apache Druid, a key component of our Data Platform.Essential FunctionsProvide SRE support for Apache Druid, ensuring high availability, performance, and reliability of the platform.Perform administration, rollout, and maintenance of Apache Druid clusters.Be part of a 24x7 on-call rotation (1-week shift every 6 weeks) to handle production incidents and ensure system uptime.Monitor and troubleshoot issues in AWS-based Kubernetes (EKS) environments to maintain platform stability.Develop automation solutions for deployment, monitoring, testing, and CI/CD pipelines to improve system efficiency.Utilize Python for scripting, automation, and operational tooling.Collaborate with cross-functional teams, ensuring smooth communication and issue resolution.QualificationsStrong hands-on experience with Apache Druid – administration, rollout, and maintenance.Expertise in AWS with a solid understanding of Kubernetes operations (EKS is a must).Proficiency in Python for automation and scripting tasks.Experience with monitoring, alerting, and incident response in cloud-native environments.Familiarity with CI/CD pipelines and automation for deployment and testing.Excellent communication skills and ability to work in a collaborative team environment.Would be a plusExperience working in a customer-facing or consulting role.Additional programming experience in Java (though Python is preferred).Knowledge of Machine Learning and AI concepts.We offerOpportunity to work on bleeding-edge projectsWork with a highly motivated and dedicated teamCompetitive salaryFlexible scheduleBenefits package - medical insurance, sportsCorporate social eventsProfessional development opportunitiesWell-equipped officeAbout UsGrid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.