Site Reliability Engineers (SREs)

Opportunity with Dish Network

Objectives of this Role

· Understand and document the non-functional performance and scalability requirements, including SLIs/SLOs, and validate them with business stakeholders

· Manage SLIs/SLOs for customer-facing interfaces and backend services, and provide improvement plans where they are not met

· Develop custom dashboards in New Relic to represent a holistic view of system operational health

· Improve reliability, quality, and time-to-market of our suite of software solutions

· Support release engineering by providing automation and by pushing changes to production when manual intervention is needed

· Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve

· Provide primary operational support and engineering for multiple large distributed software applications

Daily and Monthly Responsibilities

· Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding

· Partner with engineering teams to improve services through rigorous testing and release procedures

· Participate in system design consulting, platform management, and capacity planning

· Model areas of risk to estimate latency characteristics and capacity requirements. Typically, this means either refining the workload and modelling how it applies to a set of components, or working with component suppliers to estimate capacity requirements

· Create sustainable systems and services through automation and uplifts

· Balance feature development speed and reliability with well-defined service level objectives

Required Skills and Qualifications

Experience: 7 to 14 years

Must-have skills: resiliency, reliability, SLI, SLO, AWS, Kubernetes, observability, performance tuning, problem management, New Relic, log analysis, DevOps

· Bachelor’s degree in computer science or another highly technical or scientific discipline

· Ability to program (structured and OO) in one or more high-level languages or frameworks, such as Go, Java, C/C++, Ruby, React Native, and JavaScript

· Experience with AWS cloud services such as EC2, S3, CloudFront, and EKS, as well as dynamic resource management frameworks (Kubernetes)

· Experience with at least one application performance management tool (preferably New Relic), the EFK stack, and log analysis

· A proactive approach to spotting problems, areas for improvement, and performance bottlenecks

· Ability to drive a collaborative approach across business functions and external partners

Preferred Qualifications

· Previous success in technical engineering

· Coding experience beyond simple scripts

Bangalore, KA, IN