Linux Site Reliability Consultant
Linux Site Reliability Consultant AUS | Remote | Work from Home | #LIRemote Why you? Do you thrive on solving tough problems—even under pressure? Are you motivated by fast-paced environments with continuous learning opportunities? Do you enjoy collaborating with a team of peers who push you to constantly up your game?At Pythian, we are building a next-generation Site Reliability Engineering team. We need motivated and talented individuals on our teams, and we want you!You’ll act as a technology leader and advisor for our clients, as well as a mentor for other team members. Projects would include things such as infrastructure architecture, automation, and intelligent monitoring systems from the design phase through the implementation phase.
What will you be doing?
Operate, maintain and administer solutions that contribute to the operational efficiency, availability and visibility of customer infrastructure.
Planning maintenance activity, design documentation and standard procedures
Provide Root Cause Analysis reports for outages/incidents (ITIL - Problem Management)
Observe and provide feedback on the current state of the client’s infrastructure, and identify opportunities to improve resiliency, reduce the occurrence of incidents and automate repetitive administrative and operational tasks.
Contribute to, improve and maintain team documentation about client systems and infrastructure, procedures, policies and schedules.
Gather and document information about client environments through audit activities, and analyze the information to identify opportunities for improvement and application of best practices.
Work collaboratively with team mates to contribute to the continuous improvement of our working culture.
Act as a technology leader for clients, as well as drive client discussions on technology road maps.
Participate in an on-call rotation in an escalation capacity.
What do we need from you?
Solid understanding of microservices architecture and container technologies ( Kubernetes is a must , Docker, lxc etc)
Experience working with at least one major cloud provider . Preferably Google but AWS or Azure would suffice (including infrastructure as code deployment with Cloud Formation, Terraform, Opsworks etc)
Clear understanding of software development lifecycles and best practices from an infrastructure point of view
Understanding the end to end operations of a ‘Business System’ vs components.
Comprehensive systems hardware and network troubleshooting experience
Common Linux distribution platform installation, configuration and performance tuning
TCP/IP networking, NIC bonding and network services configuration (DNS, NTP, DHCP, SMTP, etc.)
Operation and administration of virtual infrastructure, including experience with at least one hypervisor (VMware, Hyper-V, KVM, etc.)
Ability to describe IaaS, PaaS, SaaS, pros and cons of each, use cases for virtualization and cloud
Administration of web servers and supporting technologies, including network load balancers
Scripting and automation of administrative tasks using bash, python, ruby, go etc.
Experience with the design, development and deployment of at least one major configuration management framework (i.e. Puppet, Ansible, Chef, Salt)
System and application error investigation, troubleshooting of access/availability issues including deep multi system root cause analysis
Experience managing networking devices, such as switches and firewalls from a variety of vendors
Solid understanding of DevOps tools, processes, and culture
Exposure and operational experience with network monitoring systems (NMS)
Ability to pick up new technologies quickly
Ability to provide accurate work scheduling and task estimations for work delivery
What do you get in return?
Love your career: Competitive total rewards package with an annual bonus
Love your development: Hone your skills or learn new ones with our substantial training allowance; participate in professional development days, attend conferences, become certified, whatever you like!
Love your work/life balance: Why commute? Work remotely from your home (forever), there’s no daily travel requirement to an office!, You can be located anywhere in Canada, all you need is a stable internet connection.
Love your workspace: We give you all the equipment you need to work from home including a laptop with your choice of OS, and an annual budget to personalise your work environment!
Love your community: Blog during work hours; take a day off and volunteer for your favorite charity.