Bothell, WA (Corporate)

Site Reliability Engineer

North America

Reports to: Director, DevOps & Information Security

Position summary

Winshuttle is looking for a Site Reliability Engineer to automate critical processes and implement end-to-end application and platform monitoring that weaves together multiple data inputs to enable proactive, self-healing solutions. As Site Reliability Engineer, you will implement toolsets to improve availability and reliability and develop automated solutions to reduce manual errors and improve productivity. You will be part of the DevOps team, working with test, systems, and software engineers to troubleshoot and resolve incidents that arise within the cloud-based, multi-tenant application operating in a 24×7 high-availability environment. Working closely with the Software Development and DevOps teams, you will use trends and metrics to identify opportunities for improvements within existing frameworks, tools, and processes to continuously enhance operational reliability and scalability. In summary, this role is responsible for complete application availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.

About Winshuttle

Are you interested in working in a fun, collaborative environment for an award-winning workplace? Winshuttle is dedicated to fostering a culture of respect and innovation to support and empower employees' ambitions. We're constantly looking for entrepreneurs who aren't afraid to think outside the box and don't take themselves too seriously. We embrace and support employees who seek opportunities for continued learning, inspire others, and live and breathe our core PACT values. We have a work-hard, play-hard mentality; we're constantly evolving lean solutions for ERP business processes by day, and dominating on the frisbee golf course by night. Our strength and competitive advantage stem from our awesome employees, and we strive to create a balanced work life that is as inspiring and rewarding as life at home. Think you might be a great fit?

Essential functions and responsibilities

  • Work closely with software engineering and DevOps teams to define and implement service monitoring solutions that proactively enhance and automate issue identification and resolution.
  • Automate a scalable and repeatable incident management process with automated escalation to Engineering, DevOps and Support teams.
  • Ensure security best practices are present from the initial build and integrated into the entire development process.
  • Perform deep investigations that stretch your skills as you traverse rich telemetry streams to isolate and solve complex performance and reliability issues for online services.
  • Review and influence ongoing design, architecture, standards and methodology for improving application performance, health and sustained availability by managing an instrumentation platform and build automation within a CI/CD framework.
  • Build and document a support matrix process by working closely with the global support team to triage incidents into a fully functional escalation path, with incident response following the proper channel to final resolution.

Desired behaviors

  • Willing to work on call (24×7) as part of an escalation team.
  • Investigate and troubleshoot technical issues analytically and thoroughly, and assist in addressing the issues with improved processes and auto-remediation that are fully documented and articulated to the Global Support Organization.
  • Strong customer focus with ability to work effectively across multiple business and technical teams to ensure continued customer happiness.
  • Initiates action – is results oriented, takes responsibility for actions and outcomes. Meets commitments and strives for high performance.
  • Technically proficient – knows role and has a solid familiarity with tasks and responsibilities.
  • Takes responsibility for learning – knows personal strengths and recognizes development needs. Is open to feedback and always seeks to learn.
  • Displays ethical character and competence – acts with integrity and intent, is accountable for own actions, behaves according to the PACT values. Acts as a good citizen of Winshuttle.

Knowledge

  • Certification in Microsoft Cloud Technologies, preferred
  • Familiarity with VSTS, Jira and Agile methodologies; includes epics, stories and daily standups with scrubbing
  • 5+ years’ experience with software and systems architecture
  • Experience with container and orchestration technologies such as Docker and Kubernetes is a plus
  • BA/BS degree in computer science or other related fields

Experience

  • Ability to script with PowerShell and to read and decipher C#. Experience with VSTS or TFS frameworks and with working within a fully engaged Agile/Sprint practice.
  • Experience monitoring solutions and performance with New Relic, OMS or other enterprise monitoring tools.
  • Deep understanding of cloud services (preferably Azure), multi-tenant SaaS platform design and supportability, and software security development integrated into the application build process.
  • Experience with managing availability, capacity, and security to provide the best ROI with given workloads and environmental requirements against the given resources.
  • Experience working with software engineering team members and taking ownership of translating customer and technical requirements into a service architecture that meets Quality of Service requirements.

This job posting does not imply that these are the only duties to be performed. Employees occupying this position will be required to follow any other job-related instructions and to perform any other job-related duties requested by their supervisor. To perform this job successfully, an individual must be able to perform each essential duty and meet the physical requirements satisfactorily. Reasonable accommodations may be made to enable qualified individuals with disabilities to perform the essential functions.