You're using an older version of Internet Explorer that is no longer supported. Please update your browser.

Lead Site Reliability Engineer

Toronto, ON
Full Time
2 days ago
What is the opportunity?

We are looking to add a Lead Site Reliability Engineer (SRE) with a strong problem solving and engineering background to the Retail Banking & Payments Technology SRE organization. The team will work collaboratively with the application development arm of the organization and other IT partners required to succeed in its mandate. As the Lead Site Reliability Engineer, you will provide the leadership to an SRE team that will be responsible for successfully executing the strategy in transforming IT Operations. From Monitoring to incident response, SREs are focused on building and monitoring anything in production that improves service resiliency and reducing repetitive manual tasks.

What will you do?
  • Lead a squad in implementing SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing) while striving to reduce toil using automation tools
  • Perform code and non-functional (performance, security, maintainability) reviews of all production bound SRE solutions
  • Help drive transformation by continuously looking for ways to automate existing processes
  • Maintain technology currency (perform server patching, certificate renewal, etc.) with keen eye on automating opportunities
  • Run engineering mindset meetups accelerating breadth and depth of knowledge in community
  • Perform a production support role: proactive monitoring of environments, troubleshooting all systems and applications in scope, including off-hours support
  • Drive incident response: facilitate communication channels, develop and execute playbooks, meet SLOs, and coordinate within own squad and other application stakeholders to get to resolution
  • Assist in incident management and problem management for applications in scope

What do you need to succeed?

  • Production support experience who is able to effectively guide a team through the incident response process
  • Experienced people manager that prioritizes engineering while maintaining production resiliency and compliance standards
  • Hands-on experience in a variety of SRE languages and tools including Ansible, Dynatrace Managed, Moog, PagerDuty, ServiceNow, GitHub, Slack, Elastic, Logstash, Kibana, Blue Prism, Catchpoint
  • Software engineer experience with production class delivery, strong analytical mindset, communication skills, and sense of ownership / drive (SRE, DevOps, Cloud, Data)
  • Intermediate experience in a variety of environments including Cloud, distributed and mainframe, business workflows and services/APIs, databases
  • Experience with Agile (SCRUM) methodology

  • Experience with Docker, OpenShift
  • Knowledge of networking and security
  • Familiarity with performance engineering concepts

What's in it for you?

We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.
  • A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable
  • Leaders who support your development through coaching and managing opportunities
  • Work in a dynamic, collaborative, progressive, and high-performing team
  • Flexible work/life balance options
  • Opportunities to do challenging work
  • Opportunities to take on progressively greater accountabilities

Learn more about RBC Tech Jobs

Join our Talent Community
Stay in-the-know about great career opportunities at RBC. Sign upand get customized info on our latest jobs, career tips and Recruitment events that matter to you.

Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at .

City: Toronto
Address: 88 Queens Quay West
Work Hours/Week: 37.5
Work Environment: Office
Employment Type: Permanent
Career Level: Experienced Hire/Professional
Pay Type: Salary + Variable Bonus
Required Travel(%):0
Exempt/Non-Exempt: N/A
People Manager: Yes
Application Deadline: 04/11/2021
Platform: Technology and Operations

Req ID: 336342
Ad Code(s):
Information Technology