Lead Site Reliability Engineer- People Soft
RBC Capital
Job Description
What is the opportunity?
City National Bank (CNB), an RBC company, is seeking a Lead Site Reliability Engineer, who will be responsible for supporting CNB Corporate applications along with the implementation of Site Reliability Engineering solutions. As a Lead SRE, you will play a critical role in ensuring the reliability, scalability, and performance of key applications, balancing production support responsibilities with continuous improvement initiatives.
What will you do?
- Enhance and optimize system resilience, focusing on self-healing, auto-recovery, and proactive failure detection.
- Drive observability improvements by automating monitoring processes and integrating best-in-class tooling for real-time performance insights.
- Troubleshoot and resolve complex performance issues, ensuring seamless operations across data ecosystems and external vendor integrations.
- Collaborate with cross-functional teams to improve service availability, reliability, and scalability across critical business functions.
- Innovate and implement automation to reduce manual toil, enhance efficiency, and support continuous improvement in system reliability.
- Troubleshoot and fix Peoplesoft Financial Accounting business processes and batch jobs.
- Monitor, Manage and Optimize job scheduling workflows using CA Workload Automation (CAWA) ESP dSeries.
- Support and enhance file transmission processes across internal systems and external vendors using MessageWay and Moveit.
- Monitor, troubleshoot, and resolve complex incidents, ensuring system stability and high availability.
- Improve observability and alerting using Dynatrace, ELK.
- Automate operational tasks to enhance system reliability and reduce toil.
- Collaborate with cross-functional teams to improve system resilience and scalability across corporate functions.
- Provide support during off-hours as needed to ensure critical system uptime.
- Arrange tabletop exercises, practice Chaos Engineering principles to build resilience and better production support capabilities.
What do you need to succeed?
Must-have:
- 10+ years hands on IT experience in Peoplesoft Financial Accounting software, specifically in-depth knowledge of General Ledger, Accounts Payable and Asset Management. Ability to troubleshoot critical production incidents.
- Hands-on experience with monitoring and observability tools (Dynatrace, ELK).
- Strong troubleshooting skills with proven ability for high-severity incidents resolution. Hands-on experience with ServiceNow and PagerDuty.
- Knowledge of incident, problem, and change management processes, including patching, conducting root cause analysis, and facilitating blameless postmortems.
- Proven experience in leading and managing high-impact IT incidents in a fast-paced environment.
- Google SRE certification is preferred, with experience in implementing SLIs, SLOs, and SLAs for monitoring and reliability.
- Scripting and automation experience for reliability improvements.
- Exceptional communication abilities to coordinate with business and IT teams. Lead and manage remote teams with a focus to exceed client expectations.
Nice-to-have:
- 5+ years’ experience with job scheduling and workload automation tool, CA Workload Automation (CAWA) ESP dSeries is required. External certification is preferred.
- 5+ years’ experience in configuration and monitoring for Managed File Transfer solution, MessageWay / Moveit. Expertise in file transfer protocols (SFTP, FTP, FTPS, email-based transmission).
- Prior experience leading SRE functions in the financial services industry.
What's in it for you?
We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.
- A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable.
- Leaders who support your development through coaching and managing opportunities.
- Ability to make a difference and lasting impact
- Work in a dynamic, collaborative, progressive, and high-performing team
- A world-class training program in financial services
- Flexible work/life balance options.
- Opportunities to do challenging work.
- Opportunities to take on progressively greater accountabilities.
- Opportunities to building close relationships with clients.
#LI-POST
#TECHPJ
Job Skills
Agile Methodology, Group Problem Solving, IT Systems Integration, Organizational Leadership, Product Services, Software Development Life Cycle (SDLC), System Applications, System Integration Testing (SIT), Systems SoftwareAdditional Job Details
Address:
City:
Country:
Work hours/week:
Employment Type:
Platform:
Job Type:
Pay Type:
Posted Date:
Application Deadline:
Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above
Inclusion and Equal Opportunity Employment
At RBC, we believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all.
Join our Talent Community
Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.
Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at jobs.rbc.com.