Lead DevOps Platform Engineer
What is the Opportunity?
We are on the lookout for a talented DevOps Platform Engineer to join our forward-thinking team responsible for the development and enhancement of our CI/CD deployment portal. This sophisticated platform is designed to facilitate the swift and secure deployment of applications to various cloud environments, supporting all RBC application developers. As a crucial member of our team, you will craft comprehensive solutions that streamline application delivery, improve operational efficiency, and leverage cutting-edge AI technologies to optimize and automate platform operations.
What will you do?
- Work collaboratively with cross-functional teams to conceptualize, design, and deliver innovative features for our deployment platform, incorporating AI-driven solutions to enhance automation, predictive analytics, and operational efficiency.
- Guarantee the performance, quality, and responsiveness of the platform, with a strong emphasis on Site Reliability Engineering (SRE) principles, including robust monitoring, alerting, and incident response practices.
- Uphold and advance the operational capabilities of the platform, ensuring an intuitive user experience while enabling seamless integration with AI tools to proactively identify and resolve potential issues.
- Design deployment and management patterns for the various tools on our DevOps platform, leveraging AI to optimize resource allocation and deployment strategies.
- Integrate with cloud services and infrastructure to guarantee secure and efficient application deployment, while exploring AI-driven approaches to enhance security and scalability.
- Formulate automated testing procedures to confirm platform stability and dependability, utilizing AI to improve test coverage and identify edge cases.
- Engage in code review processes and contribute to the collective knowledge by documenting technical procedures, including the integration of AI technologies and methodologies.
- Stay abreast of emerging development practices, AI advancements, and technologies, actively contributing to the ongoing enhancement of our technology stack and platform capabilities.
What do you need to succeed?
Must-Have Skills:
- 5+ years of working experience.
- Bachelor’s degree in Computer Science, Engineering, or in a field relevant to the role.
- Proficient with cloud-based services (e.g., AWS, Azure) and a strong grasp of developing cloud-native applications.
- Thorough understanding of Site Reliability Engineering (SRE) and best practices for running and maintaining critical systems, including monitoring, alerting, and incident management.
- Experience with architecting, implementing, and deploying systems into integrated environments.
- Knowledge of Continuous Integration/Continuous Delivery (CI/CD) methodologies and associated tools.
- Familiarity with container technologies like Docker and orchestration platforms like Kubernetes.
- Experience using AI tools and efficient prompting of LLMs.
- Solid understanding of version control systems, particularly Git.
- Strong analytical skills, problem-solving abilities, and excellent communication skills.
Nice-to-Have Skills:
- Experience with full stack development, including experience with frameworks and languages such as JavaScript, React, Node.js, Python, or similar.
- A focus on leveraging AI to improve operational efficiency and system reliability.
- Understanding of how AI can, improve operational efficiency, system reliability enhance, and CI/CD processes and experience working with models and MCP servers.
- A strong desire for continuous learning and a commitment to ongoing personal and professional development, particularly in the areas of AI and SRE.
What’s in it for you?
We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.
- A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable.
- Leaders who support your development through coaching and managing opportunities.
- Ability to make a difference and lasting impact.
- Work in a dynamic, collaborative, progressive, and high-performing team.
- A world-class training program in financial services.
- Flexible work/life balance options.
- Opportunities to do challenging work, including leveraging AI and SRE principles to drive innovation and operational excellence.
#LI-POST
#TECHPJ
Job Skills
Agile Scrum Development, Agile Scrum Development, Application Programming Interface (API), Building Architecture, CI/CD, Cloud Based Services, Continuous Delivery, Continuous Integrations, DevOps, Docker (Software), Dynatrace APM, Git, GitHub Actions, Information Technology (IT) Infrastructure, Kubernetes, Microsoft Azure, Programming Languages, Public Clouds, Site Reliability Engineering, Software Change Request Management, Software Development Life Cycle (SDLC), Software Engineering, Software Integration Engineering, Software Product Design, Software Product Technical Knowledge {+ 4 more}