
Job Information
MathWorks Senior Program Manager - Site Reliability Engineering (SRE) in Natick, Massachusetts
Senior Program Manager - Site Reliability Engineering (SRE)
Job Summary
Apply Now
Job:20013-BWAL
Location: US-MA-Natick
Department:Program Management
As a Senior Program Manager, you will initiate and execute programs that will deliver high standards of reliability for MathWorks Online Products. These programs and initiatives will help us prevent incidents, achieve our SLOs/SLAs, and meet our operational quality goals that are strategic to the success of our online products. You will partner with Product Owners, Developers, Platform Engineering/DevOps, and Site Reliability Engineers to define and implement tools, processes, standards, and best practices to plan, build and run highly reliable Online Products.
Are you someone who is strategic, values collaboration, is passionate about process improvement and can motivate others towards achieving a shared vision? If so, then you may be the person we are looking for!
Responsibilities
Establish a shared vision and goals for achieving world class reliability for our Online Products by partnering with the right stakeholders. Create and manage program roadmaps, SMART plans, and milestones
Define and implement communication plans that address the needs of all stakeholders. Provide periodic status updates to the steering team and other stakeholders on the health of the program(s)
Define and implement tools and processes for problem management. Collaborate effectively with various stakeholders to investigate problems, identify root causes, and implement countermeasures to prevent incidents. Continuously identify opportunities for process improvement and lead the effort to design and implement them
Proactively identify risks and issues; define and implement mitigation strategies
Define process and results KPIs to measure the health of the program and associated projects
Minimum Qualifications
- A bachelor's degree and 7 years of professional work experience (or a master's degree and 5 years of professional work experience, or a PhD degree, or equivalent experience) is required.
Additional Qualifications
Experience with managing cross-organizational programs focused on building and running highly available and reliable online/SaaS products
Experience in defining and managing incident management and problem management tools and processes
Knowledge and application of Site Reliability Engineering, Platform Engineering, and DevOps framework and concepts like Observability, Reliability, Availability, and Performance
Ability to influence others even when you do not have direct authority over them
Expertise in process improvement and change management. Experience applying concepts like Root Cause Analysis, Reflection, A3, and Hansei for problem solving.
Ability to communicate effectively, both oral and written with senior management
Experience using work management and collaboration tools like JIRA, Confluence, SharePoint, and Microsoft Teams