Mastercard Biz Ops Site Reliability Engineer/Consultant in O'Fallon, Missouri
Who is Mastercard?
We are the global technology company behind the world's fastest payments processing network. We are a vehicle for commerce, a connection to financial systems for the previously excluded, a technology innovation lab, and the home of Priceless®. We ensure every employee has the opportunity to be a part of something bigger and to change lives. We believe as our company grows, so should you. We believe in connecting everyone to endless, priceless possibilities.
Biz Ops Site Reliability Engineer/Consultant
The Shared Components and Security Solutions BizOps team is looking for a Consultant - Business Operations Site Reliability Engineer for a new financial industry project. This position will be part of a key team that is designing this project from the start. Responsibilities include the following:
-Work directly with Development and Strategic Program Management (SPM) teams to ensure that all of the Biz Ops requirements are built into the project from the start
-Become the Subject Matter Expert (SME) in this space and prepare to train others as the team grows
-Direct the project through its life cycle and checkpoints to ensure that all Site Reliability requirements are met and the effort is ready to migrate into the "run" phase
-Are you a born problem solver who loves to figure out how something works?
-Do you have a low tolerance for manual work and look to automate everything you can?
-Are you passionate about preventing future customer-impacting incidents by ensuring things are built right, up front?
Business Operations is leading several transformation work streams at Mastercard through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product organizations. We need team members with an appetite for change and pushing the boundaries of what can be done with automation and planning/preparation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
We accomplish this transformation by supporting daily operations with a hyper-focus on triage and then root cause, grounded in an understanding of the business impact of our products. The goal of every biz ops team is to shift left, becoming more proactive and involved earlier in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Biz Ops teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. A biz ops Site Reliability Engineer also focuses on streamlining and standardizing traditional application-specific support activities and on centralizing points of interaction for both internal and external partners by communicating effectively with all key stakeholders.
Ultimately, the role of biz ops is to align Product and Customer-Focused priorities with Operational needs. We regularly review our run state not only from an internal perspective but also to understand how we can improve the customer experience of our applications, and we provide that feedback loop to our development partners.
For all team members:
Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation and refinement.
Analyze ITSM activities of the platform and provide a feedback loop to development teams on operational gaps or resiliency concerns.
Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
Practice sustainable incident response and blameless postmortems.
Take a holistic approach to problem solving by connecting the dots during a production event through the various technology stacks that make up the platform, to optimize mean time to recover.
Work with a global team spread across tech hubs in multiple geographies and time zones
Provide 24/7 on-call support of critical business infrastructure and applications. This involves performing support for scheduled maintenance activities and Level 1 & 2 support for issues.
At times, team members in this position are required to initiate vendor support to investigate complex scenarios, which often involves active working sessions with the vendor to troubleshoot and identify solutions. These situations are usually reserved for deep-dive troubleshooting events that involve researching system health and functionality.
Conduct detailed analysis during issue investigation and determine the best path to resolution.
Provide consultation for application infrastructure design and assist with determining adherence to established standards and best practices.
Work hand in hand with the global team in collaboration and knowledge-sharing events such as instructional email communications and regular training meetings.
Assist in providing detailed post-incident summaries on impacting events for Senior Management and Team Members. These summaries describe the issue, the actions taken (both right and wrong), observations, and any recommendations for changes that would improve awareness of future events or provide permanent resolutions.
BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
Experience with algorithms, data structures, scripting, pipeline management, and software design.
Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
Ability to help debug and optimize code and automate routine tasks.
We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.
Experience in one or more of the following is preferred: C, C++, Java, Python, Go, Perl or Ruby.
Interest in designing, analyzing and troubleshooting large-scale distributed systems.
We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
Full-stack knowledge, including a deep understanding of network protocols and of how various protocols (HTTP, WebSockets, gRPC, TCP sockets, IP, UDP, DNS, TLS) fit into the OSI model.
Understanding of principles of availability & consistency.
Experience with multiple persistence patterns and the ability to ensure high performance, scale and reliability configurations.
Mastercard is an inclusive Equal Employment Opportunity employer that considers applicants without regard to gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.
If you require accommodations or assistance to complete the online application process, please contact firstname.lastname@example.org and identify the type of accommodation or assistance you are requesting. Do not include any medical or health information in this email. The Reasonable Accommodations team will respond to your email promptly.
Requisition ID: R-72145