Senior Site Reliability Engineer

Main Location
Redmond, WA, United States

Senior Site Reliability Engineer, Digital Security & Resilience

The mission of Microsoft Digital is to power, protect, and transform Microsoft as the voice of our digital transition in the market.

​​​​​​As part of Microsoft’s Cloud + AI Group, we are responsible for building, managing, and securing the platform, products, processes, and services that powers Microsoft. We build, maintain, and implement a cloud-first approach to our technology and experiences, from custom-built business solutions developing our campus of the future and our productivity and collaboration experiences like Teams and SharePoint, to horizontal 3rd party solutions like SAP and Adobe. As a steward of Microsoft and our customer’s data, a core function of Microsoft Digital is ensuring the security of every aspect of the business. Microsoft Digital is responsible for company-wide information security and compliance, with a strategic focus on information protection, assessment, awareness, governance, and enterprise business continuity. Microsoft Digital’s charter is also to influence and work alongside engineers across the company and with strategic partners to build and grow their cloud products and services. As customer zero, we deploy these services inside Microsoft and then share best practices with enterprise customers at scale across the globe.  


We are looking for a Senior Site Reliability Engineer to be part of the core software engineering team for Liquid. Liquid is a data-centric cloud service that manages multiple terabytes of data and infuses artificial intelligence to support compliance and risk decision making across the Company, particularly focused on providing the automation and analytics behind many of Microsoft's commitments to security, privacy, and other areas of customer trust. In this role, you’ll work with a range of fun, modern cloud and data-centric technologies -- Azure app services (e.g., web sites), data systems (e.g., Kusto, Azure Data Lake and Cosmos DB) and Machine Learning technologies. You’ll have plenty of mobility across these areas so you can shine where you’re already strong but learn in new areas as well.


As a team we are proud of our approach to work/life balance: we believe that achieving compelling business results does not mean sacrificing our goals as people. If you are excited by the opportunity to work in a critical business area and build long-term value for Microsoft, then we want to hear from you!


Key responsibilities:

  • Drive reliability initiatives and contribute to development efforts to release new features ensuring E2E service reliability and a great customer experience.
  • Full engagement in agile-based software development including DRI rotation.
  • In-depth data analysis to identify service trends and make necessary adjustments and improvements.
  • Introduce and maintain continuity and recoverability capabilities.
  • Apply availability, performance, and scalability expertise to ensure Liquid services continue to meet partner expectations.
  • Constantly improve customer experience through quantitative service monitoring, alerting, and the use of data and operational dashboards
  • Use your love of software to make the Liquid service even more reliable and amazing and increase customers trust on Microsoft!

Knowledge, experience and skills:


Minimum Required Qualifications:

  • BS degree in Computer Science or related technical field involving coding, or equivalent practical experience.
  • 6+ years of experience as a software engineer or Site Reliability Engineer with proven track record to improve service health, security, customer experience and remove DevOps toil.
  • 4+ years of relevant software design and development in C#, .Net.
  • At least 1 year experience implementing and optimizing Build and Release pipelines incorporating infrastructure and configuration as code.
  • At least 3 years experience with Azure stack and data related technologies.
  • At least 1 year experience driving security assessments using STRIDE or similar approach. 

Preferred Qualifications:

  • Experience with Power Shell.
  • Demonstrated experience generating deep data insights including performance related data to drive decisions and improvements.
  • Familiarity with common software design principles.
  • Excellent communication and presentation skills.
  • Demonstrated ability to collaborate well, both technically and interpersonally.
  • Ability to balance competing demands and adapt to changing priorities.


Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.


Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

We're a community of women leveraging our connections into top companies to help underrepresented women get the roles they've always deserved. Simultaneously, we work to build truly inclusive hiring processes and environments where women can thrive and not just survive.
Are you hiring? Join our platform for diversifiying your team
Senior Site Reliability Engineer
Microsoft Corporation