DevOps Reliability Engineering

Full Time Posted 28 days ago
Main Location
New York City, NY, United States
Open jobs

DevOps Reliability Engineering is a production-oriented discipline focused on improving service availability, latency, scalability, performance, and efficiency for technology products in Morgan Stanley.  Our core infrastructure processes hundreds of millions of transactions and we serve assets of more than a trillion dollars daily. If this scale resonates with you, come join us. 

We are transforming ourselves to grow SRE teams as part of our expansion.

We would like to talk to you if you:  

  • Are interested in distributed systems and working with high scale services. 
  • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. 
  • Enjoy new technological challenges and solving hard problems. 
  • Believe that a team working well together is truly smarter than the single smartest person on that team. 
  • Aspire to grow as a person, as a teammate, and as an engineer. 
  • Have Grit, drive and a deep feeling of ownership.

Your Responsibilities Include But Are Not Limited To:

  • You will use your expertise to tune and push our systems beyond their normal limit.
  • You will work closely with engineering/development teams to design, build, and maintain systems and help them decide on products to use, schema design and query tuning.
  • You will troubleshoot issues across the entire stack: hardware, software, application and network.
  • You will mentor other SREs on standard methodology for everything from monitoring to troubleshooting complex code and database issues.
  • You will identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services.
  • You will need to spend 50% of your time on and around production support.
  • Represent the SRE organization in design reviews and operational readiness exercises for new and existing services.
  • Participate in on-call rotation and periodic conference calls with other specialists from other time zones. #LI-JSR


Successful candidates have often had some or all of the following:

  • Background in Computer Science equivalent to a B.Sc. Equivalent practical experience is a reasonable substitute.
  • Awareness of, and ability to reason about modern software & systems architectures, including load-balancing, queueing, caching, distributed systems failure modes generally, micro services, and so on.
  • Any experience on automation/configuration management systems like Puppet, Chef, Ansible is an advantage.
  • Experience in software development: automation-related experience valued in particular. Scripting languages such as bash, python, ruby, or compiled languages such as C, C#, JAVA, Scala and Go are most relevant but others are acceptable. One higher level language is desired.
  • Experience with source code and binary repositories, build tools, and CI/CD (Git, Artifactory, Jenkins, Docker) etc and data streaming technologies like Spark, Kafka etc.
  • Hands on experience on enterprise tools set such as Grafana, Dynatrace, AppDynamics, and BMC etc.
  • Awareness of, and ability to reason about modern software & systems architectures, including load-balancing, queueing, caching, distributed systems failure modes generally, micro services, and so on. 
  • Deep understanding of operating system level concepts such as processes, memory allocation, and the network stack; understanding of how applications are affected by the above, and ability to debug same. 
  • Generally speaking, practical experience running large scale online systems is always an advantage. #LI-JSR
We're a community of women leveraging our connections into top companies to help underrepresented women get the roles they've always deserved. Simultaneously, we work to build truly inclusive hiring processes and environments where women can thrive and not just survive.
Are you hiring? Join our platform for diversifiying your team
DevOps Reliability Engineering
Morgan Stanley Technology