Senior Site Reliability Engineer (SRE)

Remote
Full Time
Main Location
United States
PagerDuty is interviewing, onboarding, and working 100% virtually at this time. As we look to the future, we plan to be ‘distributed by design’, meaning unless your job requirements make it necessary to be in a PagerDuty office, you may choose to work in-office, remotely, or hybrid. We’re focused on inclusion and employee well-being by building a culture that isn’t location specific, and gives equal opportunity to everyone—regardless of where you are working.

Overview
PagerDuty is a digital operations management platform that empowers the right action, when seconds matter.
For the teams who build and run digital systems, PagerDuty is the best way to manage the urgent, mission-critical work that is essential to keeping digital services always on. We make it easy to handle any unplanned task, event, or opportunity, right away.

We’re growing fast and looking for ambitious people who share our values and customer-devotion mindset to join our high-caliber team.

Do you relish the opportunity to design, build and run mission-critical applications? Do you want to get the attention of hundreds of thousands of engineers and technology leaders around the globe 24/7 so they can fix problems? Yes? Then read on to find out more about what makes PagerDuty a great place to be an Engineer!

As a Senior Site Reliability Engineer on our Infrastructure team, you’ll be part of a group that’s intensely focused on our customers and the engineering community. Whether it’s provisioning, continuous integration/deployment, monitoring, or cloud platform management, SREs provide the foundation upon which the PagerDuty product is built and architect its future.
How You Contribute To Our Vision: Key Responsibilities
  • You partner with Engineering stakeholders to design and deliver a reliable, scalable, secure, and performant platform
  • You continuously strive to improve the customer experience: Full lifecycle support (creation, development, deployment, retirement), observability, flexible connectivity, and monitoring
  • You stay current on technical trends in order to suggest innovative tools and approaches to interesting problems
  • You share your expertise with the entire Engineering organization
  • You participate in a 24/7 on-call rotation. And yes, we use PagerDuty to manage our on-call schedules
About You: Skills and Attributes
  • You have solved multiple problems by writing code to automate your way out of them and have a passion for replacing manual processes time and time again with your code
  • You have been responsible for running critical services that multiple customers depend upon. You understand the importance and impact that operational optimization can have on a product and the positive ripple effects that it can have across an entire organization
  • You believe CI servers, push-button deploys, time-series datastores, metrics dashboards, and centralized logging are not just “nice to haves,” they are critical pieces of infrastructure that rapidly pay for themselves. You are familiar with the tool-space and can suggest products in each of these areas
  • You are empathetic: You take others’ opinions into account and clearly communicate your thoughts to reach technical solutions quickly
  • You consider it important to understand and appreciate your customers, and enjoy seeing your work improve the work of others
Minimum Requirements
  • Excellent knowledge of a dynamic language like Ruby or Python
  • Experience working on cloud-native infrastructure (e.g. AWS, GCP, Azure) including managed services such as AWS EC2, S3 and other storage options, RDS, IAM, etc.
  • Experience with Docker in a production environment including container orchestration (e.g. Nomad, Mesos, Kubernetes, etc.)
  • Experience in automating releases, continuous integration/delivery systems and relevant tools (e.g. Jenkins, CircleCI, Travis CI, Buildkite, etc.)
Preferred Requirements
  • Experience with infrastructure as code (Terraform or CloudFormation)
  • Experience with monitoring, observability and logging platforms (e.g. DataDog, New Relic, SumoLogic, Splunk, etc.)
  • Knowledge of configuration management systems like Ansible, Chef or Puppet
PagerDuty Offers:
- Competitive salaries and company equity
- ESPP (Employee Stock Purchase Program)
- Retirement plan with company match
- Comprehensive benefits package from day one 
- Company-paid parental leave - up to 22 weeks for pregnant parent, up to 12 weeks for non-pregnant parent
- Paid vacation time - 3 weeks accruing in first year, 4 weeks accruing every year after
- Paid holidays and sick leave
- Paid employee volunteer time - 20 hours per year
- Bi-annual company-wide hack weeks
- Mental wellness programs

About PagerDuty
PagerDuty, Inc. (NYSE:PD) is a leader in digital operations management, serving over 13,800 customers and 700,000 users in 90 countries, including 60% of the Fortune 100. We are led by CEO Jennifer Tejada, women make up 50% of our board of directors, and 45% of our managers are from underrepresented groups. We are a proud member of the Pledge 1% Movement, committed to donating 1% Equity, 1% Employee time, and 1% Product to accelerate change in our communities.

At PagerDuty, we believe you do your best work in a culture that fosters inclusion, well-being, and innovation. As a Dutonian, you will have ample opportunities to advance your career and connect with colleagues: virtual all hands calls, learning & development programs, bi-annual hack weeks, volunteering events, ERGs (employee-run groups focused on cultivating a sense of belonging for all) - there’s something for everyone. Learn more on Instagram, @pagerdutylife.

From how we build our teams to who sits in the boardroom, we hope you can see yourself at PagerDuty.

Additional Information
PagerDuty is for people, meaning we extend opportunities to a broad array of candidates, including those with diverse workplace experiences and backgrounds. Whether you're new to the corporate world, returning to work after a gap in employment, or simply looking to transition or take the next step in your career path, we are excited to connect with you.
PagerDuty is committed to creating a diverse environment and is an equal opportunity employer. PagerDuty does not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, parental status, veteran status, or disability status.
PagerDuty is committed to providing reasonable accommodations for qualified individuals with disabilities in our job application process. Should you require accommodation, please email accommodation@pagerduty.com and we will work with you to meet your accessibility needs.
Our stewardship of the data of many thousands of customers means that a background check is required to join PagerDuty. We will, nonetheless, consider for employment qualified applicants with arrest and conviction records in a manner consistent with local requirements.
PagerDuty uses the E-Verify employment verification program.
To all recruitment agencies: PagerDuty does not accept agency resumes. Please do not forward resumes to our jobs alias, PagerDuty employees or any other company location. PagerDuty is not responsible for any fees related to unsolicited resumes.