Help us maintain the quality of jobs posted on PowerToFly. Let us know if this job is closed.
Job Type
Full Time
Job Details
Location: Americas, Pacific Time Posting Date: May 8, 2024 Hi there! We're seeking a talented Site Reliability Engineer to join the Developer Enablement team at Zapier. It’s our mission to make it easy for all engineering teams at Zapier to confidently operate healthy and reliable services. We will achieve this mission by advancing Zapier’s approach to observability and incident management. We know we have a lot of competition for your skills. If you’re wondering what things would be like at Zapier, read on about:
- Our Commitment to Applicants
- Culture and Values at Zapier
- Zapier Guide to Remote Work
- Zapier Code of Conduct
- Diversity and Inclusivity at Zapier
- You’re an experienced technologist. You’ve spent 4+ years working on multiple projects in SaaS companies in the world of systems engineering or software development.
- You know what great observability looks like. You’ve seen the value of comprehensive visibility into a system's internal states through rich, actionable, and timely insights, enabling quick identification and resolution of issues. You’re accustomed to detecting and resolving problems before customers notice.
- You know the cloud. You’ve participated in the design or maintenance of highly available, cloud-based infrastructure in AWS or another cloud offering. You understand how to leverage infrastructure as code tools and have learned best practices for reliability and observability. We use tools like Terraform, Kubernetes, Redis, GitLab, and Datadog, among others.
- You can code. You have experience with languages like Python or Go to create automated tools. You believe in hands-off deployments and infrastructure as code. Well-honed expertise with the fundamentals of software development goes a long way here.
- You can solve complex systems challenges. You enjoy complex challenges, understand how to improve performance, and help uncover opportunities for improvement. You’ve worked on problems where “just throw more hardware at it” isn’t enough for the system to scale.
- You’re a great communicator. Not only do you know how to share your knowledge with the team and document things well so they can be consumed asynchronously (we do this a lot as a remote company), but you know how to communicate effectively with software and support teams.
- You value our values. At Zapier, our values are at the heart of how we collaborate and how we think about our customers. In our remote setting, they help develop trust and ensure we work and collaborate together to democratize automation. You see how these values can empower meaningful work, you thrive in a collaborative setting, you are eager to continue growing and excited to be part of the team.
- Evaluate and recommend new tools and technologies that enhance our observability and reliability capabilities, ensuring that we are equipped to effectively serve our customers.
- Collaborate with service teams to resolve complex infrastructure issues and design challenges, ensuring decisions support scalable and reliable service delivery.
- Implement site reliability principles to diagnose and address systemic sources of unreliability, enhancing system stability and reducing recurrence of issues.
- Develop internal tools and systems that enhance the observability and reliability of applications, helping engineering teams to deliver high-quality software more efficiently.
- Build and continuously improve features and services that support robust system operations, including incident management processes that automate solutions to ensure system resilience and recovery.
- Engage in proactive learning from system failures to build more robust and resilient systems, preventing future issues and improving our overall infrastructure health.
About the Company
Zapier, Inc.
United States
Zapier is on a mission to make automation work for everyone so that every person and every business can move forward faster. Zapier is the leader... Read more