Senior Linux Systems Administrator (2)
Onsite
Sydney, Australia
Full Time Posted 28 days ago
Job Details
Job Description
The Team
As a key member of the Systems Administration team within Global Cloud Operations, you will be responsible for administration and operations of the global cloud infrastructure that runs our SaaS product. This is an opportunity to be at the core of running a Cloud SaaS platform that scales to millions of users! The Cloud Operations team is responsible for availability and efficiency of the server infrastructure that runs our SaaS platform, while consuming and deploying products that have been newly developed by engineering teams. You will be working closely with engineers and developers across the company.
Responsibilities
- Contribute to Configuration Management and Infrastructure as Code for ServiceNow’s global private cloud.
- Develop tools in Python, Bash, and JavaScript to replace manual work and improve the customer maintenance experience.
- Drive enhancements and bug fixes for large-scale automation projects such as patching, provisioning, and kickstart domains.
- Design and implement procedures to carry out maintenance where automation and tooling cannot, and drive resolution of root causes with internal team members.
- Prepare new ServiceNow products and services for production readiness with design review, feedback to engineering teams, training, and testing.
- Use broad knowledge and experience of systems administration and networking principles to proactively prevent and address incidents while constantly improving documentation.
- Participate in escalations and Root Cause Analysis of issues in both Australian regulated markets and global Commercial infrastructures.
- Troubleshoot database backup and restore failures, and perform database migrations.
- Support operation of a wide variety of infrastructure services including Machine Learning and Prediction, Kafka and RabbitMQ messaging, database encryption, email infrastructure at scale, DNS, Puppet, Elasticsearch, F5 BIG-IP, and more.
Qualifications
- A strong background in systems administration and engineering, and an understanding of the components of a cloud infrastructure including hardware platforms, OS, applications, databases, networks, web and application servers.
- Prior experience in Site Reliability Engineering/DevOps and managing large-scale server infrastructure in a cloud computing or MSP setting is highly desirable.
- Solid experience with Linux (RedHat and derivatives like CentOS)
- Working level knowledge of at least one: Python, Bash, Ruby, JavaScript
- ServiceNow development experience is desirable.
- Previous direct exposure to administering fundamental internet services (DNS, Mail, Apache/Tomcat) with a good understanding of the LAMP stack.
- Familiarity with administering MySQL, MariaDB, Postgres or similar technologies; proficiency preferred.
- Strong experience with service troubleshooting in a production environment covering web front-end, systems, databases and networks.
- Familiarity with networking technologies such as routing, switching and load balancing (VPN exposure is a huge plus).
- Experience with systems and network performance and availability monitoring and analysis as well as configuration management platforms (Nagios/Icinga, SNMP, Puppet, Ansible, Splunk) is desirable.
- Understanding of the ITIL v3 framework and how it applies to incident, problem and change management.