Onsite
Full Time Posted 2 hours ago
Save Job

Job Details

Job Description

Site Reliability Engineering (SRE) is essential to Visa’s Cloud platform strategy. In this role, you’ll ensure our development platform and tools let engineers focus on innovation instead of infrastructure. You’ll promote observability best practices and automate resolution of recurring issues, working closely with software engineering teams to support security, availability, and performance. Responsibilities include triaging issues, collaborating on infrastructure management, and setting up monitoring for full coverage. Hands-on expertise is required, especially with major DevTools like GitHub, Jenkins, Jira, and Artifactory.

We seek a Software Engineer + SRE hybrid engineer. The ideal candidate deeply understands at least one major DevTool, quickly resolves tool-related issues in collaboration with developers, and applies systems thinking to maintain reliable applications and infrastructure while improving developer productivity.

Key Responsibilities:

  • DevTools Support You will be the primary point of contact for developers using tools like GitHub, Jenkins, Jira, or Artifactory.
  • Troubleshoot and resolve tool-related issues promptly to minimize developer downtime.
  • Maintain and optimize CI-CD pipelines and integrations for reliability and scalability.
  • Collaborate with development teams to improve workflows and automation.
  • Site Reliability Engineering Design, implement, and maintain systems for high availability, scalability, and performance.
  • Monitor and improve application reliability through proactive measures and incident response.
  • Develop and maintain observability solutions (metrics, logging, tracing).
  • Participate in on-call rotations and drive root cause analysis for incidents.
  • Collaboration & Continuous Improvement Partner with engineering teams to identify reliability risks and implement best practices.
  • Document processes, troubleshooting guides, and reliability of playbooks.
  •  Advocate for automation and self-service solutions to reduce operational overhead.

This is a hybrid position. Expectation of days in the office will be confirmed by your Hiring Manager.  ​


Qualifications

Basic Qualifications:
Bachelor's degree, OR 3+ years of relevant work experience

Preferred Qualifications:
Bachelor's degree, OR 3+ years of relevant work experience
Bachelor's degree in IT, CS or related field and-or 3+ Years Working Experience IT Operations and Delivery.
Experience: 3 years in SRE and-or DevTools support roles.
Beginner level programming and-or scripting in 2 or more of the following: Python, Java, Go, PowerShell, JavaScript, Terraform, Ansible, Helm, Chef, Cloud Formation.
Basic understanding of YAML, JSON, HTML, XML.
Hands on experience in Linux and -or Windows systems and good understanding of distributed computing environments.
2 years experience with CI-CD tooling such as Jenkins, Github, Bitbucket, ArgoCD, Artifactory, Azure DevOps in a large-scale environment
2 years experience with observability tooling such as Grafana, Prometheus, Splunk, Datadog, New Relic, DynaTrace, Sentry, etc. in a large-scale environment
2 years experience supporting relational and non-relational databases (MySQL, MongoDB, PostgreSQL, etc.), including creating and running queries, managing performance and scaling
2 or more years working in a Platform, SRE or Production Engineering group for high availability-critical platforms-applications
Experience managing a distributed container platform including but not limited to deployment-release management, provisioning, capacity management, workload management
Experience managing container infrastructure and supporting development transformation to a container first model.
This role requires oncall support as the team provides 24-7 operational support.
Technical Expertise: Proficiency in at least one DevTool (GitHub, Jenkins, ArgoCD, Jira, Artifactory, ).
Strong understanding of CI-CD principles and pipelines.
Solid knowledge of Linux systems, networking, and containerization (Docker-Kubernetes).
Hands-on experience with cloud platforms.
Programming-Scripting: Proficiency in Python, Ansible, or similar languages.
Mindset: Strong problem-solving skills, systems thinking, self-starter, and a passion for reliability.


Additional Information

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.


Company Details
VISA
 Foster City, CA, United States
Work at VISA

At Visa, we are driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid. As our products and... Read more

Mission
We're connecting diverse talent to big career moves. Meeting people who boost your career is hard - yet networking is key to growth and economic empowerment. We’re here to support you - within your current workplace or somewhere new. Upskill, join daily virtual events, apply to roles (it’s free!).
Are you hiring? Join our platform for diversifiying your team
Software Engineer
Save Job