Job Details
As a member of the Infrastructure Reliability Engineering team, you are responsible for managing a team of system engineers focused on Kubernetes, microservices architecture, and cloud technologies. As part of this self-driven team, you will support critical container infrastructure and ensure the stability of services by performing dedicated maintenance activities. You will engage in automation activities, perform root cause analysis (RCA), and carry out remediation. The role requires knowledge of production support processes, including incident/change/problem management, call triaging, and critical issue resolution procedures.
Essential Functions:
- Infrastructure life cycle management and Production Support of container, cloud technologies and orchestration platforms
- Apply strong technical, analytical, and troubleshooting skills, and be able to explain technical concepts and provide guidance to staff.
- Develop and implement a comprehensive observability strategy to enhance the organization's ability to monitor, detect, and respond to potential issues proactively.
- Leverage data analytics to identify trends, patterns, and anomalies in system behavior, providing actionable insights for continuous improvement.
- Drive initiatives to enhance IT infrastructure resilience, scalability, and security.
- Provide strong leadership with a focus on attracting, motivating, and developing best-in-class talent. Mentor and coach teams to develop future leaders in alignment with company objectives.
- Balance both leading a team and engaging directly with the work needed to accomplish objectives. Assist direct reports with ongoing prioritization and resource allocation to ensure that the crucial business initiatives are delivered.
- Utilize leadership skills, problem solving and decision-making skills to facilitate and encourage participation of team members to meet objectives in congruence with approved standards and guidelines.
- Be a leader that continually raises the bar for others.
- Ability to operate in complex, highly secure, and highly available operations environments and interact with the technology domain experts required to maintain those environments.
- Excellent communication and interpersonal skills, including coaching other members of the support team and sharing technical and customer knowledge in a helpful and timely fashion.
- Responsible for partnering with the Platform, Engineering and Delivery Teams to deliver seamless infrastructure support for all Visa business lines.
- Work closely with geographically distributed teams on technical challenges and process improvements.
- Manage the security remediation process (vulnerability assessment and patch management).
- Responsible for adherence to established ITIL practices such as Incident, Change, Problem, and Release Management.
- Be scheduled on call to support the infrastructure and our systems.
This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.
Qualifications
Basic Qualifications:
8+ years of relevant work experience with a Bachelor’s Degree or at least 5 years of experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or 2 years of work experience with a PhD, OR 11+ years of relevant work experience.
Preferred Qualifications:
9 or more years of relevant work experience with a Bachelor's Degree or 7 or more relevant years of experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or 3 or more years of experience with a PhD
Minimum Bachelor's degree in Engineering, Computer Science, Information Systems, or a related field
Minimum Eight (8) years of directly related experience.
In-depth knowledge of IT infrastructure, observability tools, data analytics, and incident management processes.
Minimum 5 years of experience in System Administration
At least 5 years of experience with container and cloud technologies (AWS and GCP), with a focus on DevOps and service-based systems engineering.
Minimum 4 years of hands-on experience with Kubernetes (including the internal architecture of K8s).
Minimum 2 years of experience with AWS/GCP
Minimum 3 years of Scripting experience (Shell, Python, Ansible, Terraform and YAML packages)
Minimum 2 years of experience with traffic routing for microservices-based applications (e.g., Istio service mesh)
Experience with configuration management tools (Chef, Ansible, Terraform, etc.) is a must.
Experience working with tools in the Kubernetes ecosystem, such as Helm, kubeadm, CSI, and CNI, is a must.
Experience with CI/CD or GitOps pipeline architecture (e.g., ArgoCD, Codefresh, Jenkins) is a must.
Working knowledge of monitoring and logging tools: Prometheus, Grafana, Fluent Bit, Netcool, Humio
Exposure to AI platforms (OpenAI, Claude) or interest in AI-driven automation for Container operations.
Deep understanding of networking concepts
Additional Information
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.