Job details
Job Description
Qualifications
Additional Information
We seek an experienced IT professional to join us as a Staff Site Reliability Engineer, working in the Product Reliability Engineering function who will:
- Perform day-to-day site reliability engineering functions including Maintenance and incident resolution for all Debit applications, products, and services – including debit, prepaid and risk lines of business.
- Perform ongoing/Proactive analysis of various debit authorization, Api and UI based applications to detect potential problems and actively engage & facilitate the discussion to find the best possible solution.
- Work under the Guidance of technical subject matter experts and be point of contact for key DPS projects.
- Work closely with service partners such as product development, engineering teams to seamlessly implement the innovative solutions to improve the reliability, scalability, and efficiency.
- Contribute towards automating the routine tasks and processes to improve overall efficiency and reduce human errors.
- Actively participate in troubleshooting activities and SWAT calls and drive investigation towards swift resolution.
- Participate in the Major Problem Review discussions, drive the root cause analysis, identify the gaps, and come up with innovative preventive measures.
- Mentor junior team members and foster a culture of continuous improvement in the team through retrospectives and open feedback.
- Build comprehensive and robust documentation repositories that can facilitate knowledge transfer among DPS PRE and DPS Global Operations peers.
- Implement innovative GenAI and machine learning trends to continuously optimize the application reliability and efficiency.
- Work with observability team to design and implement the modern visa observability solutions such as Anomaly detection, operations intelligent platform (OIP), Fault Isolation tool (FIT) across all DPS products.
- Provide on-call support in 12*7 model.
- Self-motivated, and have excellent interpersonal and communication skills.
This is a hybrid position. Expectation of days in the office will be confirmed by your Hiring Manager.
Qualifications
Preferred Qualifications:
- 8+ years of relevant work experience with a Bachelor’s Degree or at least 4 years of work experience with an Advanced degree (e.g. Masters, MBA, JD, MD) or 2 years of work experience with a PhD, OR 8+ years of relevant work experience.
- 5 years of experience and Proficiency in one or more programming languages such as Python, Java, .NET, C#, PowerShell, Bash scripting.
- 3 or more years of experience leading the projects, key technical initiatives.
- This role requires a high level of technical expertise, leadership skills, and a strong understanding of site reliability engineering principles and practices.
- 5 years of experience and advanced proficiency in writing complex queries and working with SQL and mongo databases.
- Prior experience working on CI/CD pipelines and tools like Jenkins, chef etc.
- Prior experience partnering with product development team and evaluating application design for optimal reliability and resiliency.
- Prior experience and Strong understanding of networking concepts, protocols, and architecture. Advanced working knowledge of ITIL concepts & processes such as incident/change/problem management, call triaging, escalation procedures and such.
- Prior experience with Middleware components such as Kafka, Hazelcast, Qlik etc.
- Advanced proficiency and experience with container orchestration systems, particularly Kubernetes.
- Experience with advanced monitoring, logging, and tracing tools such as Splunk, Prometheus, Grafana, riverbed etc., for troubleshooting and performance tuning.
- Basic understanding of AI frameworks and libraries to further enhance the application resiliency and day to day operational tasks.
- Prior experience with building tools to automate production support activities that enable efficiency and productivity of all operations groups. Prior experience working in shift model in 24*7 environments.
- Candidate should be comfortable communicating with technical and non-technical peer groups, including Account Management, Client Services, and other technical platform and application support groups. Strong work ethic, self-starter, ability to work in fast-paced, team-oriented environment.
Additional Information
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.
Get Weekly Job Offers
Be first to know when jobs open.