PowerToFly

Staff Site Reliability Engineer - Linux, Containers, Kbs, Automation, GenAI
VISA

Onsite Bengaluru, India Full Time
Posted 2 hours ago

Job Details

Job Description

We seek an experienced IT professional to join us as a Staff Site Reliability Engineer in the Product Reliability Engineering function. In this role, you will:

  • Perform day-to-day site reliability engineering functions, including maintenance and incident resolution for all Debit applications, products, and services, covering the debit, prepaid, and risk lines of business.
  • Perform ongoing, proactive analysis of various debit authorization, API, and UI-based applications to detect potential problems, and actively engage in and facilitate discussions to find the best possible solution.
  • Work under the guidance of technical subject matter experts and be the point of contact for key DPS projects.
  • Work closely with service partners such as product development and engineering teams to seamlessly implement innovative solutions that improve reliability, scalability, and efficiency.
  • Contribute to automating routine tasks and processes to improve overall efficiency and reduce human error.
  • Actively participate in troubleshooting activities and SWAT calls, and drive investigations toward swift resolution.
  • Participate in Major Problem Review discussions, drive root cause analysis, identify gaps, and propose innovative preventive measures.
  • Mentor junior team members and foster a culture of continuous improvement in the team through retrospectives and open feedback.
  • Build comprehensive and robust documentation repositories that facilitate knowledge transfer among DPS PRE and DPS Global Operations peers.
  • Implement innovative GenAI and machine learning approaches to continuously optimize application reliability and efficiency.
  • Work with the observability team to design and implement modern Visa observability solutions such as anomaly detection, the Operations Intelligent Platform (OIP), and the Fault Isolation Tool (FIT) across all DPS products.
  • Provide on-call support in a 12*7 model.
  • Be self-motivated and have excellent interpersonal and communication skills.

This is a hybrid position. Expectation of days in the office will be confirmed by your Hiring Manager. 


Qualifications

Preferred Qualifications:

  • 8+ years of relevant work experience with a Bachelor’s Degree or at least 4 years of work experience with an Advanced degree (e.g. Masters, MBA, JD, MD) or 2 years of work experience with a PhD, OR 8+ years of relevant work experience.
  • 5 years of experience and proficiency in one or more programming languages such as Python, Java, .NET, C#, PowerShell, and Bash scripting.
  • 3 or more years of experience leading projects and key technical initiatives.
  • This role requires a high level of technical expertise, leadership skills, and a strong understanding of site reliability engineering principles and practices.
  • 5 years of experience and advanced proficiency in writing complex queries and working with SQL and MongoDB databases.
  • Prior experience working on CI/CD pipelines and tools such as Jenkins and Chef.
  • Prior experience partnering with product development team and evaluating application design for optimal reliability and resiliency. 
  • Prior experience with and strong understanding of networking concepts, protocols, and architecture. Advanced working knowledge of ITIL concepts and processes such as incident, change, and problem management, call triaging, and escalation procedures.
  • Prior experience with middleware components such as Kafka, Hazelcast, and Qlik.
  • Advanced proficiency and experience with container orchestration systems, particularly Kubernetes.
  • Experience with advanced monitoring, logging, and tracing tools such as Splunk, Prometheus, Grafana, and Riverbed for troubleshooting and performance tuning.
  • Basic understanding of AI frameworks and libraries to further enhance application resiliency and day-to-day operational tasks.
  • Prior experience building tools that automate production support activities and improve the efficiency and productivity of all operations groups. Prior experience working in a shift model in 24*7 environments.
  • Candidate should be comfortable communicating with technical and non-technical peer groups, including Account Management, Client Services, and other technical platform and application support groups. Strong work ethic, self-starter, and ability to work in a fast-paced, team-oriented environment.

Additional Information

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.


Company Details
VISA
 Foster City, CA, United States
Work at VISA

At Visa, we are driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid. As our products and...