Job details
- Job Title: Senior Incident Operations & Optimization Specialist (Mainframe & Batch Focus)
- Job Level: C-13
- Department: Foundational Services - Production Operations
- Location: Chennai, India
The Senior Incident Operations & Optimization Specialist for Mainframe & Batch is a specialized technical leadership role requiring deep expertise in mainframe operations, batch job scheduling, and enterprise-scale processing environments. This position is critical to the success of the Incident Reduction Program, providing delivery of solutions which optimize and automate operations workflows.
You will be responsible for building automated incident remediation workflows and achieving measurable incident reduction through intelligent alert optimization, correlation, and automation while preserving the critical observability required for business-critical mainframe applications and batch processing. This role offers the unique opportunity to modernize event management for legacy systems using cutting-edge AIOps platforms and automation technologies.
Must to have 10-16 year of relevant experience into Mainframes Incident Operations & Optimization, batch job scheduling, enterprise-scale processing environments, delivery of solutions which optimize and automate operations workflows. building automated incident remediation workflows
Key Responsibilities- Incident & Alert Analysis: Conduct in-depth analysis of mainframe and batch processing alerts to identify chronic issues, reduce operational noise, and develop strategies to address high-volume incident generators, including recurring job failures.
- Intelligent Event Management: Design and implement domain-specific correlation, de-duplication, and suppression rules on AIOps and event management platforms. Develop logic that understands mainframe subsystem relationships and cascading batch job dependencies.
- Automation & Self-Healing: Architect and develop automation playbooks for incident data enrichment, automated job restarts, and self-healing capabilities for common mainframe and batch processing failures.
- Observability Enhancement: Assess monitoring gaps in mainframe and batch environments, proposing enhancements to ensure critical business processes have appropriate alerting coverage and align with enterprise standards.
- Cross-Functional Collaboration: Partner closely with mainframe operations, batch scheduling, and application development teams to validate correlation logic, define automation initiatives, and provide expert guidance on modern event management practices.
- Quality Assurance: Continuously validate the effectiveness of implemented rules and automation. Establish feedback loops with operational teams to conduct post-implementation reviews and iterative improvements.
- Education: Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related technical field.
- Experience: A minimum of 8+ years of hands-on experience in mainframe operations, batch processing, or enterprise workload automation.
- Event Management & Incident Reduction: Proven track record in event management, alert tuning, and incident reduction within complex mainframe and batch environments, with quantifiable results. Direct, hands-on experience with modern AIOps and event management platforms is required.
- Technical Expertise:
- Deep understanding of mainframe architecture, operating systems, and subsystems.
- Expertise in enterprise workload automation, including job design, scheduling, and dependency management.
- Automation & Scripting: Hands-on experience developing robust automation solutions using relevant scripting languages and modern automation frameworks.
- Data Analysis: Proficiency in log analysis, pattern recognition, and using query languages for data analysis on log aggregation platforms.
- Problem-Solving & Analytical Skills: Excellent analytical abilities with a systematic approach to troubleshooting complex batch dependencies and failure propagation scenarios.
- Communication & Leadership: Exceptional communication skills with the ability to bridge mainframe/legacy and modern technology teams, influence collaboration, and present technical concepts to diverse audiences.
- An advanced degree in a relevant technical field.
- Relevant industry certifications (e.g., Mainframe, Workload Automation, Automation, ITIL).
- Experience with mainframe modernization initiatives, DevOps, and CI/CD pipelines.
- Familiarity with specialized financial systems.
- Background in large-scale financial services or other regulated environments, including knowledge of disaster recovery and high-availability patterns.
------------------------------------------------------
Job Family Group:
Technology------------------------------------------------------
Job Family:
Infrastructure------------------------------------------------------
Time Type:
Full time------------------------------------------------------
Most Relevant Skills
Please see the requirements listed above.------------------------------------------------------
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.------------------------------------------------------
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
Get Weekly Job Offers
Be first to know when jobs open.