Help us maintain the quality of jobs posted on PowerToFly. Let us know if this job is closed.
Job Details
Experience Level: Experienced Hire
Categories:
- Engineering & Technology
Location(s):
- Quay Building 8th Floor, Bagmane Tech Park, Bengaluru, IN
- Develop a deep understanding of the product supported, business function and technical architecture, down to the application code and data
- Perform in-depth log analysis to identify and diagnose issues in production systems
- Utilize monitoring tools like Splunk/Data Dog, Grafana, CloudWatch etc. to quickly zero-in on application and infrastructure issues
- Collaborate with product teams by providing technical findings from Production incidents and assist in determining the root cause and resolution
- Assist efforts to resolve high impact incidents by providing technical direction on the triage call and working with business, application, and other technical teams
- Develop logging standards and identify areas of improvement in monitoring, application stability, and speed of determining root causes
- Drive initiatives to improve efficiency and quicker issue detection and resolution times
- Identify opportunities for automation and be a relentless champion to reduce manual and repetitive tasks
- Ensure quality and timely communication is maintained with business and technology stakeholders on critical issues
- Develop excellent working relationships with teammates and stakeholders across business and technology
- Be proactive, laser focused on execution and promote a culture of continuous improvement
- BS degree in Information Systems, Computer Science, Computer Engineering or equivalent
- 7+ years of solid work experience in IT and Application Support / Technology Operations
- Deep, hands on experience with log analysis and root cause identification
- Sound experience with monitoring tools like AppDynamics/Grafana, DataDog/Splunk, or CloudWatch and ability to configure alerts and dashboards
- Knowledge of .Net / Java and AWS cloud native application development along with knowledge of database technologies like PostgreSQL, Oracle or Sybase is required
- Knowledge of AWS or comparable cloud hosting technologies
- Ability to troubleshoot Cloud based systems and Linux and coordinate with technical SMEs
- Exhibits a strong sense of urgency for high severity incidents. Able to assess the customer impact and provide tactical solutions
- Good understanding of distributed systems architecture including database, middleware, server and container-based infrastructure etc. would be a plus.
- Knowledge of Python and scripting is a plus
- Some experience with GitHub and Jenkins is desired
- Hands-on experience with Incident and problem management
- Excellent problem-solving skills
- Excellent verbal and written communication skills
About the Company
Moody's
New York City, NY, United States
In a world shaped by increasingly interconnected risks, Moody's helps customers develop a holistic view of these risks to advance their business... Read more