Site Reliability Engineer

Full Time
Main Location
New York City, NY, United States

Fixed Income Technology is responsible for providing real-time, firm-wide risk and P&L for Fixed Income, Commodities, Credit, and FX. We run a service mesh on a Kubernetes cluster. If you join our team, you will help deploy and support the growing number of cloud and container services we run.

Principal Responsibilities:

  • Help team members deploy new services, including Kustomize manifests and build pipelines

  • Advise service developers on observability best practices, including metrics, logging, and tracing

  • Configure Istio for a microservices environment, including routing, mirroring, A/B deployments, and circuit breakers

  • Set up alerts with Prometheus and help troubleshoot in a multi-service environment

  • Help developers set up Skaffold environments and Docker images

Desired Qualifications/Skills:

  • At least 5 years of experience working with production environments

  • Experience with Kubernetes and Docker

  • Experience with Istio

  • Experience with Prometheus and Jaeger

  • Experience with Kustomize

  • Experience with Skaffold

  • Experience with Jenkins

  • Experience with Argo CD and Argo Workflows

  • Excellent troubleshooting and analytical skills

  • Self-starter able to execute independently, on a deadline, and under pressure

  • Excellent written and verbal communication skills

  • Experience with Python and Bash scripting

Millennium Management