I'm Interested
powertofly approved What S&P Global Has to Offer:

We met with women at S&P Global to hear about the teams they're leading, the products they're building and how they integrate work with life.

Hear directly from Irina, Megan, Sameena and Meredith.

Job Details

Position Summary:

We are looking for an applied Senior Data Scientist to design and build out advanced NLP and document understanding solutions to enable our soon-to-be-launched digital transformation product which uses advanced knowledge engineering and AI to accelerate innovation in engineering, manufacturing, and scientific operations. The perfect candidates will have strong skills in deep learning, linguistics, and probability theory, a proven track record of collaborating and iteratively implementing data-intensive solutions, strong operational skills to drive efficiency and speed, and effective project leadership. You will be a part of an early-stage team. You will educate stakeholders, mentor team members, and have a significant stake in defining the future of our NLP automation for the product.

Job Responsibilities:
  • Design, build, and maintain production document understanding, entity/relation extraction and linking, and question answering systems enabled by deep/machine learning and semantic engines over industry-specialized documents
  • Work closely with machine learning engineers, micro-service developers, and data engineers to build out knowledge graph driven AI solutions incrementally and securely
  • Champion a cultural shift to MLOps best practices and adoption of the data science platform
  • Work closely with the product management and development teams to rapidly translate the understanding of customer data and requirements to product and solutions
  • Maintain an excellent understanding of the business’s long-term goals and strategy and ensures that the design and architecture are aligned with these
  • Define and manage SLA’s for data sets and processes running in production
  • Research and experiment with emerging technologies and tools related to NLP
  • Adhere to software engineering best practices including continuous delivery and version control
Ideal Qualifications:
  • Nuanced understanding of modern DL language representation techniques with the ability to design and implement novel DL solutions
  • Mastery of Tensorflow/Pytorch frameworks for implementing and optimizing deep learning algorithms for training quality and serving performance
  • Exposure to industry or academic research, particularly in deep learning and neural networks
  • Deep understanding of many different quality metrics (MRR, F1, NDCG, Recall/Precision, etc..) and the tradeoffs of each with the ability to design ensemble metrics which correlate to business KPIs
  • Ability to apply active learning and advanced labelling techniques to bootstrap datasets for lower capacity models (trees, linear, etc..)
  • Strong algorithms, data structures, and coding background with either Java, Python, C++, or Scala programming experience
  • Experience with software engineering standard methodologies (unit testing, code reviews, design document, continuous delivery)
  • Ability to conceptualize and articulate ideas clearly and concisely
  • Entrepreneurial or intrapreneurial experience where you helped lead the creation of a new product & organization
Nice to Have’s:
  • Experience working with knowledge graphs stores (Stardog, TigerGraph, Ontotext GraphDB, Neo4j) and surrounding semantic technology (OWL, RDF, SWRL, SPARQL, JSON-LD)
  • Experience with data pipeline and workflow management tools (AWS Data Pipeline, Apache Airflow, Argo, etc.)
  • Experience with stream-processing systems (ksqlDB, Spark Streaming, Apache Beam/Flink, etc.)
  • Understanding of the theoretical and practical tradeoffs of various NoSQL stores (Cassandra, Elasticsearch, DynamoDB, etc.) with respect to different read/write patterns and availability/consistency requirements
  • BA/BS or Masters in Computer Science, Math, Physics, or other technical fields
  • Experience with at least 10+ terabyte datasets, ideally up to multiple petabytes

We’re building a software solution that connects data in revolutionary ways, illuminating answers that were previously impossible to find and empowering our clients to envision the future so they can determine the best course of action in the present. Join us!


Equal Opportunity Employer:

S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment.

If you need an accommodation during the application process due to a disability, please send an email to: and your request will be forwarded to the appropriate person.

US Candidates Only:

The EEO is the Law Poster describes discrimination protections under federal law.

We're connecting diverse talent to big career moves. Meeting people who boost your career is hard - yet networking is key to growth and economic empowerment. We’re here to support you - within your current workplace or somewhere new. Upskill, join daily virtual events, apply to roles (it’s free!).
Are you hiring? Join our platform for diversifiying your team
Senior Data Scientist
I'm Interested