Posted a month ago

Connecting the world to wellness

MINDBODY emerged from the simple idea that small business owners deserve the time to focus on what matters most: their customers. Our software has transformed that vision into the world's leading wellness services marketplace, linking hundreds of thousands of passionate health, wellness and beauty professionals to the millions of clients they serve.

MINDBODY is a cloud-based business management software company for the wellness services industry.

We serve about 35 million consumers located in 130 countries & territories.

At MINDBODY, work-life balance takes on a new meaning for us. When your life goals & values align with the work you do every day, balance is second nature.

We help inspired business owners seamlessly succeed & individuals all over the globe lead healthier, happier lives with our technology.



The Big Data Engineer II provides strong data engineering platform and database tooling, and handles the coding, execution and delivery of the data layer for the Data Science core engines, with a focus on large, industry-scale B2B and B2C datasets. The Big Data Engineer II expands and optimizes our data and data pipeline architecture, and optimizes data flow and collection for cross-functional teams. This role supports software developers, database architects, data analysts and data scientists on data initiatives and ensures that an optimal data delivery architecture stays consistent across ongoing projects.

  • 5+ years of experience in Data Science/Machine Learning
  • Bachelor’s Degree in Computer Science Engineering, Mathematics, Applied Sciences, Statistics or equivalent experience required
  • Advanced SQL experience working with relational databases, including query authoring, as well as working familiarity with a variety of databases
  • Experience with big data tools: Hadoop, Spark, Kafka, etc.
  • Experience with relational SQL and NoSQL databases, including Postgres and Cassandra
  • Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
  • Experience with AWS cloud services: EC2, EMR, RDS, Redshift
  • Experience with stream-processing systems: Storm, Spark-Streaming, etc.
  • Experience with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
  • Experience building and optimizing ‘big data’ data pipelines, architectures and data sets
  • Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
  • Strong analytic skills related to working with unstructured datasets
  • Experience building processes that support data transformation, data structures, metadata, dependency and workload management
  • A successful history of manipulating, processing and extracting value from large disconnected datasets
  • Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores



  • Provides hands-on execution and implementation of data science models
  • Works under general supervision to identify and analyze problems
  • Weighs relevance and accuracy of information
  • Uses professional, fundamental concepts, practices and procedures of the artificial intelligence and machine learning discipline
  • Uses data to support own point of view when interacting with the team
  • Provides information, analysis and recommendations in support of the machine learning team
  • Takes actions that are consistent with goals and objectives
  • Creates and maintains optimal data pipeline architecture
  • Assembles large, complex data sets that meet functional / non-functional business requirements
  • Identifies, designs, and implements internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Builds the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies
  • Builds analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency and other key business performance metrics
  • Works with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs
  • Keeps our data separate and secure across national boundaries through multiple data centers and AWS regions
  • Creates data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
  • Works with data and analytics experts to strive for greater functionality in our data systems
  • All other duties as assigned



•        Duties for this position are performed under limited supervision

•        You will be responsible for planning and organizing your own work, which may or may not be directly related to general business operations of the company or its customers

•        You will receive training and guidance from your manager as needed

•        Individual contributors may be required to regularly exercise discretion and independent judgment with respect to matters of significance depending on the nature of the position

•        There is no direct management responsibility for the position



•        You will need dexterity of hands and fingers to operate a computer keyboard

•        This position is mostly stationary, and you will be required to remain stationary for extended periods of time

•        Specific vision abilities required by this position include close vision, color vision, and the ability to adjust focus

•        The noise level in the work environment is usually moderately quiet

Senior Big Data Engineer (Pune/Remote) - Mindbody