1

Founding Data Scientist Jobs in Indiana (NOW HIRING)

This is a founding role: you will shape the data science function from the ground up, set technical direction, and own the end-to-end delivery of intelligent systems that define how our product ...

Since our founding in 2015, we have built a diverse portfolio of more than 20 drug development ... data * Co-design and deliver insight-generation and medical education activities, including ...

EHS Engineer

Jeffersonville, IN

$70K - $92K/yr

... our founding in 2001. Our commitment to sustainability is reflected in all parts of our ... Bachelor's degree in environmental engineering, Chemical Engineering, Environmental Science, or ...

EHS Engineer

Jeffersonville, IN · On-site

$70K - $92K/yr

... our founding in 2001. Our commitment to sustainability is reflected in all parts of our ... Bachelor's degree in environmental engineering, Chemical Engineering, Environmental Science, or ...

Staff Software Engineer

Francisco, IN · On-site

$263.25K - $310.04K/yr

... in Computer Science required; MS or PhD in CS, AI, or a related field is a meaningful plus ... data infrastructure * Experience as a founding or early-stage engineer at a startup that shipped at ...

New

next page

Showing results 1-20

Founding Data Scientist information

What are the key skills and qualifications needed to thrive as a Founding Data Scientist, and why are they important?

To thrive as a Founding Data Scientist, you need expert knowledge in statistics, machine learning, and data analysis, often supported by an advanced degree in a quantitative field. Familiarity with tools such as Python, R, SQL, cloud platforms (e.g., AWS, GCP), and experience with data pipeline and model deployment frameworks is typically required. Strong problem-solving abilities, entrepreneurial mindset, and the ability to communicate complex ideas clearly set exceptional candidates apart. These skills are crucial for building reliable data products from scratch, influencing company direction, and driving data-driven decision-making in an early-stage environment.

What are some unique challenges a Founding Data Scientist faces in an early-stage startup environment?

As a Founding Data Scientist in an early-stage startup, you often wear multiple hats, balancing hands-on model development with strategic planning and infrastructure setup. You'll likely be responsible for establishing data processes, selecting tech stacks, and setting up data pipelines from scratch, often with limited resources. Close collaboration with engineering, product, and leadership teams is essential, as your insights will directly influence business strategy. This role demands adaptability, strong communication skills, and a proactive mindset, as priorities can shift rapidly in a startup setting.

What are Founding Data Scientists?

Founding Data Scientists are data professionals who join a company at its earliest stages, often as one of the first technical hires. They are responsible for setting up the company's data infrastructure, developing initial machine learning models, and establishing best practices for data analysis. Their role is highly strategic, often collaborating closely with founders to influence product direction and data-driven decision-making. In addition to technical expertise, Founding Data Scientists need to be adaptable, entrepreneurial, and comfortable working in fast-paced, uncertain environments.
What cities in Indiana are hiring for Founding Data Scientist jobs? Cities in Indiana with the most Founding Data Scientist job openings:
Talent Network: Lead Data Scientist

Talent Network: Lead Data Scientist

Toptal

Remote

Full-time

Posted 6 days ago


Job description

About Toptal

Toptal is a global network of top talent in business, design, and technology that enables companies to scale their teams, on-demand. With $200+ million in annual revenue and team members based around the globe, Toptal is the world's largest fully remote workforce.

We take the best elements of virtual teams and combine them with a support structure that encourages innovation, social interaction, and fun. We see no borders, move at a fast pace, and are never afraid to break the mold.

Job Summary

We are looking for a Senior Data Scientist to join us as the first Data Scientist on a new product we are building. This is a founding role: you will shape the data science function from the ground up, set technical direction, and own the end-to-end delivery of intelligent systems that define how our product creates value. You will tackle open-ended problems involving Task Mining, Process Mining, behavioral workflow analysis, pattern discovery, predictive modeling, and applied GenAI/ML systems. The goal is not just to build models, but to turn raw interaction data into measurable product and business impact: discovered workflows, bottlenecks, optimization opportunities, and scalable foundations for future DS/ML work.

This is a remote position. We do not offer visa sponsorship or assistance. Resumes and communication must be submitted in English.

Responsibilities
  • Act as the founding Data Scientist on the product: define the DS strategy, choose the right tools and frameworks, and establish best practices.
  • Design and build Task Mining and Process Mining solutions that transform raw interaction data into discovered workflows, patterns, bottlenecks, and optimization opportunities.
  • Design, develop, and deploy ML systems and data pipelines for large-scale structured, unstructured, and event/interaction data.
  • Build predictive and pattern-discovery solutions using supervised and unsupervised learning, representation learning, sequence modeling, and LLM/GenAI approaches where appropriate.
  • Establish practical foundations for dataset construction, labeling strategy, offline/online evaluation, monitoring, feedback loops, and human-in-the-loop review where needed.
  • Own projects end-to-end, from problem framing and experimentation through production deployment and iteration. Collaborate closely with engineering on data instrumentation, pipeline design, deployment, and integration of production-ready services.
  • Communicate findings, tradeoffs, and technical concepts effectively to both technical and business stakeholders.
Qualifications and Requirements
  • 5+ years of professional experience in Data Science, Machine Learning, or Applied ML roles.
  • Demonstrated experience operating as the sole or lead Data Scientist on a product or team - owning problems end-to-end without senior DS supervision.
  • Strong experience with supervised and unsupervised ML, modern ML/data tooling, and the judgment to select the right approach for the problem.
  • Practical familiarity with representation learning, sequence modeling, Transformers, LLMs, or GenAI systems where relevant to product use cases.
  • Experience handling large-scale structured, unstructured, event, or interaction datasets.
  • Advanced proficiency in Python and SQL, with hands-on experience using tools such as PyTorch, scikit-learn, pandas/Polars, experiment tracking, and production ML workflows.
  • Experience deploying ML models, data pipelines, or intelligent systems into production.
  • Familiarity with Task Mining, Process Mining, event-log analysis, behavioral analytics, workflow automation, or adjacent domains.
  • Advanced degree in Computer Science, Data Science, AI, Statistics, Mathematics, or a related field is a plus; equivalent practical experience is strongly valued.
What We Are Looking For
  • A founder's mindset: full responsibility for outcomes, not just deliverables.
  • Comfort operating in high ambiguity: able to turn unclear product goals, noisy data, and incomplete requirements into an executable roadmap.
  • Strong business sense - connects technical work to commercial impact and measurable product value.
  • Pragmatic technical judgment - knows when to use advanced ML, when to simplify, and when better data, labeling, or evaluation is the real bottleneck.
  • Ability to build foundations for rapid scaling: reusable datasets, pipelines, metrics, evaluation frameworks, and modeling patterns future DS/ML hires can build on.
  • Highly proactive problem solver who acts without waiting for detailed instructions.
  • Excellent communication skills, with the confidence to push back constructively and propose direction.
Nice to Have
  • Previous experience as a first or early Data Scientist at a startup or new product line.
  • Direct experience with Task Mining, Process Mining, workflow intelligence, RPA, or productivity analytics.
  • Experience with LLMs and Generative AI applications, especially evaluation, structured outputs, semantic labeling, summarization, or human-in-the-loop workflows.
  • Experience working with privacy-sensitive behavioral, productivity, or user-interaction data.
  • Experience with product experimentation, causal inference, or measuring the impact of workflow/process interventions.
  • Knowledge of MLOps and distributed processing frameworks, such as Spark.
  • Experience with cloud environments, especially GCP.
apply for this job