Data Scientist, Analytics

Remote

Company Overview

harpin AI is a high-growth, seed-stage startup on a mission to close the gap between what your data holds and what your business can achieve. We help enterprise teams unlock the revenue potential of their existing CRM, loyalty, and support systems — without replacing a thing.

Built by a proven team with decades of experience in CX, marketing, and data systems, harpin AI was born from a clear market truth: businesses are sitting on massive amounts of valuable data — but most of it goes unused. Fragmented tools, messy data, and dashboards that don’t drive action create a wall between insights and outcomes.

We created harpin AI to tear that wall down.

Instead of another dashboard or CDP, we built a set of modular, AI-powered tools that activate the systems you already use — fueling faster decisions, smarter customer experiences, and measurable ROI. Whether it’s identifying at-risk customers, surfacing churn signals, or launching campaigns with validated segments, harpin AI turns data from a liability into a competitive advantage.

Our customers span retail, hospitality, and gaming, and use harpin AI to drive growth in days — not quarters. If you’re excited about joining a sharp, decisive, purpose-driven team that’s redefining what data can do, we’d love to meet you.

Founded: 2021
Funding: Seed-Stage, $6.5M raised through MK Capital
Revenue: Post-revenue
CEO: Scott Sahadi (founder), who has had successful exits from his last 3 startups
Current customers: Fortune 500 hospitality companies & small e-commerce brands
Industry focus: Hospitality, E-Commerce, Retail

Position Overview

Join harpin AI’s Data Science team to help shape the future of customer identity resolution. In this role, you’ll work closely with data scientists, engineers, and product leaders to design scalable data models, build reliable pipelines, and develop predictive models that power key business insights. You’ll own data quality and architecture within your domain, enabling self-service analytics through intuitive dashboards and robust metrics. From data exploration to machine learning, you’ll apply analytical rigor to solve complex problems and uncover actionable insights. Ideal candidates are hands-on, thrive in fast-paced startup environments, and are passionate about high-quality data and impactful analytics. Proficiency in Python, SQL, and Spark, along with a strong foundation in statistical analysis and dimensional modeling, is essential. If you’re excited to collaborate, move fast, and drive results through data, we’d love to hear from you.

What You Will Do

Understand data needs by interfacing with fellow data scientists and business partners.
Architect, build, and launch efficient & reliable new data models and pipelines in partnership with our Data Science and Engineering teams.
Perform data exploration, data analysis and machine learning model development.
Build out comprehensive metrics and dimensions to measure operational health and enable analysis and predictive modeling.
Design and develop dashboards to enable self-serve data consumption.
Become a data expert in your business domain, develop a deep understanding of how your data interacts with the rest of the business, and own data quality.
Recognize and adopt best practices in reporting and analysis: data integrity, test design, analysis, validation, and documentation.
Work with product leadership to evolve product positioning, roadmap, and use cases based on what we can (and cannot) practically develop.
Draw inferences and conclusions from processed data, create dashboards and visualizations, and identify trends and anomalies.

Qualifications

3+ years relevant work experience.
BS in Computer Science, Mathematics, Statistics, or Data Science.
Experience in a startup company environment is a plus.
Passion for high data quality and scaling/automating data science work.
Demonstrated command of data warehousing concepts.
Experience with exploratory data analysis, statistical analysis and machine learning model development.
Experience in schema design and dimensional data modeling.
Ability to perform basic statistical analysis to inform business decisions.
Ability to turn complex problems into simple solutions.
Track record of managing multiple projects simultaneously in a fast-paced environment.
Experience creating high performance algorithms for real-time systems.
Collaborative mindset and generative work culture – we do our best work together.
Strong proficiency in Python, Spark, and SQL, and the ability to work efficiently at scale with large data sets.
Flexible, nimble, and scrappy; startup mentality and willingness/ability to change direction quickly if best for the business.

Benefits

Stock Options
PTO
Paid Holidays
Medical, Dental, & Vision Benefits
401(k)