AI Data Scientist
Software Engineering, Data Science
New York, NY, USA
Join Tech @ Ro to build the future of healthcare, from the ground up!
At Ro, we believe that when people achieve their health goals, they can achieve their life goals. The highest-leverage way to move society forward is to give people their health, and the current healthcare system isn’t built to do that. It was built to bill, not to serve patients.
We’re building a new system. One where the patient is in control. One designed from scratch for the digital age.
At Ro, technology isn’t just a function… It's core to how we deliver care. We’ve built a vertically integrated healthcare platform that connects telehealth, diagnostics, pharmacy, and logistics into a seamless, end-to-end experience for millions of patients.
…and we’re just getting started.
As part of Tech @ Ro, you’ll work on systems that operate at scale, with an opportunity to:
- Solve complex, high-concurrency problems across a full-stack platform
- Build and ship quickly with tight feedback loops and real-world impact
- Own systems end-to-end, from architecture to production performance
- Work alongside experienced operators, technical leaders, and clinicians
- Help define how modern healthcare should be delivered
We’re a performance-driven team with a strong sense of ownership and urgency. We move fast, learn quickly, and hold a high bar for what we build, and do so with a big heart — because patients depend on it.
If you’re motivated by impact, scale, and the chance to help lead the patient revolution, come build with us.
The Role
We're hiring an Applied AI Scientist to help measure, evaluate, and improve our AI systems. You'll answer one of the most important questions in applied AI: "Is this actually working?" You'll design evaluations, analyze production behavior, run experiments, and partner closely with engineers to improve the quality of AI-powered features.
You'll work on real production systems while learning modern evaluation techniques from experienced teammates.
What You'll Do
- Build evaluation datasets, rubrics, and synthetic test cases for LLM-powered features across patient and internal workflows.
- Analyze production logs to identify model failures, hallucinations, quality issues, and operational bottlenecks.
- Design and run experiments end-to-end, including hypothesis development, dataset creation, evaluation, analysis, and recommendations.
- Track key product and operational metrics including resolution rate, handle time, touches to resolution, latency, and quality and identify opportunities for improvement.
- Partner with engineers to validate improvements and productionize successful experiments.
- Help build or integrate tooling and dashboards that make AI performance easy to understand and monitor.
Who You Are
- 1–4 years in data science, analytics, applied ML, or a closely adjacent role.
- Strong Python and SQL skills.
- Hands-on experience building or evaluating LLM-powered applications through work, research, school, hackathons, or side projects.
- You’re curious about how AI systems work beyond the model itself, including retrieval, prompting, evaluation, and production behavior.
- You’re comfortable communicating analytical findings clearly to engineers, product managers, and operational stakeholders.
- You are excited about learning rapidly in a field where best practices continue to evolve.
- Bonus: Experience with evaluation tooling, A/B testing frameworks, production model monitoring, healthcare, or other operations-heavy environments.
A note on reporting structure
This is a new function at Ro, and we're being deliberate about not over-defining it. Your manager and where you sit on the org chart will depend on the specific shape of the team we end up with. We'd rather find the right people and figure out the lines around them than pre-draw boxes and miss great candidates. If that ambiguity is a deal-breaker, this isn't the right role; if it sounds like an opportunity, we want to talk.