Doctolib

Machine Learning Engineering Manager - Voice Model - AI Teams (x/f/m)

Doctolib • Paris, Paris, France
JavaTypeScriptPythonKotlinSwift Hybrid/Remote

We are looking for a Manager to join the Voice Model ASR/STT(Automatic Speech Recognition/Speech-to-text) engineering team in AI & Clinical Products.

As an Machine Learning Manager, your mission will be to lead the team delivering the ASR backbone that powers our AI products, helping health professionals save time on documentation and focus more on patient care. You will be working in a feature team developing the speech recognition technology for Doctolib's AI-powered solutions including Consultation Assistant and Phone Assistant.

Working in the tech team at Doctolib involves building innovative products and features to improve the daily lives of care teams and patients. We work in feature teams in an agile environment, while collaborating with product, design, and business teams.

Your responsibilities include but are not limited to:

  • Own the ASR roadmap end-to-end: model design, training, evaluation, and product integration for medical-grade speech recognition
  • Lead and mentor a team of Speech-to-text experts; foster a high bar for research rigor, code quality, and operational excellence
  • Partner with MLOps to ensure training and inference pipelines are scalable, cost-efficient, and reliable in production
  • Collaborate with product, design, and clinical teams to translate user needs into measurable technical objectives
  • Drive continuous improvements to WER, medical term error rate, latency, diarization, domain adaptation, and multilingual performance

About our tech environment

  • Our solutions are built on a single fully cloud-native platform that supports web and mobile app interfaces, multiple languages, and is adapted to the country and healthcare specialty requirements. To address these challenges, we are modularizing our platform run in a distributed architecture through reusable components.
  • Our stack is composed of Rails, TypeScript, Java, Python, Kotlin, Swift, and React Native.
  • We leverage AI ethically across our products to empower patients and health professionals. Discover our AI vision here and learn about our first AI hackathon here!

Who you are

Before you read on — if you don't have the exact profile described below, but you feel this job description matches your skill set, we still encourage you to apply.

  • You have a Master's or Ph.D. degree in Computer Science, Data Science, or a related field
  • You have at least 5 years of experience in Machine Learning with deep expertise in Automatic Speech Recognition / Speech-to-Text (end-to-end or hybrid), including streaming STT and real-time constraints
  • You have hands-on experience with modern speech stacks: CTC/Transducer/Attention, Conformer/Whisper-style models, tokenizer/LM integration, diarization, and voice activity detection
  • You have strong PyTorch skills and production ML experience: model serving, monitoring, A/B testing, rollback, and incident response in partnership with MLOps
  • You are fluent in English

Now it would be fantastic if you have:

  • Experience with multilingual ASR, on-device or low-latency inference, telephony audio, or medical domain adaptation
  • Demonstrated leadership experience managing technical teams
  • A passion for pushing the boundaries of speech recognition and AI in healthcare

What we offer

  • Free comprehensive health insurance for you and your children
  • 25 days of paid vacation per year, plus up to 14 days of RTT
  • Free mental health and coaching services through our partner Moka.care
  • Work from abroad for up to 10 days per year thanks to our flexibility days policy
  • Lunch vouchers (Swile card) worth €8.50 per working day, with €4.50 covered by Doctolib
  • A subsidy from the work council to refund part of the membership to a sport club or a creative class
  • 50% reimbursement of your public transport subscription
  • Parent Care Program: receive one additional month of leave on top of the legal parental leave
  • For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
  • Relocation support in case of international mobility
  • Access to the best AI tools for coding, development and dedicated training

The interview process

  • HR Screen
  • Technical Deep Dive
  • System Design
  • Behavioral Interview
  • At least one reference check

Job details

  • Permanent position
  • Full-time
  • Workplace: Doctolib Paris office in Levallois-Perret
  • Work mode: hybrid 3 days/week in the office
  • Start date: as soon as possible

If you would like to find out more about tech life at Doctolib, feel free to read our latest Medium blog articles!

At Doctolib, we are committed to improving access to healthcare for everyone. This translates into our recruitment process. We evaluate candidates based solely on qualifications and motivation, without any form of discrimination.

The more diverse ideas are heard, the more our product will truly improve healthcare for all. You are welcome to apply to Doctolib, regardless of your gender, religion, age, sexual orientation, ethnicity, disability.

To ensure equal opportunities, we invite you to exclude personal information (e.g. pictures, age) from your applications. If you require any accommodation, please let us know for support during the hiring process.

Join us in building the healthcare we all dream of!

All information provided is processed by Doctolib for application management. For data processing details, click here.

Please contact hr.dataprivacy(at)doctolib.com for inquiries or to exercise your rights.