Hippocratic%20ai logo

Senior Staff Research Scientist, Speech Technologies

Hippocratic%20ai

Posted about 5 hours ago

The Opportunity

Healthcare has a access problem — not enough clinicians, too many patients, and critical gaps that technology hasn't meaningfully closed. Hippocratic AI is changing that. Our safety-focused, healthcare-only LLM platform enables autonomous, clinical-grade conversations with patients at a scale no human workforce could achieve. With over $404M in funding, a $3.5B valuation, and partnerships with leading health systems, we're not iterating on existing tools — we're building a new category.

At the center of that platform is voice. As a Senior Staff Research Scientist in Speech Technologies, you'll lead the research and engineering that makes our AI not just intelligent, but genuinely conversational — accurate, fast, and trustworthy in the highest-stakes environments imaginable.

What You'll Accomplish

  • You'll define the ASR foundation for healthcare's most advanced conversational AI. From architecture selection to production deployment, you'll shape a speech recognition system built from the ground up for clinical accuracy — one that sets the standard for what's possible when safety and performance aren't trade-offs.

  • You'll solve speech recognition problems that don't have off-the-shelf answers. Medical terminology, diverse patient populations, real-world acoustic conditions — you'll design and validate models that perform where general-purpose ASR falls short, pushing the frontier of what conversational AI can reliably understand.

  • You'll build the data infrastructure that makes breakthrough models possible. You'll architect the pipelines and curation processes for large-scale medical speech datasets, creating the training foundation that gives Hippocratic AI a durable, compounding advantage in clinical speech recognition.

  • You'll bring research into the real world at meaningful scale. You'll close the gap between state-of-the-art methods and production systems — optimizing for latency, accuracy, and resource efficiency so that patients and clinicians experience the results of your work in every conversation.

Location Requirements

We believe the best ideas happen together. We're on the hunt for a great space in the Bellevue area — and when we find it, this role will be in office five days a week. In the meantime, you'll work fully remote with quarterly trips to our Menlo Park, CA headquarters to connect and collaborate with the team.

Responsibilities

  • Design, develop, and iterate on data-driven ASR models for streaming and non-streaming conversational speech applications

  • Research and implement state-of-the-art end-to-end speech recognition architectures tailored to the medical domain

  • Train, evaluate, and optimize ASR models across accuracy, latency, and resource utilization dimensions

  • Preprocess and curate large-scale speech datasets to support robust model training

  • Collaborate closely with LLM, product, and clinical teams to integrate speech technologies into the broader Hippocratic AI platform

  • Contribute to the team's research culture through experimentation, documentation, and knowledge sharing

Basic Qualifications

  • PhD with 7+ years of hands-on ASR research and engineering experience, or a Master's degree with 10+ years of industry experience in speech recognition

  • Deep experience designing and developing algorithms for accurate, efficient speech recognition in both streaming and non-streaming contexts

  • Proven track record training and optimizing ASR models for production — balancing accuracy, latency, and compute constraints

  • Experience preprocessing and curating large speech datasets for model training

  • Strong Python and C++ programming skills

  • Comfortable working in Linux/Unix command-line environments

  • Clear communicator — you can translate complex technical work for cross-functional partners

Preferred Qualifications

  • Hands-on experience building ASR systems from 0 to 1, including data pipelines, model architecture selection, and evaluation frameworks

  • Practical experience with ESPnet, Kaldi, and PyTorch

  • Experience with CUDA for GPU-accelerated training and inference

  • Familiarity with leveraging LLMs to enhance speech recognition quality

  • Experience with neural and end-to-end endpointer modeling

  • Publications in tier-1 venues in speech recognition or NLP (Interspeech, ICASSP, ACL, etc.)

What You'll Love About Hippocratic AI

  • The mission is real. Every improvement you make to speech recognition translates directly into better patient experiences — more accurate conversations, fewer errors, broader access to care for people who currently have none.

  • The team is exceptional. You'll work alongside researchers and engineers from Google, Meta, Microsoft, NVIDIA, and Stanford, as well as clinicians and health system leaders who keep the work grounded in what actually matters to patients.

  • The technical problems are genuinely hard. This isn't fine-tuning an existing API. It's building a purpose-built, safety-critical speech stack for one of the most complex and consequential domains in the world.

  • The strategic moment is now. With $404M raised, a $3.5B valuation, and real deployments across leading health systems, Hippocratic AI has both the resources and the momentum to define this category — and the equity upside to match.

Want to see the full job description?

Sign in to view the complete details and apply to this position.

Job details

Workplace

Office

Location

Bellevue, WA

Experience

SE

Similar

Jobr Assistant extension

Get the extension →