Swahili Developers logo
All articles
Health AIMarch 28, 202611 min read

Lessons from deploying voice AI in Tanzanian hospitals

What changes when your transcription model has to work in a busy ward with three languages mid-sentence.

Swahili Developers

Published March 28, 2026

Code-switching is the norm, not the exception

In the cardiology ward at Muhimbili, a single doctor-patient consultation routinely shifts between Swahili, English medical terminology, and one of several regional languages. Off-the-shelf ASR systems collapse under this load.

Three things we changed

  1. Mixed-vocabulary acoustic modeling rather than language-detection switching
  2. Domain-specific fine-tuning on 1,200 hours of clinical audio (consented and de-identified)
  3. On-device inference so patient audio never leaves the hospital network

Results so far

Word error rate dropped from 41% (baseline) to 12% on real ward recordings. More importantly, doctors stopped switching the recorder off mid-consultation - the trust signal we actually care about.

Written by

Swahili Developers

Field notes from the team building Swahili-first AI across East Africa.

Work with us