The Largest Study on LLMs in African Healthcare

Creating a novel benchmark dataset of 25,000 question-answer pairs to rigorously evaluate 20+ LLMs on African healthcare, spanning 32 clinical specialties, contributed by 1k+ African clinicians across 15 countries

Funded by global partners
The Problem

Addressing LLM Bias in Global Healthcare

Training data bias: LLMs are trained on vast amounts of web data largely sourced from developed countries. Consequently, models are exposed to healthcare problems that are prevalent in these geographies. However, the burden of disease varies significantly in other parts of the world.

In Sub-Saharan Africa communicable diseases, maternal, and neonatal disease dominate while non communicable diseases are prevalent in the US. It is therefore important to investigate if LLMs trained on datasets sourced predominantly from data rich regions exhibit bias when applied in LMICS. see more

Although LLMs excel in standardized tasks, their application to complex African healthcare scenarios remains largely unmeasured. Language diversity, regional medical practices, and unique disease profiles necessitate tailored solutions.
AfriMed-QA

Transforming African Health with AfriMed-QA

AfriMed-QA creates a unique, inclusive, pan-African, multi-region, multi-institution dataset of 25,000 Africa-focused question-answer pairs to comprehensively evaluate LLM capabilities across diverse facets of African healthcare. It seeks to spotlight where these models excel, identify potential pitfalls, and explore scenarios where they could prove pivotal or potentially risky.

More than just a benchmarking exercise, this project stands as a guide for African academic, clinical, biomedical, and research communities. see more

It promises to equip them with invaluable insights into harnessing LLMs’ potential within African healthcare. By fine-tuning these models with locally relevant data and contextualizing their outputs, the initiative not only mitigates biases but also unveils untapped possibilities for improving patient outcomes.
Our Stats

25k Africa-Focused QA Pairs

5k

Expert Generated MCQ’s with Answers

10k

Crowdsourced MCQ’s and SAQ with Answers and explanations

10k

Consumer Health Queries with Clinician Answers

Leadership

Project leadership and Collaborating Organizations

Mercy Asiedu PhD

Research Scientist, Google Research.

Chris Fourie MD

Co-Founder, Sisonkebiotik

Tobi Olatunji MD

CEO, Intron

Abraham Owodunni

BioRAMP Research Lead, CS PhD Candidate, see more

Ohio State University

Wendy Kinara

Kenya Lead, FAMSA Liaison

Stephen Moore PhD

University of Cape Coast, GhanaNLP

Charles Nimo

CS PhD Candidate, Georgia Institute of Technology

Fola Omofoye MD

BioRAMP Clinical Lead and Adjunct Professor at more

UNC at Chapel Hill

Collaborating organization
Approach

From Ideation to Execution and Community Engagement

Methodology

Leveraging Crowdsourced Health Questions

The dataset creation will leverage professional medical questions– multiple choice
questions (MCQ) and short answer question (SAQ)– crowdsourced from exam prep
material popularly used by medical schools across Africa as well as consumer health
questions, similar to questions typed in a google search query. see more

  1. Crowdsourced MCQ and SAQ questions from clinicians and medical students
  2. Consumer health questions with clinician answers
  3. Expert generated MCQs
Data

Explore the largest African Medical QA dataset

This work is licensed under a creative commons Attribution-NonCommercial -ShareAlike 4.0 International License.

Github Repo

Access the data, benchmarking code, and results on github.

Request Commercial License

Want to use AfriMed-QA for commercial purposes? Contact us to obtain a commercial license.

Watch Video

Learn to benchmark open or closed LLMs on Africa dataset

Results

Benchmarking 20+ LLMs on 32 clinical specialties across 15 countries

Path to Impact

The dataset’s geographical and clinical diversity facilitates robust contextual evaluation of LLMs in African healthcare and provides a sufficiently large corpus to finetune LLMs to mitigate biases discovered.

It rigorously documents evidence in the context of African healthcare highlighting use cases or clinical specialties where LLMs shine as well as situations where they fall short or have a high potential for harm.

The project will be a timely and invaluable resource guiding the African academic, clinical, biomedical, and research communities on the utility of LLMs in African see more

healthcare at a scale that not only enables robust and rigorous LLM evaluation but provides a sufficiently large dataset to mitigate biases – by finetuning these  LLMs, adequately exposing model weights to African healthcare data in context. Such a rigorous evaluation could also uncover desirable and highly valuable but unexplored applications of LLMs in African healthcare, enabling  African healthcare professionals to use LLMs in novel and relevant ways that improve patient outcomes. 

The initial success of the project will stimulate broader research, eliciting more interesting questions on LLM behavior resulting in deeper investigation into specific clinical specialties of interest, demographic or sociocultural issues, as well as possible expansion of the dataset methodology to other domains beyond healthcare.

Work With Us

Your contributions make a difference

We are seeking clinicians, medical students, data scientists, data engineers, MLOPs engineers, data science students and enthusiasts. Your contributions make a difference in improving LLMs for use in African healthcare to improve patient outcomes: By contributing, you will be helping to reduce the burden on healthcare providers, allowing them to focus on delivering better patient care and improving health outcomes and strengthening healthcare in Africa. Your input will shape and influence the direction of our project and help us create a more effective and user-friendly platform.

Funded by global partners:

Contact Us