The Largest Study on LLMs in African Healthcare

Creating a novel benchmark dataset of 25,000 question-answer pairs to rigorously evaluate 20+ LLMs on African healthcare, spanning 32 clinical specialties, contributed by 1k+ African clinicians across 15 countries

Funded by global partners

The Problem

Addressing LLM Bias in Global Healthcare

Training data bias: LLMs are trained on vast amounts of web data largely sourced from developed countries. Consequently, models are exposed to healthcare problems that are prevalent in these geographies. However, the burden of disease varies significantly in other parts of the world.

In Sub-Saharan Africa communicable diseases, maternal, and neonatal disease dominate while non communicable diseases are prevalent in the US. It is therefore important to investigate if LLMs trained on datasets sourced predominantly from data rich regions exhibit bias when applied in LMICS. see more

Although LLMs excel in standardized tasks, their application to complex African healthcare scenarios remains largely unmeasured. Language diversity, regional medical practices, and unique disease profiles necessitate tailored solutions.

AfriMed-QA

Transforming African Health with AfriMed-QA

AfriMed-QA creates a unique, inclusive, pan-African, multi-region, multi-institution dataset of 25,000 Africa-focused question-answer pairs to comprehensively evaluate LLM capabilities across diverse facets of African healthcare. It seeks to spotlight where these models excel, identify potential pitfalls, and explore scenarios where they could prove pivotal or potentially risky.

More than just a benchmarking exercise, this project stands as a guide for African academic, clinical, biomedical, and research communities. see more

It promises to equip them with invaluable insights into harnessing LLMs’ potential within African healthcare. By fine-tuning these models with locally relevant data and contextualizing their outputs, the initiative not only mitigates biases but also unveils untapped possibilities for improving patient outcomes.

Our Stats

25k Africa-Focused QA Pairs

5k

Expert Generated MCQ’s with Answers

10k

Crowdsourced MCQ’s and SAQ with Answers and explanations

10k

Consumer Health Queries with Clinician Answers

Leadership

Project leadership and Collaborating Organizations

Mercy Asiedu PhD

Research Scientist, Google Research.

Chris Fourie MD

Co-Founder, Sisonkebiotik

Tobi Olatunji MD

CEO, Intron

Abraham Owodunni

BioRAMP Research Lead, CS PhD Candidate, see more

Ohio State University

Wendy Kinara

Kenya Lead, FAMSA Liaison

Stephen Moore PhD

University of Cape Coast, GhanaNLP

Charles Nimo

CS PhD Candidate, Georgia Institute of Technology

Fola Omofoye MD

BioRAMP Clinical Lead and Adjunct Professor at more

UNC at Chapel Hill

Collaborating organization

Approach

From Ideation to Execution and Community Engagement

Phase 1 - Data Collection (Intron, FAMSA):

Recruit medical schools, students, and clinicians to contribute.
Set up the platform for data collection, storage, and quality review.
Collect consumer health queries and crowdsourced exam questions.
Expand the reviewer team to facilitate rigorous question-and-answer quality review.
Deliver the initial 10,000 dataset release and establish a GitHub repository for
collaboration.

Phase 2 - Benchmarking (BioRAMP, Intron, Google Research):

Prompt various LLMs using different strategies (zero-shot, few-shot, zero-shot instruct) and evaluate their responses.
Conduct qualitative evaluation of LLM responses through blinded review by clinicians annotating for reasoning, demographic bias, hallucinations, etc
Conduct quantitative evaluation of LLMs, including accuracy, semantic and n-gram overlap (e.g. BertScore, ROUGE)
Analyze the initial results and publish findings in a machine learning conference paper
Continue data collection for consumer queries and medical student questions.

Phase 3 - Fine-tuning (BioRAMP, Intron, Google Research):

Collect expert-generated exam questions and answers and official contributions from African medical schools.
Collect Clinician answers to consumer queries
Deliver the final 20,000 dataset release.
Finetune the best-performing open-source LLM (based on Phase 2 evaluation results) on the 20k dataset and release the AfriMed-QA [fine-tuned] LLM.
Publish a final journal paper documenting the full analysis and outcomes of the project.

Phase 4 - Community Engagement [Capacity building, Dissemination] (Sisonke, Masakhane):

Increase awareness of the dataset and study results through program partners, social media outreach, and campaigns targeted at various communities
Organize masterclass sessions walking community members through the dataset, how to evaluate LLMs, run inference, and train LLMs for their use cases
Leverage the dataset for a hackathon where community members put these skills to work, creating LLMs to solve real-world problems.

Methodology

Leveraging Crowdsourced Health Questions

The dataset creation will leverage professional medical questions– multiple choice
questions (MCQ) and short answer question (SAQ)– crowdsourced from exam prep
material popularly used by medical schools across Africa as well as consumer health
questions, similar to questions typed in a google search query. see more

Crowdsourced MCQ and SAQ questions from clinicians and medical students
Consumer health questions with clinician answers
Expert generated MCQs

Data

Explore the largest African Medical QA dataset

This work is licensed under a creative commons Attribution-NonCommercial -ShareAlike 4.0 International License.

Github Repo

Access the data, benchmarking code, and results on github.

Request Commercial License

Want to use AfriMed-QA for commercial purposes? Contact us to obtain a commercial license.

Watch Video

Learn to benchmark open or closed LLMs on Africa dataset

Results

Benchmarking 30 LLMs on 30+ clinical specialties across 15 countries

COMING SOON

Read the Preprint

Get the dataset

Path to Impact

The dataset’s geographical and clinical diversity facilitates robust contextual evaluation of LLMs in African healthcare and provides a sufficiently large corpus to finetune LLMs to mitigate biases discovered.

It rigorously documents evidence in the context of African healthcare highlighting use cases or clinical specialties where LLMs shine as well as situations where they fall short or have a high potential for harm.

The project will be a timely and invaluable resource guiding the African academic, clinical, biomedical, and research communities on the utility of LLMs in African see more

healthcare at a scale that not only enables robust and rigorous LLM evaluation but provides a sufficiently large dataset to mitigate biases – by finetuning these LLMs, adequately exposing model weights to African healthcare data in context. Such a rigorous evaluation could also uncover desirable and highly valuable but unexplored applications of LLMs in African healthcare, enabling African healthcare professionals to use LLMs in novel and relevant ways that improve patient outcomes.

The initial success of the project will stimulate broader research, eliciting more interesting questions on LLM behavior resulting in deeper investigation into specific clinical specialties of interest, demographic or sociocultural issues, as well as possible expansion of the dataset methodology to other domains beyond healthcare.

Work With Us

Your contributions make a difference

We are seeking clinicians, medical students, data scientists, data engineers, MLOPs engineers, data science students and enthusiasts. Your contributions make a difference in improving LLMs for use in African healthcare to improve patient outcomes: By contributing, you will be helping to reduce the burden on healthcare providers, allowing them to focus on delivering better patient care and improving health outcomes and strengthening healthcare in Africa. Your input will shape and influence the direction of our project and help us create a more effective and user-friendly platform.

Funded by global partners:

Help & Support

About Us

Curious about news Sign up for our newsletter!