Researchers uncover ethnoracial equity gap in training data for diabetes artificial intelligence

February 22, 2021

Share Post

Drs. Quynh Pham and Joseph Cafazzo
A commentary led by Drs. Quynh Pham and Joseph Cafazzo shows that the vast majority of research into AI-based diabetes interventions makes no mention of ethnic or racial training data—an important finding given that, among people with diabetes, those from ethnic and racial groups are more likely to have poor outcomes.

By Alisa Kim

From minimally invasive robot-assisted surgery, to “training” computers to detect breast cancer, the potential for artificial intelligence (AI) to transform health care is breathtaking. But what if the data used to develop some AI-based tools—like those that help a clinician predict whether a patient will go on to have a disease—are incomplete or unsuitable?

Researchers at the Institute of Health Policy, Management and Evaluation (IHPME) asked themselves this question. A commentary led by Drs. Quynh Pham and Joseph Cafazzo shows that the vast majority of research into AI-based diabetes interventions do not include or report on the inclusion of ethnic or racial training data—an important finding given that, among people with diabetes, those from ethnic and racial groups are more likely to have poor outcomes. The case study, titled, “The Need for Ethnoracial Equity in Artificial Intelligence for Diabetes Management” was published on Feb. 10, 2021 in the Journal of Medical Internet Research.

“A lot of artificial intelligence is done by training models on retrospective data and historically, those datasets poorly represent Canadians,” says Cafazzo, a professor at IHPME and executive director of the Centre for Global eHealth Innovation at University Health Network. “People who are associated with academic centres tend to get enrolled in research trials. We tend to gather data on them, but people who are more rural and our Indigenous populations don’t get asked to be in research trials so their data is never collected and therefore never incorporated into these models.”

Pham, who is an assistant professor at IHPME and scientist at University Health Network notes there are cultural and biological factors that make it harder for certain communities of colour to manage diabetes. About one in three Canadians has prediabetes or diabetes, which puts people at greater risk of heart disease, stroke, and kidney failure.

Pham, Cafazzo, and colleagues Anissa Gamble and Jason Hearn conducted a secondary analysis of a highly cited review paper that was published in 2018 called “Artificial Intelligence for Diabetes Management and Decision Support.” The 2018 review looked at research articles on diabetes interventions using AI that include applications for clinical decision support, identifying adverse events, self-management which prompt a person to make a lifestyle change for example, and tools that predict the risk of developing diabetes based on genetic or lifestyle factors.

The team found that of the 141 articles included in the 2018 review, 90% made no mention of the ethnic and racial makeup of the datasets used to inform AI algorithms. Only 10 of the articles in the original review reported ethnic or racial data, with the average distribution being 70% White, 17% Black and 4% Asian.

Skewed training information is problematic because of a concept called distributional shift. This means, for example, that a diabetes prediction tool that was developed using data from a group that is unlike the people on which it will be used will be flat-out wrong.

“If you train your model on a data set that looks nothing like the population that you intend to apply it to, you’re going to have a massive mismatch,” says Pham, who is the first author on the paper. “Some of these studies were 99% white populations—if you take that and apply it to Markham or Mississauga or Scarborough, obviously that’s not going to work because of the demographic makeup of those communities of colour.”

The researchers recommend using representative training datasets for digital health interventions to improve accuracy and generalizability. “I think the important thing for research, especially in Canada, is to have more inclusive prospective datasets to train these models on. It brings it back to, how inclusive do we want to be in research and be honest about our differences,” says Cafazzo.

They have also developed a tool—a set of five questions—for researchers to assess how they are collecting data and the ethnic and racial relevance of the AI algorithm. The goal of this work, says Pham, is to ensure equity is built into how health innovations are designed so that all communities can benefit from them. “In five years, AI-based interventions will be standard of care. They will be federated eventually to a level where everybody, to some degree, is accessing care where something has run through a model to assist a clinician in making a diagnosis or a judgement call. We want to make sure everybody is cared for equally.”

Related News

"Group photo of a diverse team of professionals smiling together against a modern blue backdrop, showcasing camaraderie and teamwork."

Bridging the Health Equity Gap for Older Women: The Impact of Women’s Age Lab

December 4, 2024

Faculty / Research

Read More
A professionally dressed woman in front of a building

Professor Audrey Laporte Re-Appointed as Director of IHPME 

November 27, 2024

Faculty

Read More
A professional headshot of a woman with shoulder-length dark hair, smiling and wearing a blazer. The background is a deep blue with graphic elements including a medical cross and 'AI' symbol, along with colored geometric shapes in blue, green, and purple in the corners. New research explores AI transformation in healthcare.

Connaught Award-Supported Publication Explores AI Transformation in Healthcare

October 25, 2024

Faculty / Research

Read More

Leading Digital and AI Innovations in the Master of Health Informatics Program

October 16, 2024

Education / Faculty / Students

Read More
Two people; a male and woman. The male is smiling wide dressed in a suit and tie. The woman is smiling warmly, and is wearing a dress. Both are recipients of CIHR Project Grants.

IHPME Research Teams Awarded CIHR Project Grants

October 15, 2024

Faculty / Research

Read More
A medical professional dressed in scrubs smiling warmly in front of a white background. Grant to Support Breast Cancer Research

Dr. David Lim Receives Grant to Support Breast Cancer Research

October 3, 2024

Faculty

Read More

Sign up for IHPME Connect.

Keep up to date with IHPME’s News & Research, Events & Program, Recognition, e-newsletter.

Subscribe to Connect Newsletter

Get in Contact


Communications

Marielle Boutin
Email Address: ihpme.communications@​utoronto.ca

Manages all IHPME-wide communications and marketing initiatives, including events and announcements.