Skip to main
University-wide Navigation
To jump to a topic, click your choice below:
Protocol Submission AI FDA FAQs Glossary of Risk AI Video Series Additional Resources

 

AI Protocol Submission Guidance for Researchers

A quick way to think about how AI fits into your research protocol: What does the system do?; What data go into the system?; Where are the data stored and who has access?

The AI Protocol Submission Guidance for Researchers offers guidance to researchers conducting research that uses or develops Artificial Intelligence (AI). In this guidance, “AI system” refers broadly to any AI-based tool, model, or software used or developed as part of a human research study.

Not all sections will apply to every study. Researchers should use their judgment to determine which areas require AI-related explanation and what level of detail is appropriate. For example: A study using an AI transcription service to transcribe recorded interviews may only need to include the AI system name. Then, confirm whether the third-party vendor has access and maintains ownership of the data being transcribed and include this information in the informed consent.

AI Study Design Description

In the Study Design section of a protocol application, researchers are asked to describe and explain the overall study design. If using or developing AI, researchers should describe how the AI system fits within the study design. This information helps IRB reviewers determine whether the technology introduces additional risks or may require additional regulatory review or oversight. 

Use the following guidance to clearly describe the AI system and how it functions within the study design:

GENERAL INFORMATION

AI System Name and Version: Including the system name and version helps clearly document what system was used, which supports transparency, reproducibility, and review of the research. (e.g., “GPT-4.1,” “custom convolutional neural network v2.3”)

Whether the AI System is Being Used or Developed: Your description should reflect whether you are using an existing AI system or developing one as part of the research. 

  • Existing AI system is being used “as is”
  • Existing AI system is being used in a way that it was not intended to be used
  • A new AI system is being developed as part of the research

In general, studies using an existing system should focus on how the system is applied, while studies modifying the use of AI or developing AI should also describe how the system is built, trained, or updated.

For AI development, indicate what development stage the AI system is in (e.g. record review for data testing and training). 

Development stages relevant to protocol submissions: Design; Train; Test; Validate

What the AI System Does or What the AI System Will Do: Describe the AI system’s function in plain language:

  • Recruitment (e.g., pre-screening electronic records based on enrollment criteria)
  • Data transcription (e.g., focus group audio recordings)
  • Data collection (e.g., chatbot administering survey questions)
  • Data analysis (e.g., identifying patterns in imaging data)
  • Prediction or classification (e.g., predicting hospital readmission risk)

WHEN ADDITIONAL DISCLOSURE IS IMPORTANT

The following situations do not necessarily require additional detail, but they should be clearly disclosed in the protocol, as they may raise important ethical, regulatory, or oversight considerations.

General Use:

  • Direct Interaction with Participants: (e.g., an AI chatbot administers survey questions, asks follow-up questions based on participant responses, or conducts an interview in real time)
  • Participant Intervention: (e.g., participants wear a device that continuously collects physiological data—such as heart rate or activity level—and the AI system prompts them to increase the intensity of their activity in real time)

Non-Standard or Higher Risk Use:

  • Use Outside Intended Purpose: (e.g., repurposing a diagnostic system for screening in a new population)
  • Downstream Clinical Intent: Indicate whether the non-standard use of an AI system has a downstream intent to diagnose, treat, cure, mitigate, or prevent disease; and whether it may influence clinical care.
  • Use in a Clinical Trial: Clarify whether the AI influences endpoints, treatment decisions, or stratification. (e.g., in a drug trial, an AI system monitors patient response data and flags participants for dose adjustment or identifies when clinical endpoints—such as disease progression—have been met)

Developing or Integrating AI Systems in Clinical Care:

  • Role in Clinical Decision-Making: Explain if the AI system will be informing decisions (decision support) or making automated decisions
  • Regulatory Context for Development of an AI System: Indicate whether the software may meet the FDA definition of Device Clinical Decision Support Software, and reference applicable guidance when relevant. FDA Digital Health Policy Navigator Tool 
  • Adaptive or Evolving Algorithms: If the system updates during the study, describe how those changes are monitored or controlled.

COMMON PITFALLS

  • Describing the AI in purely technical terms without explaining what the system does or how it will be used
  • Not clarifying whether the system is new, modified, or pre-existing
  • Omitting whether the AI interacts directly with participants or influences care decisions

AI Data Description

AI Data Description: In the Data Collection section of a protocol application, researchers are asked to describe what data are collected and why. Here, researchers should focus on describing what collected data will go into the AI system. Because AI systems depend on data to function, the type and source of these data are key to understanding potential risks related to privacy, confidentiality, consent, and group harms.

Use the following guidance to clearly describe what data the AI system will access:

GENERAL INFORMATION

An AI data description should indicate whether:

Collecting Data from Living Individuals: (e.g., through surveys, chatbots, wearable devices)

De-identified Secondary Data: (e.g., commercial, licensed, or publicly available datasets used as input to the AI system)

Electronic Health Records or Imaging Data: Specify whether data are retrospective or prospective, and how they are accessed or integrated into the AI system.

Student Records: (e.g., educational data protected under FERPA or institutional policies)

Combining Data with Other Datasets or Auxiliary Information: (e.g., linking datasets, enriching with external sources, or merging across systems)

Known limitations or biases in the dataset(s): See AI Glossary of Risks

COMMON PITFALLS

  • Assuming “de-identified” data eliminates all privacy risk
  • Not describing how datasets are sourced, accessed, or combined
  • Omitting whether data originate from living individuals

AI Data Security

In the Records, Privacy, and Confidentiality section of a protocol application, researchers are asked to describe how data are stored and who has access. Because AI systems may process data in external or cloud-based environments, understanding how data move through and are controlled within these systems is essential for assessing risks to privacy, confidentiality, and unauthorized access.

Use the following guidance to clearly describe where the AI system is hosted and who may access, retain, or use the data:

GENERAL INFORMATION

Compliance with Institutional and Regulatory Requirements: Per AR 10.7, Protected Health Information (PHI) or Personally Identifiable Information (PII) should not be entered into a generative AI tool or other software unless the UK HealthCare InfoSec Data Sharing Committee has confirmed the tool is HIPAA-compliant and supports PHI

Where the AI system is Hosted: Clearly indicate whether the AI system is:

  • Hosted in a third-party or cloud-based environment, or
  • Running locally within a UK-managed or protected environment

If using a cloud-based or third-party system, describe whether data entered in the system may be stored, accessed, or used by the vendor (e.g., for service improvement or system training).

When AI systems are used within a UK-protected environment (e.g., UK Centered for Applied AI), both the system and the data remain under institutional control. A helpful way to think about this is with a Hotel California analogy. The AI system can “check in” to analyze the data, but it cannot leave the environment with the data.

A white building with trees and mountains

Data Access and Ownership: Indicate whether the AI vendor has access to the data and whether the vendor retains any ownership, usage rights, or licensing permissions over the data.

UK ITS/GRC: Indicate whether a data privacy review was conducted by UK Information Technology Services (ITS) Governance, Risk and Compliance (GRC) team.

Additional Data Security and Privacy Controls: Describe safeguards in place to mitigate AI-related risks (e.g., access controls, encryption, data minimization, monitoring)

AI Food and Drug Administration (FDA) Frequently Asked Questions (FAQs)

How do I find out if the software/application/algorithm I'm developing is a medical device?

Consult the FDA Digital Center for Excellence and use the Digital Health Policy Navigator to determine if subject to FDA oversight and applicable FDA legal and regulatory requirements.

How do I find out if the Clinical Decision Support (CDS) software/application/algorithm I'm developing is a medical device?

According to the FDA Digital Health Policy Navigator tool, the software/application/algorithm I'm developing may be subject to FDA oversight as a medical device. How do I classify or engage FDA to determine procedure for clearance or approval?

For formal classification of a product, a 513(g) request for information to the applicable Office of Health Technology (OHT). You may also engage the FDA or request feedback through the Pre-Submission program.

How is the FDA regulating the clearance and approval of Artificial Intelligence (AI) medical devices that use Machine Learning?

How do I determine what other laws and regulations apply to the software or mobile application I'm developing?

Use the Federal Trade Commission (FTC) Mobile Health App Interactive Tool for determining which laws and rules may apply.

AI Glossary of Risk

TERM DEFINITION
Anthropomorphism Also known as over-personification, anthropomorphism ascribes human features/characteristics to the model. This can lead to overconfidence in the model's performance and lax human oversight.
Bias AI bias occurs when data output perpetuates existing prejudices. Bias can be imbedded in discriminatory data training sets, or it can be introduced through subjective algorithm development. 
Data Drift Data drift is a reaction to statistical and characteristic changes in input data that the model is not trained to handle. The model cannot generalize beyond the training data. This can lead to off-purpose data output and performance decline.
Data Fusion Data fusion is the process of combining multiple data sources. The sources generally include raw data and produce false positives that can lead to inaccurate data profiles. Raw data is unlabeled data that has not been cleaned, organized, or summarized. Data fusion can also lead to re-identification.
Data Leaks Data leaks refer to intended and unintended exposure of sensitive, private, or proprietary data. Data leakage commonly refers to vendor access and ownership of protected data as a stipulation for use of the third party’s AI model. Leaks can occur at any point in AI use or development. 
Data Minimization Data minimization is the process of identifying and inputting the least amount of data points to fulfill the model's purpose. The intention is privacy protection. However, it can also lead to a loss of data that may limit results and impact accuracy. Another concern would be the potential for unintentionally including bias (e.g., excluding race, gender, or age data points).
Deepfakes A deepfake is audiovisual content intentionally altered to disseminate false information. Deepfakes can contaminate results when inputting unsupervised data from open AI sources. Contaminated results could potentially infiltrate supervised databases and/or peer reviewed publications. This could lead to perpetuating the falsehood and raise intellectual property concerns.
False Negative A false negative is a data output prediction/decision that incorrectly indicates an attribute/condition is not present when it is present. False negatives can lead to missed opportunities to participate in research in a pre-clinical effort to locate a target population. False negatives can also lead to a misdiagnosis in a clinical trial.
False Positive A false positive is a data output prediction/decision that incorrectly indicates an attribute/condition is present when it is not present. A high rate of false positives generally indicates biased training data. False positives can lead to carrying forward misguided prejudice as a result.
Hallucinations A hallucination is nonsensical and inaccurate data output from a Large Language Model (LLM). Underlying causes for hallucinating LLM responses include the lack of real-world context and insufficient or poor-quality data. Vague prompting combined with an expectation to “guess” can also produce hallucinations.
Interpretability Interpretability describes how a model makes a prediction or decision. In a model that combines different types of data, interpretability also includes how different data types interact.
Misclassification AI data classification is the process of sorting and labeling data inputted into AI models. Misclassification occurs when the AI model incorrectly sorts and/or labels data output.
Overfitting Overfitting is when a model performs poorly on new data because it memorized the training data. It occurs for a variety of reasons (e.g., insufficient training data, training too long on the same data, too much emphasis on noise in the data that is real-world uncommon).
Over-optimization Optimization is the process of adjusting the mathematical parameters of an algorithm to improve accuracy and reduce errors. Over-optimization occurs when an algorithm metric is too narrowly defined on a specific task. A hyper-tuned algorithm can develop blind spots that lead to misidentification or misunderstanding in a prediction or decision.
Over-reliance Over-reliance involves placing too much trust in the output of a model. This can cause performance errors, and that might otherwise be identified through more diligent human scrutiny.
Poisoned Data Poisoned data is incorrect, biased, or mislabeled data that can contaminate training sets. Data pulled from the internet, third-party platforms, and government can advance toxicity in the early stages of model development.
Re-identification Re-identification is a process that results in linking a de-identified data source back to the identity of the de-identified individual. It generally occurs when data sets are combined, allowing an AI model to match data points in de-identified data and other publicly available data.
Skewed Data Models trained on skewed data generate unequal predictions and decisions. Skewed data can produce higher/lower rate patterns of inconsistencies for a specific demographic population without justification. This can amplify already existing biases.
Synthetic Data Synthetic data is trained on real-world data for the purpose of generating statistically identical data. This can lead to the re-identification of the de-identified real-world data. It can also generate an inaccurate representation of the real world and reinforce demographic inequities.

 

AI Adventures in Human Subjects Research Video Series

The AI Adventures in Human Subjects Research Video Series offers researchers responsible-use strategies for mitigating the risk of employing AI in human subjects research. 

Episode 1. AI in Human Subjects Research

Episode 1. AI in Human Subjects Research (Cartoon- robot holding device hand)

Episode 1 provides the purpose for creating the video series, presents the beneficence conundrum, and introduces the data ancestors. [7:10]

View Ep. 1 AI in Human Subjects Research

Episode 2. Grandfather Data

Episode 2. Grandfather Data (Cartoon - device with hat, cane, glasses, arms, legs, and gray mustache)

Episode 2 offers strategies for mitigating the risk of inputting potentially biased historical data sets into AI models. [4:54]

View Ep. 2 Grandfather Data

Episode 3. Aunt Data

 

Episode 3. Aunt Data (Cartoon - device with arms, legs, purse, and makeup)

Episode 3 offers strategies for mitigating the risk of combining de-identified data with publicly available auxiliary information. [4:53]

View Ep. 3 Aunt Data

Episode 4. Sister Data

Episode 4. Sister Data (Cartoon - device with arms, legs, microphone, tambourine, and eyes)

Episode 4 offers strategies for mitigating the risk of mining open-source social media content that may or may not be publicly available. [6:43]

View Ep. 4 Sister Data

Episode 5. Uncle Data

Episode 5. Uncle Data (Cartoon - device with arms, legs, and mustache, dressed as jester with horn)

Episode 5 offers strategies for mitigating the risk of unintentionally including hallucinations or fabricated data in research involving human subjects. [6:10]

View Ep. 5 Uncle Data

Additional Resources

UK Center for Applied AI

Since 2023, the UK Center for Applied Artificial Intelligence has been at the forefront of AI, making investments in people and technology to empower others to explore AI and create meaningful solutions. Today, the Center is a specialized community who empowers faculty, staff, researchers, and clinicians to use AI to advance research, improve health outcomes, enhance student experiences, and drive productivity. Our team includes early and late-career software developers, project managers, data scientists, and advisors who guide our collaborators to overcome common AI barriers, such as technical expertise and access to secure compute resources. Without requiring a technical background or costly investments, we help our collaborators quickly turn ideas into prototypes and prototypes into solutions that make an impact for the communities they serve. Since our inception, we’ve supported more than 100 projects, have a network of 43 partners and over 120 individual collaborators. Please fill out our Collaboration Intake Form to connect. 

UK ADVANCE

Recommendations on the Use of Generative AI in Research and Scholarly Activity. UK ADVANCE offers guidance in response to frequently asked questions (FAQs) about the use of artificial intelligence from the UK research community.