医学影像大模型的演进、技术架构与临床展望评述

首页 > 按月查看>2026年第4月 >469-477

医学影像大模型的演进、技术架构与临床展望评述
DOI:
                        
作者:
                        
作者单位:
作者简介:
通讯作者:
基金项目:四川省自然科学基金项目（2024NSFSC0656）

Foundation models in medical imaging: a review of evolution, architecture and clinical prospects

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

摘要:

近年来，人工智能在医学影像分析领域正经历从“专用模型”向“基础模型”范式的转变。传统单任务模型高度依赖专家标注且缺乏跨任务泛化能力，而医学影像大模型（LMIMs）通过海量多模态数据自监督预训练，仅需少量微调即可适应多种下游任务，是迈向医疗通用人工智能的关键路径。本文系统评述了医学影像大模型的最新研究进展。首先，将现有模型分为视觉基础模型、视觉-语言大模型以及通用与智能体模型三大类。其次，深入剖析了核心架构（如大核卷积神经网络、Vision Transformer及其混合架构）、对比学习、掩码建模等预训练学习范式。最后，探讨了数据构建与跨中心泛化的落地挑战，重点梳理了其在肿瘤等重大疾病中的临床应用潜力，并对结合因果推理、检索增强生成等技术破解部署瓶颈进行了展望。综上，医学影像大模型代表了医学人工智能发展的重要里程碑，未来有望深刻变革诊断流程，提升诊疗质量与效率，最终惠及全球患者健康

Abstract:

In recent years, artificial intelligence in medical image analysis has been experiencing a paradigm shift from "task-specific models" to "foundation models". Traditional single-task models rely heavily on expert annotations and lack cross-task generalization capabilities. In contrast, Large Medical Imaging Models (LMIMs), pre-trained on massive multimodal data via self-supervised learning, can be adapted to a wide range of downstream tasks with only minimal fine-tuning, representing a critical pathway toward artificial general intelligence in healthcare.This article systematically reviews the latest research progress on LMIMs. First, existing models are categorized into three main classes: vision foundation models, vision-language large models, and generalist and agent models. Second, we provide an in-depth analysis of the core architectures (such as large-kernel Convolutional Neural Networks, Vision Transformers, and their hybrid architectures) and pre-training learning paradigms, such as contrastive learning and masked modeling. Finally, we discuss the practical challenges of data construction and cross-center generalization, highlight their clinical application potential in major diseases such as oncology, and provide perspectives on overcoming deployment bottlenecks by integrating technologies like causal inference and retrieval-augmented generation.In summary, LMIMs represent a significant milestone in the development of medical artificial intelligence, holding the promise of profoundly transforming diagnostic workflows, improving the quality and efficiency of clinical care, and ultimately benefiting global patient health

参考文献

相似文献

引证文献

引用本文

复制

文章指标

点击次数:
下载次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2026-04-17

引用本文

分享

文章指标

历史