大语言模型在多重耐药菌感染防控领域的应用与评估
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

R181.3+2;R197.323

基金项目:

2024年辽宁省教育厅基本科研项目(LJ112410159078)


Application and evaluation of a large language model in the prevention and control of multidrug-resistant organism infection
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 文章评论
    摘要:

    目的 评估大语言模型ChatGPT-4o在多重耐药菌感染防控领域的知识能力,探讨其在医院感染控制领域的应用前景。方法 基于指南和专家共识,构建涵盖5类问题(基础概念、传播风险识别、感染控制实践、抗菌药物管理、监测评估)的专业知识问答数据集,输入ChatGPT-4o生成回答,并组织4名医院感染管理领域专家采用Likert五点评分法对模型输出结果进行评估,对比分析大语言模型在多重耐药菌感染防控专业领域的准确性、完整性和可用性。结果 在50道问题中,ChatGPT-4o总体回答的正确率为58.0%,部分正确率为40.0%,不正确率为2.0%。针对基础概念类问题,ChatGPT-4o表现最佳,正确率达86.0%;在抗菌药物管理类问题中表现相对较差,正确率为40.0%。准确性、完整性和可用性的得分均值分别为4.63、4.70、4.57分。结论 ChatGPT-4o在多重耐药菌感染防控领域展现出良好的基础知识应答能力。尽管其在复杂感染控制情境中存在内容泛化、决策支持能力不足等局限性,但未来大语言模型在医院感染控制领域具有广泛的应用前景。

    Abstract:

    Objective To evaluate the knowledge capability of the large language model (LLM) ChatGPT-4o in the field of multidrug-resistant organism infection prevention and control, and explore its potential applications in healthcare-associated infection (HAI)control. Methods A professional knowledge question-and-answer dataset covering 5 categories (basic concept, transmission risk identification, infection control practice, antimicrobial stewardship, surveillance evaluation) was constructed based on guidelines and expert consensus. The dataset was input into ChatGPT-4o to generate responses. Four experts in the HAI management field evaluated the model’s output results using a 5-point Likert scale. The accuracy, completeness, and usability of LLM in the field of multidrug-resistant organism infection prevention and control were compared and analyzed. Results Among the 50 questions, ChatGPT-4o achieved an overall accuracy rate of 58.0%, a partially accuracy rate of 40.0%, and an inaccuracy rate of 2.0%. ChatGPT-4o performed best on "basic concept" questions, with an accuracy rate of 86.0%, and worst on "antimicrobial stewardship" questions, with an accuracy rate of 40.0%. The average scores for accuracy, completeness, and usability were 4.63, 4.70, and 4.57 points, respectively. Conclusion ChatGPT-4o demonstrates an excellent response capability for basic knowledge in the field of multidrug-resistant organism infection prevention and control. Although it has limitations such as content generalization and insufficient decision support ability in complex infection control situations, LLM in the field of HAI control has broad application prospects in the future.

    参考文献
    相似文献
引用本文

于涵,王丽华,杨颖,等.大语言模型在多重耐药菌感染防控领域的应用与评估[J]. 中国感染控制杂志,2026,25(5):645-651. DOI:10.12138/j. issn.1671-9638.20263074.
YU Han, WANG Lihua, YANG Ying, et al. Application and evaluation of a large language model in the prevention and control of multidrug-resistant organism infection[J]. Chin J Infect Control, 2026,25(5):645-651. DOI:10.12138/j. issn.1671-9638.20263074.

复制
分享
文章指标
  • 摘要阅读次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-30
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-05-29
  • 出版日期: 2026-05-28