天津医药 ›› 2021, Vol. 49 ›› Issue (3): 324-329.doi: 10.11958/20202276

• 应用研究 • 上一篇    下一篇

乳腺癌患者血清的傅里叶变换中红外光谱研究

沈婕1,2,朱丽英2,3,朱科静2,代龙光2,许永劼4,许雯2,刘歆蕾2,李兴5,潘卫1,2,4△   

  1. 1贵州医科大学附属医院贵州省产前诊断中心(邮编550004);2贵州医科大学医学检验学院;3贵州医科大学附属医院临床检验中心;4贵州医科大学公共卫生学院;5贵州中医药大学
  • 收稿日期:2020-08-14 修回日期:2020-11-19 出版日期:2021-03-15 发布日期:2021-03-15
  • 通讯作者: 潘卫 E-mail:313831139@qq.com
  • 作者简介:沈婕(1996),女,硕士在读,主要从事乳腺癌发生发展的分子机制研究。E-mail:aizailvtu1234@163.com
  • 基金资助:
    贵州省区域内一流学科建设项目-公共卫生与预防医学(黔教科研发2017[85]号)

Study on Fourier transform mid-infrared spectroscopy in serum of breast cancer patients

SHEN Jie1,2, ZHU Li-ying2,3, ZHU Ke-jing2, DAI Long-guang2, XU Yong-jie4, XU Wen2, LIU Xin-lei2, LI Xing5, PAN Wei1,2,4△   

  1. 1 Guizhou Prenatal Diagnosis Center, the Affiliated Hospital of Guizhou Medical University, Guiyang 550004, China; 2 School of Clinical Laboratory Science, Guizhou Medical University; 3 Clinical Laboratory Center, the Affiliated Hospital of Guizhou Medical University; 4 School of Public Health, Guizhou Medical University; 5 Guizhou University of Traditional Chinese Medicine
  • Received:2020-08-14 Revised:2020-11-19 Published:2021-03-15 Online:2021-03-15

摘要: 目的 基于傅里叶变换中红外光谱技术建立一种有效区分和鉴别健康人群和乳腺癌患者的方法。方法 采集86例健康女性和85例乳腺癌患者的血清样品光谱,绘制两类人群血清样品光谱图;对两类血清样品分别提取主成分信息,绘制主成分得分2D和3D散点图,进一步计算主成分1~10的得分(PC1~PC10);利用判别分析法原理建立判别分析模型,基于马氏距离对所有样本进行判别;在波数范围3 931~619 cm-1内计算不同光谱预处理方式下所建模型的性能指标评分,选择最佳预处理方式建立模型。结果 女性健康人群和女性乳腺癌患者的光谱在波数 3 363 cm-1、2 360 cm-1、1 641 cm-1、1 552 cm-1、663 cm-1处的峰强差异均有统计学意义(P<0.05);主成分分析结果显示,2组人群PC1~PC4差异有统计学意义(P<0.05),PC5~PC10差异无统计学意义(P>0.05);与正常组相比,乳腺癌组患者到N的马氏距离值高,到C的马氏距离值低(P<0.05);所建模型的验证集正判率为100%;对光谱不进行任何处理下所建模型最优,性能指标评分为94.1分。结论 傅里叶变换中红外光谱法可用于区分和鉴别健康人群和乳腺癌患者,有望成为乳腺癌辅助诊断的一种方法。

关键词: 乳腺肿瘤;血清;诊断;谱学, 傅里叶变换红外;判别分析;傅里叶变换中红外光谱技术

Abstract: Objective To establish an effective method for distinguishing and identifying normal people and breast cancer patients based on Fourier transform mid-infrared spectroscopy technology. Methods The serum samples of 86 female normal people and 85 female breast cancer patients were collected, and the spectra of the serum samples were drawn for the two groups of people. The principal component information of the two types of serum samples were extracted. The principal component score 2D and 3D scatter plots were drawn, and the scores of principal component 1-10 (PC1-PC10) were further calculated. Using the principle of discriminant analysis to establish a discriminant analysis model, and all samples were judged based on Mahalanobis distance. The performance index scores of models built under different spectrum preprocessing methods were calculated within the wave number range of 3 931-619 cm-1, and the best preprocessing method was selected to build the model. Results There were significant differences in the peak intensities of the spectrum at wave numbers 3 363 cm-1, 2 360 cm-1, 1 641 cm-1, 1 552 cm-1, and 663 cm-1 between female normal population and female breast cancer patients (P<0.05). The results of principal component analysis showed that there were significant differences in PC1-PC4 between two groups (P<0.05), and there were no significant differences in PC5~PC10 between two groups (P>0.05). Compared with the normal group, the Mahalanobis distance to N in the breast cancer group is higher, and the Mahalanobis distance to C is lower (P<0.05), and the positive judgment rate of the validation set of the built model was 100%. The model built without any processing on the spectrum was the best, and the performance index score was 94.1 points. Conclusion Fourier transform mid-infrared spectroscopy can be used to identify and distinguish normal people and breast cancer patients, and it is expected to become a method for assisting breast cancer diagnosis.

Key words: breast neoplasms, serum, diagnosis, spectroscopy, Fourier transform infrared, discriminant analysis, Fourier transform mid-infrared spectroscopy