Understanding traditional Chinese medicine via statistical learning of expert-specific Electronic Medical Records
摘要Background:Traditional Chinese medicine (TCM) has been attracting lots of attentions from various disciplines recently.However,TCM is still mysterious because of its unique philosophy and theoretical thinking.Due to the lack of high quality data,understanding TCM thoroughly faces critical challenges.In this study,we introduce the Zhou Archive,a large-scale database of expert-specific Electronic Medical Records containing information about 73,000 + visits to one TCM doctor for over 35 years.Covering the full spectrum of diagnosis-treatment model behind TCM practice,the archive provides an opportunity to understand TCM from the data-driven perspective.Methods:Processing the text data in the archive via a series of data processing steps,we transformed the semistructured EMRs in the archive to a well-structured feature table.Based on the structured feature table obtained,a series of statistical analyses are implemented to learn principles of TCM clinical practice from the archive,including correlation analysis,enrichment analysis,embedding analysis and association pattern discovery.Results:A structured feature table of 14,000 + features is generated at the end of the proposed data processing procedure,with a feature codebook,a term dictionary and a term-feature map as byproducts.Statistical analysis of the feature table reveals underlying principles about the diagnosis-treatment model of TCM,helping us better understand the TDM practice from a data-driven perspective.Conclusion:Expert-specific EMRs provide opportunities to understand TCM from the data-driven perspective.Taking advantage of recent progresses on NLP for Chinese,we can process a large number of TCM EMRs efficiently to gain insights via statistical analysis.
更多相关知识
- 浏览4
- 被引0
- 下载0

相似文献
- 中文期刊
- 外文期刊
- 学位论文
- 会议论文


换一批



