Giter VIP home page Giter VIP logo

ptm's Introduction

PTM

The dataset and implementation of Prescription Topic Model in our paper:

Liang Yao, Yin Zhang, Baogang Wei, Wenjin Zhang, Zhe Jin. (2018). "A Topic Modeling Approach for Traditional Chinese Medicine Prescriptions". IEEE Transactions on Knowledge and Data Engineering (TKDE) 30(6), pp.1007-1021.

Require

Java 7 or above, I use Java 8 in this project.

Eclipse

Data

The Copyright holder of the dataset is China Knowledge Centre for Engineering Sciences and Technology (CKCEST). The dataset is for research use only. Any commercial use, sale, or other monetization is prohibited.

98,334 raw prescriptions with herbs and symptoms are in /data/prescriptions.txt . Each line is for a prescription, symptoms are on the left and herbs are on the right.

The preprocessed 33,765 prescriptions: /data/pre_herbs.txt, /data/pre_symptoms.txt.

Training set: /data/pre_herbs_train.txt, /data/pre_symptoms_train.txt

Test set: /data/pre_herbs_test.txt, /data/pre_symptoms_test.txt

Note:

  1. Each line in above files is for a prescription, the same line in /data/pre_herbsX.txt and /data/pre_symptomsX.txt (X is _train or _test or ' ' ) is for the same prescription.

  2. Each number in above files means an herb or a symptom, each number is an index of the following herb list or symptom list. For example, '5' in /file/pre_herbs_train.txt means the 6th herb in the herb list /data/herbs_contains.txt, '17' in /file/pre_symptoms_train.txt means the 18th symptom in the symptom list /data/symptom_contains.txt.

Herb list: /data/herbs_contains.txt

Symptom list: /data/symptom_contains.txt

TCM MeSH herb-symptom correspondence knowledge: /data/symptom_herb_tcm_mesh.txt

Symptom Category: /data/symptom_category.txt

Demo

PTM(a): /src/test/RunPTMa.java (reproducing prescribing patterns discovery results)

PTM(b): /src/test/RunPTMb.java

PTM(c): /src/test/RunPTMc.java

PTM(d): /src/test/RunPTMd.java

Herbs and symptoms prediction/recommendation tasks

(reproducing herbs/symptoms predictive perplexity and precision@N results)

PTM(a): /src/test/PTMaPredict.java

PTM(b): /src/test/PTMbPredict.java

PTM(c): /src/test/PTMcPredict.java

PTM(d): /src/test/PTMdPredict.java

Topic herb precision

/src/test/TopicPrecisionSymToHerb.java

Prescription predictive perplexity

PTM(a): src/perplexity/PTMaPerplexity.java

PTM(b): src/perplexity/PTMbPerplexity.java

PTM(c): src/perplexity/PTMcPerplexity.java

PTM(d): src/perplexity/PTMdPerplexity.java

Topic symptom coherence

/src/test/TopicKnowCoherence.java

ptm's People

Contributors

yao8839836 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

ptm's Issues

关于数据预处理的代码

作者您好,

想问下您从98334条医案中提取出33765条医案的数据预处理代码可以分享下么,或者是这33765条医案的原始数据可以公开么?我想进一步从33765条医案中提取出来剂量属性,谢谢!

关于Herb Efficacy Knowledge的数据预处理

请问从TCM MeSH中药的efficacy description中提取了symptom-herb correspondences 知识,
efficacy description是data文件夹下包含的herb category么?
herb_category
止吐药
紫苏叶.txt[紫苏叶的功效描述文本]

谢谢!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.