ohdsi / thebookofohdsi Goto Github PK

View Code? Open in Web Editor NEW

104.0 104.0 81.0 711.19 MB

The Book of OHDSI repository

Home Page: https://ohdsi.github.io/TheBookOfOhdsi/

License: Creative Commons Zero v1.0 Universal

Dockerfile 0.05% Shell 0.21% TeX 23.40% CSS 1.05% HTML 1.08% R 74.21%

thebookofohdsi's People

Contributors

Stargazers

Watchers

thebookofohdsi's Issues

METADATA section (in ETL chapter or elsewhere?)

the ETL chapter may discuss some relevant METADATA that should be created.

e.g., the CDM_SOURCE table
what version of vocab was used
short name for the dataset
how many patients were deleted (missing year of birth)

Line 432 of PatientLevelPrediction chapter

In the line 432 of PatientLevelPrediction chapter,

Measurement: Construct covariates for each measurement concept ID and time interval selected and if a patient has the concept ID recorded during the specified time interval prior to the cohort start date in the measurement table, the covariate value is 1, otherwise 0.

Measurement Value: Construct covariates for each measurement concept ID with a value and time interval selected and if a patient has the concept ID recorded during the specified time interval prior to the cohort start date in the measurement table, the covariate value is the measurement value, otherwise 0.

To my knowledge, the measurement value is 'continuous value' not a binary value. How do you think @PRijnbeek @jreps @schuemie ?

Meddra shouldn't be a part of the excercise as it's linence restricted vocabulary

https://forums.ohdsi.org/t/questions-about-book-exercises-ch-4-and-5/13676

Chapter 5, exercise 5.3.
Meddra shouldn't be a part of the excercise as it's linence restricted vocabulary

Exercises?

The patient-level prediction chapter draft currently doesn't have any exercises . Do we think chapters should have those, for example to be used in classroom settings?

@PRijnbeek ?

"Lauren's story" link outdated?

"Lauren's story" link in section 4.3.1 now just redirects to the main Endometriosis UK front page. Seems it's no longer anywhere on their website.

discrepancy in montherapy query

For the mono-therapy query in the cohort query, this is using the drug_exposure table rather than the drug_era table. The problem with using the drug exposure table directly is that it will exclude individuals who happen to have 2 records that provide exact same drug. It seems like this would need to be collapsed, and that's what the drug_era table does. The generated SQL uses the drug_era table when computing this particular condition.

Rename 'Methods Library' to 'HADES' everywhere

We've decided to rename the Methods Library, and this is referenced throughout the book.

Add json files to create the examples in the book

I suggest to add json files (or link to json files) to help with the ATLAS examples (cohorts, characterization, pathways, incidence and more).
In the current status, if I want to reproduce the pathway analysis examples of first line treatment for hypertension that is given in the book (Chapter 11.9), I need to define many concept sets and cohorts. Or I need to look for one that was created in the ATLAS-demo and make sure that it corresponds the description which is not a trivial task.

I would like to contribute, but I'm not sure what would be the best option. I thought of severals:

Link to (or give the id of) the example in ATLAS-demo. The limitation - If someone change the example on the web if it is not locked.
Create a json file and add as plain text to the book of OHDSI
Create a json file and put a link in the Book. Q: Where to store the link? Under books's files (e.g. extras folder) or external location (personal repo?).
Create a script that load a json file to the ATLAS using WebAPI. Limitation - I'm not sure if it's fully implemented for characterization, PLP etc., or it is only support cohort and concept sets upload.

Any suggestions?

Contributor list not shown on `book.ohdsi.org` version

Am using Safari. Does this happen on other systems?

Allow book readers to comment and ask questions or use github smartly for that

It would be super cool to have a platform, where book readers can annotate book sections with

comments
questions that arise

That would allow improving the books.
They can file an issue in github but that is a lot of steps.

Or, OK, use github for it but have tags for each chapter or chapter section so that they are easy to find when refresh of chapter is being authored.

References in PatientLevelPredicition.Rmd render before (and after) heading

I am unsure how to fix this issue. Maybe @schuemie or @PRijnbeek have some ideas?

Back-of-the-book index has incorrect page numbers and links

The index is created by xelatex. Perhaps related to issue #24

Should all citations have DOIs (if available)?

Currently some citations in PatientLevelPrediction.Rmd have DOIs and many do not (although DOIs exist for these papers).

What should be our standard across chapters?

http://howoften.org error 504

TheBookOfOhdsi/OpenScience.Rmd

Line 41 in af3910b

 Because of the privacy-sensitive nature of healthcare data, fully open, comprehensive patient-level datasets are typically not available. However, it is possible to leverage OMOP mapped datasets to publish important aggregated data and results sets, such as the earlier mentioned http://howoften.org and other public result sets that are published to http://data.ohdsi.org. Also, the OHDSI community provides simulated datasets such as SynPUF for testing and development purposes, and the OHDSI Research Network (see \@ref(NetworkResearch)) can be leveraged to run studies in a network of available datasources that have mapped their data to OMOP. In order to make the mapping between the source data and the OMOP CDM transparent, it is encouraged for data sources to re-use the OHDSI ETL or 'mapping' tools and publish their mapping code as open source as well. 

http://howoften.org is redirected to http://www.ohdsi.org/web/howoften/

config: Object { method: "GET", jsonpCallbackParam: "callback", url: "/seir_api/drug_list", … } data: "<html><body><h1>504 Gateway Time-out</h1>\nThe server didn't respond in time.\n</body></html>\n\n" headers: function xd(d) status: 504 statusText: "Gateway Time-out" xhrStatus: "complete" seir.js:18621:29

question about / 10.7.1 Exposure cohort

Hello. I was just scanning the chapter on SQL and was looking at 10.7.1 Exposure cohort. There are two discrepancies.

First, the text below says We take the first drug exposure per person but in fact, the SQL does not do exactly this -- it constructs a synthetic structure having the minimal begin and minimal dates for the given person that match the concept criteria. Now, this might be a case if exposures do not intersect.

Second, the SQL here doesn't do what the corresponding auto-generated SQL does. The auto-generated SQL for drug exposure criteria handles the possibility of a NULL end-date,
COALESCE(C.drug_exposure_end_date, (C.drug_exposure_start_date + 1*INTERVAL'1 day')), it also has slightly different logic, it ensures the end_date is within the observation period. This may not be relevant.

conditioned Cox regression after PS matching in the example

@schuemie

In the chapter 12. PLE, the example shows the conditioned Cox regression after PS matching
(PS matching in Figure 12.14 and Conditioned Cox regression in Fig 12.15)
Wouldn't it better to show 'Unconditioned Cox regression after PS matching' or 'Conditioned Cox regression after PS stratification'?

I know current ATLAS doesn't support 'unconditioned Cox regression' setting (I don't know the reason why I cannot set 'unconditioned Cox' in current ATLAS)

Do we need to modify fig 5.6?

In the Figure 5.6, the caption denotes that "Atrial fibrillation" is the main concept to be illustrate. However, "Atrial fibrillation" is denoted as [Descendant] and "Fibrillation" as [Concepts] in the figure.

To fix it,
Concepts to Atrial fibrillation; Descendant to Controlled atrial fibrillation ...

perhaps use drug_era for exposure cohort

For the exposure cohort consider using drug_era rather than occurrence? I suggest this since a bulk of this query is focused on normalizing for overlapping medications, which is something that drug_era does already (it is some sort of calculated table, no?). In particular, even as written, with overlapping drug_exposures, this query is now going to grab which ever end_date happens to be first...

Achilles version is wrong in software list

In book it says 1.6.6 while in Achilles repo the latest is 1.6.3

Adding a chapter on NLP to the book of OHDSI

We are planning to add a chapter on NLP to the book of OHDSI. This is to start a discussion to know your thoughts in general, and comments on the content and specific topics to include.
[Posted on behalf of Hua Xu - OHDSI NLP WG lead]

Add hypothes.is to the Book Of Ohdsi website so anyone can annotate and comment on the book

Hypothes.is can add interactivity to a book created with Rmarkdown. Readers would be able to ask questions, suggest edits, and make comments all in their web browser.

Here are examples of Rmarkdown books that uses hypothesis.is:

Here is some documentation about how to add this to a book: https://www.crumplab.com/OER_bookdown/hypothes-is-1.html

This functionality might make it easier for more people in the community who are not familiar with Github and Rmarkdown to comment and suggest edits or highlight places for clarification.

Table 4.6 mistake ?

OHDSI newbie here. I am reading the doc and think there might be a mistake in table 4.6:

Column name	Value	Explanation
ADMITTED_FROM_ CONCEPT_ID	0	If known, this is contains a Concept representing where the patient was admitted from. This concept should have the domain "Visit". For example, if the patient were admitted to the hospital from home it would contain 8536 ("Home").
ADMITTED_FROM_ SOURCE_CONCEPT_ID	NULL	This is the value from the source that represents where the patient was admitted from. Using the above example, this would be "home".

Unless I got everything wrong, shouldn't it be ADMITTED_FROM_ SOURCE_VALUE instead of CONCEPT_ID ?

Also I spotted this possible mistake because the example mixes "NULL" and "0" for fields that are supposed to be ID's. Is there a reason why the value given in the examples is sometimes NULL and sometimes 0 ?

thanks
Thomas

link relevant short videos to chapters/sections

treatment patway has a great short video

here
https://www.youtube.com/watch?v=rdniIztguys

idea: link relevant short videos (by nice icon)
right after a subchapter heading

Update the Data Quality chapter

The Data Quality Dashboard has matured, and we’re about to launch the CohortDiagnostics package, which I’d like to describe here.

@clairblacketer , what do you think?

question about fig 5.6

In fig 5.6, Fibrillation is represented as a child of Supraventricular arrhythmia

But, when you search for Fibrillation in ATLAS, you can find it is not.

The CDM v6 contains 15 Clinical Event tables (not 16 Clinical Event Table)?

TheBookOfOhdsi/CommonDataModel.Rmd

Line 131 in 81a3897

 The CDM contains 16 Clinical Event tables, 10 Vocabulary tables, 2 metadata tables, 4 health system data tables, 2 health economics data tables, 3 standardized derived elements, and 2 Results schema tables. These tables are fully specified in the CDM Wiki.[^cdmWikiUrl1] 

I count 15 Clinical Event Tables, not 16!! Where are they coming from?
The rest is fine!

Figure 4.1: CDM version 6.0:

TheBookOfOhdsi/CommonDataModel.Rmd

Lines 15 to 18 in 81a3897

 An overview of all the tables in the CDM is provided in Figure \@ref(fig:cdmDiagram). \index{Common Data Model!data model diagram} 

 ```{r cdmDiagram, fig.cap="Overview of all tables in the CDM version 6.0. Note that not all relationships between tables are shown.",echo=FALSE, out.width="100%"} 

 knitr::include_graphics("images/CommonDataModel/cdmDiagram.png")

Update the Network Research chapter

With information about the new ohdsi-studies organization and app.

Add chapter on Broadsea

Add a chapter on Broadsea, to help new sites understand best practices around setting up OHDSI tool stack.

multi-language support

for example how to translate contents into chinese ,spanish?
is the best practise to fork another repo?
since the content is continuing to change

Rethinking Chapter 9 with Eunomia in Mind

Hi all,

Was recently helping someone on the OHDSI forums with a question on Chapter 9.
Although Eunomia is mentioned later at the end of the chapter, for beginners, Eunomia should be mentioned earlier and leveraged in the examples throughout the chapter in my opinion.
What do you all think?

PDF pages with both a table and a figure overflow outside margin

For example the "The cohort method design" section in the PLE chapter

CONDITION_STATUS_ CONCEPT_ID description

TheBookOfOhdsi/CommonDataModel.Rmd

Line 271 in b79b189

 |CONDITION_STATUS_ CONCEPT_ID|0|If known, the this tells the circumstance and . For example, a condition could be an admitting diagnosis, in which case the concept ID [4203942](http://athena.ohdsi.org/search-terms/terms/4203942) was used.| 

The first sentence of the "CONDITION_STATUS_ CONCEPT_ID" description is incomplete : "If known, the this tells the circumstance and . "

11.2 cohort caracterization - extend with example

add example from
https://github.com/cukarthik/IUDEstimationStudy/blob/master/R/AdditionalAnalysis.R#L27

Need chapters about mapping, but not sure where

Right now, we have a chapter about Usagi. That needs to be complemented with specific mapping processes we developed. Maybe like this:

Mapping and QA of codes to Standard Concepts

Mapping codes locally versus through the OHDSI Standard Vocabularies
Usagi
Systematic mapping of Drug codes
Systematic mapping of Condition codes
Systematic mapping of Procedure codes
Systematic mapping of other codes

Thoughts

error of caption about Table B.5

There is an error of caption about Table B.5.

The title of the caption should be 'Acute myocardial Infarction'.

Thanks.

Chapter 13 - PLE Edits

Just a couple of minor edits and suggestions as I read through the PLE chapter:

Sub-chapters 13.3 and 13.4 should be titled 'Implementing....' or 'Implementation of.....'not just 'Implementation.....'
I would be inclined to make the descriptive section (section 13.1) on propensity scores a sub-chapter (but beware that 13.4.4 is already a sub-header for propensity scores)
Having been part of the PLE workgroup at the OHDSI F2F, I can honestly say that it was a great experience being part of this collaboration. Great use of the screenshots especially the little icons - they really help the viewer navigate through their cohort.

document cohort exit magic

In the cohort exit section it talks about "magic". While the SQL's implementation may be magical and need not be further discussed, the exact logical algorithm that this query implements is not documented. This is quite important to understanding what it means to generate an exit cohort in OHDSI. The algorithm could be discussed informally, but it should express what considerations and/or cases are accounted for.

Consider refactoring this code so that previous queries populate a temporary cohort era table, with subject, begin/end. Then, a separate SQL fragment that might be more digestible could be written that collapses these eras. Doing both the era extraction and the collapse at the same time make the logic hard to follow. If these two operations were separated, perhaps both could be more easily explained.

Change field names from lower case to upper case everywhere

In the CDM workgroup the convention has changed: field names now must be written in UPPER CASE. We want to be consistent in the book, but so far we've followed the old convention.

Typo in Figure 16.1

TheBookOfOhdsi/ClinicalValidity.Rmd

Line 33 in ab4def6

 ```{r matrix, fig.cap='Confusion matrix.', echo=FALSE, out.width='75%', fig.align='center'} 

The figure lists "False Negative" in the upper right-hand square when it should list "False Positive"

Type concepts in Excercises anwers need to be updated

https://forums.ohdsi.org/t/questions-about-book-exercises-ch-4-and-5/13676

There are still an old Type concepts, while we got a new set of Type concepts after their refactoring.
So these should be replaced

	An overview of all the tables in the CDM is provided in Figure \@ref(fig:cdmDiagram). \index{Common Data Model!data model diagram}

	```{r cdmDiagram, fig.cap="Overview of all tables in the CDM version 6.0. Note that not all relationships between tables are shown.",echo=FALSE, out.width="100%"}
	knitr::include_graphics("images/CommonDataModel/cdmDiagram.png")

ohdsi / thebookofohdsi Goto Github PK

thebookofohdsi's People

Contributors

Stargazers

Watchers

Forkers

thebookofohdsi's Issues

Recommend Projects

Recommend Topics

Recommend Org