Giter VIP home page Giter VIP logo

clinvar-this's Introduction

clinvar-this's People

Contributors

dependabot[bot] avatar github-actions[bot] avatar holtgrewe avatar varfish-bot avatar xiamaz avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

Forkers

genomicmedlab

clinvar-this's Issues

Create command to write out gene-to-phenotype link

Is your feature request related to a problem? Please describe.
In many situations it is useful to see the links between HPO terms and genes from ClinVar.

Describe the solution you'd like
Now that we have the handy JSONL format, generating this is easy. A similar approach as in #102 can be taken.

Describe alternatives you've considered
N/A

Additional context
N/A

Create tool to export data for lollipop plots along a gene

Is your feature request related to a problem? Please describe.
Lollipop plots are a useful visualization of variants in a gene. See for example the ClinVar ones. They would be useful to have in the output.

image

Describe the solution you'd like
Besides the coordinates of the transcript / exon (which would have to come from Mehari), we would need to write out (separate for each genome release) for each gene, all variant:

  • SPDI information (chrom, pos, deleted bases, inserted bases)
  • molecular impact
  • RCV
  • ACMG rating
  • review status

Describe alternatives you've considered
N/A

Additional context
N/A

Add command to write out small/structural variants as JSONL

Is your feature request related to a problem? Please describe.
For annonars and display we need to have files with small and structural variants.

Describe the solution you'd like
Write out such files that contain the variant, the RCV, and the assessment and review status.

Describe alternatives you've considered
N/A

Additional context
N/A

Add support for parsing ClinVar XML

Is your feature request related to a problem? Please describe.
When working with ClinVar, it is useful to work with their full database dumps published in XML format. However, the XML format is hard to work with.

Describe the solution you'd like
Add functionality to parse the ClinVar data based on xmltodict and further processing to easy-to-use Python attrs classes. Importantly, allow export to JSONL format as this is easy to work with in external tools as wells.

Describe alternatives you've considered
N/A

Additional context
N/A

Write out submitting organisation and local key to phenotype gene link

Is your feature request related to a problem? Please describe.
In the case that more than one variant is submitted for a patient, we currently have no way of detecting this.

Describe the solution you'd like
Write out the submitting organisation and local key (when available) for each record in the phenotype to gene link step.

Describe alternatives you've considered
N/A

Additional context
N/A

Add documentation of use cases

Document the following use cases:

  • import from various formats into the internal JSON format
  • submission and result retrieval
  • submission and resubmission
  • deletion of records

Adjust to latest clinvar API version.

Email from ClinVar

Dear ClinVar API Submitter,

Thank you for using the ClinVar Submission API!
We are making changes to the API submission "test mode" to allow users to test the new schema before its production release.
The test endpoint for testing submission of data https://submit.ncbi.nlm.nih.gov/apitest/v1/submissions could be unstable during the transition to this new test schema from November 11-13, 2022.
From November 14, 2022, it will only accept the requests adhering to the new upcoming schema.
Key changes in new submission schema:

  • We have moved 'assertionCriteria' to top level. Note that only one single assertion criteria (a citation or an electronic document) may be provided for a submission. This assertion criteria will be applied to all interpretations in the submission.
  • We have moved 'releaseStatus' to the top level as 'clinvarSubmissionReleaseStatus'. This status will be applied to all the interpretations in the submission.

Please see the new submission schema attached for details.

Note: The main API enpoint https://submit.ncbi.nlm.nih.gov/api/v1/submissions/ still accepts the current schema at https://www.ncbi.nlm.nih.gov/clinvar/docs/api_http/#h2_schema_raw.
We will do the production release of this new schema on the main endpoint on November 21, 2022.
If you have any questions, please contact us at [email protected].

Thank you for sharing data through ClinVar!

ClinVar staff
[email protected]

ClinVar API Error Message

Additional properties are not allowed ('assertionCriteria', 'releaseStatus' were unexpected)

Properly augment documentation for new parsing features

Is your feature request related to a problem? Please describe.
Having the ClinVar XML parsing features is useful but this has not been reflected in the documentation yet.

Describe the solution you'd like
Augment the documentation both in terms of manual and API docs.

Describe alternatives you've considered
N/A

Additional context
N/A

Complete API documentation

We need more Python sphinx docstrings for members of the messages etc. from the ClinVar API documentation.

Create gene-impact-summary report command

Is your feature request related to a problem? Please describe.
Having the JSONL files is handy as now it will be easy to create a per-gene report of how many variants with each molecular impact and each clinvar pathogenicity are present.

Describe the solution you'd like
Create a clinvar-this data gene-impact-report command for this.

Describe alternatives you've considered
N/A

Additional context

Create command to extract variants per classification and frequency

Is your feature request related to a problem? Please describe.
It can be useful to know the distribution of variants classified as ACMG1-5 for each gene.

Describe the solution you'd like
Create a command that reports for each gene the number of variants in each ACMG classification in buckets

  • 0 / no frequency
  • <0.00001
  • <0.000025
  • <0.00005
  • <0.0001
  • <0.00025
  • <0.0005
  • <0.001
  • <0.0025
  • <0.005
  • <0.01
  • <0.025
  • <0.05
  • <0.1
  • <0.25
  • <0.5
  • <1

Describe alternatives you've considered
N/A

Additional context
N/A

Implements models for extra files

We need to add models for the extra files, e.g.

{
  "submissionName": "SUB673156",
  "submissionDate": "2021-03-25",
  "batchProcessingStatus": "Success",
  "batchReleaseStatus": "Not released",
  "totalCount": 3,
  "totalErrors": 0,
  "totalSuccess": 3,
  "totalPublic": 0,
  "submissions": [
    {
      "identifiers": {
        "localID": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success1",
        "clinvarLocalKey": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success1",
        "localKey": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success1",
        "clinvarAccession": "SCV000839746"
      },
      "processingStatus": "Success"
    },
    {
      "identifiers": {
        "localID": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success2",
        "clinvarLocalKey": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success2",
        "localKey": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success2",
        "clinvarAccession": "SCV000839747"
      },
      "processingStatus": "Success"
    },
    {
      "identifiers": {
        "localID": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success3",
        "clinvarLocalKey": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success3",
        "localKey": "adefc5ed-7d59-4119-8b3d-07dcdc504c09_success3",
        "clinvarAccession": "SCV000839748"
      },
      "processingStatus": "Success"
    }
  ]
}

Add getting started tutorial

Document getting started, beginning with registering an org, getting an API key, importing data using the apitest and dry-run, and finally removing the records again.

Implement API models

The message records from #1 are to be used for server communication. The aim of this ticket is to abstract away the length-1 arrays and lowerCamelCase to snake_case.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.