Giter VIP home page Giter VIP logo

drpowell / degust Goto Github PK

View Code? Open in Web Editor NEW
66.0 66.0 29.0 12.41 MB

An interactive web-tool for RNA-seq analysis

Home Page: http://degust.erc.monash.edu/

License: GNU General Public License v3.0

Ruby 1.52% JavaScript 76.53% CoffeeScript 6.93% CSS 1.17% HTML 3.24% Makefile 0.01% Python 0.60% Shell 0.03% Vue 9.24% Dockerfile 0.04% SCSS 0.06% Haml 0.35% Handlebars 0.28%
d3js interactive rna-seq

degust's People

Contributors

bioinformaticsbejo avatar dependabot[bot] avatar drpowell avatar kimberrypi avatar kiwiroy avatar multimeric avatar pansapiens avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

degust's Issues

Clean error message for 1 vs 1 analysis

We all know that 1 vs 1 isn't ideal, but hey, sometimes its what we have to analyse. Clean error messaging when voom/edge fails due to this would be good. And ideally a version that also allows visualisation of such data (with all the caveats that you would expect).

Option to choose between glmFit/glmQFit

Degust uses glmFit, while glmQFit is a more stringent. To quote one of the authors of edgeR:

glmQLFit will provide more accurate type I error rate control as it accounts for the uncertainty of the dispersion estimates in a more rigorous manner than the glmFit pipeline

An option to choose which model to fit would be great.

Ignore CSV header lines beginning with #

This would allow counts files from featureCounts to be used directly without modification - which is not always ideal, but useful for some users (eg, when RNAsik Degust-compatible count file generation fails).

Since there is a chance that the CSV column labels row may begin with a #, this should be a checkbox option or text entry ('Ignore header lines beginning with #', or 'Skip X lines [header region]') on the configuration page.

Basic understanding

Hi,
I was wondering what is the meaning of the color scheme in KEGG pathways - they don't match the colors given to different treatments in the MDS plot, for example. I am also wondering if it is possible to customize and save the parallel coordinates plot (e.g. select only a custom set of genes to be displayed).
Degust is a great tool, and I apologize in advance if the answers to my questions are in the Degust website, I was unable to find them.
Thank you in advance!

Selecting on the parallel co-ordinates plot

Hi Dave,

When you make a selection on the parallel co-ordinates plot by mouse, you can't then select and copy information out of the gene table. Even after removing the selection on the plot, until you reload the page, you still can't select the gene table. This only occurs with the parallel co-ordinate plot, the MA and the Volcano plots don't have this problem.

Enhancement: Widescreen

Being able to adjust the % of the screen width to see more data (in the genes table), and to allow for more space for conditions labels would be useful. For those with bigger screens only about 50% of the screen width is actually used.

User removal of info columns from results page (Nice to have)

Hi guys,

Would be useful for the user to be able to

  • Remove info columns dynamically. (such that it doesn't affect the config)

We find that we want a lot of annotation, yet being able to hide it quickly would really help. Export would still include all annotation columns.

Cheers
/Alisatir

Report confidence interval of logFC ?

ie. FC is the average, but some genes (eg. low expressed) the conf interval would be large
we typically don't report the confidence interval - we should though. Obvjiously p-value is related.

-- you

Feature request: parallel coordinates information

Firstly, the parallel coordinates part being interactive is really great. Some minor comments that would increase usability:

  • more/minor tickmarks in parallel corordinates (say 0,1,2,3...)
  • Ability to snap the selection to increments of 0.5 (or whatever the minor tickmarks are)
  • Onscreen update of the values selected in the parallel coordinates (on the coordinates of perhaps as a table elsewhere on screen (i.e. below options)?

automatic adjust row height in the degust html report

@drpowell Hi, Is there a way to adjust the row height in the degust html report? We've got genes with annotation from multiple DBs but only the first item was displayed. For example,
image

The full content of Desc for the first gene is acutally

[GO] GO:0055114 Biological Process:oxidation-reduction process
GO:0050660 Molecular Function:flavin adenine dinucleotide binding
GO:0016491 Molecular Function:oxidoreductase activity
GO:0016614 "Molecular Function:oxidoreductase activity, acting on CH-OH group of donors"
[InterPro] IPR000172 "Glucose-methanol-choline oxidoreductase, N-terminal"
IPR007867 "Glucose-methanol-choline oxidoreductase, C-terminal"
IPR012132 Glucose-methanol-choline oxidoreductase
IPR023753 FAD/NAD(P)-binding domain
[UniPro] Protein HOTHEAD OS=Arabidopsis thaliana GN=HTH PE=1 SV=1
[TREMBL] Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003422mg PE=4 SV=1

Thank you.

Duplicate column names are silently accepted by Degust

I uploaded a counts file in which I accidentally had two duplicate column names (but had different counts across the two columns). Degust didn't chuck an error at that, I was able to put two samples into the same condition and still load a Degust session. In the session however, it seems to pick one of the columns and duplicates it as when I checked to show counts, I had identical numbers for the two samples.

Enhancement: dealing with sex differences in groups

This may well be overkill, however it is still the case that we sometimes end up with sex differences in our samples. The option of being able to label the samples as M/F, and having that go into the fit would be really nice.

Selecting view after save changes produces error (ok when view from config screen)

Bit of a strange error this one. Using chrome.

settings=%7B%22csv_format%22%3Atrue%2C%22replicates%22%3A%5B%5B%22KI%22%2C%5B%22S232_KIr%22%2C%22S233_KIr%22%2C%22S242_KIr%22%5D%5D%2C%5B%22WT%22%2C%5B%22S176_WTr%22%2C%22S225_WTr%22%2C%22S228_WTr%22%5D%5D%5D%2C%22fc_columns%22%3A%5B%5D%2C%22info_columns%22%3A%5B%22Gene+ID%22%2C%22gene_biotype%22%2C%22name%22%2C%22entrezgene%22%2C%22chromosome_name%22%2C%22start_position%22%2C%22end_position%22%2C%22strand%22%2C%22Length%22%2C%22description%22%5D%2C%22analyze_server_side%22%3Atrue%2C%22link_column%22%3A%22Gene+ID%22%2C%22name%22%3A%22SVI_SRSF2_HsclCreYFP_STAR_mm10_rmdup_counts_annotated%22%2C%22primary_name%22%3A%22%22%2C%22init_select%22%3A%5B%22KI%22%2C%22WT%22%5D%2C%22hidden_factor%22%3A%5B%5D%2C%22min_counts%22%3A10%2C%22min_cpm%22%3A1%2C%22min_cpm_samples%22%3A2%7DHTTP/1.1 200 OK
Date: Wed, 30 Aug 2017 05:24:26 GMT
Server: Apache/2.4.18 (Ubuntu)
X-Frame-Options: SAMEORIGIN
X-XSS-Protection: 1; mode=block
X-Content-Type-Options: nosniff
Content-Type: text/html
Content-Disposition: inline; filename="compare.html"
Content-Transfer-Encoding: binary
Cache-Control: private
X-Request-Id: 90e1ff14-3615-46b4-9dad-d40abcb724a3
X-Runtime: 0.002492
Set-Cookie: _degust-jobs_session=SHVIU2xJQXEyVVMwOW9nL01DamZySGtpZzhqN3RCNzQ2N3R1UFZJQlQ0Rno0L1U1OUV0M28zVTVURUg2Q1F3MVIza0FnNkR2K2w5WFhJQThkd1krN1ZCT3hIcU1yU01TSXV3VDNTOWlhK3FkTFYwL3dxZm9sQ3dVYXRINHVpaWExeUl6dFZvdEFqQUk2Q09ibTROcElha2VtNVpjWWtoVm1DWDlRRjZSNlFFPS0tWG5uV1NEZ1RNaEszT3FQZktpRjF6dz09--0187109c89db3122880ea03ef96dfd4b1f35b5f4; path=/; HttpOnly
Content-Encoding: gzip
Keep-Alive: timeout=5, max=99
Content-Length: 423
Accept-Ranges: none
Connection: keep-alive

�‹���������SA’Ó0�¼ç�ƒ.>YJ ¸€í�ð‚}�b�íÉÊ–JšÄ›¢ø;²­„�PÅ.'«¦Õ=3­v5ðh›�@5 6àƒëÈb-�fÿI©yžåüAºÐ«÷ûýGuØ«|E,¤D³4=C@[�jÝ$€¯>ÑiÔ=?õ�†€].DÕéËrM.ˆjv›Ä»²„o/ŒaÒ6—§'(Ë?Ô#_-Æ�‘³d!U�£²t”é[À]îßœÖ�c�b¥mó2¾ð�‰Eæõ^�ü› üZ.¶�<?â'}Ñ[U@ í*·
tŠES©
kÞ@�–Úç>�ùo…Û.¿ó+µÄb=��¹fIC� S‹€Þê�Ë��´VÇX‹ô¸¬iÂ�ó‘ogôt��ŽCÊÉ
]‚wh¾b�Žœš��꾩t6<§Ñ�ïf´VöÄÃù(É)³2Uq—Ð
PL9Æ´�M=\Ý9€Ñ¬¥”�ë��KÅ•¡™’óRVÊ?ô¦±¿YsKnššKë´Á�úwÅ}?•�ÌÆlÇWø~#Ï4�7ˈÌiØ�5|‡�Ÿ³Øã3lî'‡Ößu÷�âO°m·���

CPM gene plot as default?

Would be more meaningful to have cpm scale as default in gene plots in our opinion. And/or a check box in the config for this?

Cheers and thanks as always

Sample Group Selection when more than 3X3

So when one has a count table of over 20 samples for instance, is it possible to upload a pdata table with sample name, group as a csv file, rather than checking the sample names?

Particularly good for scRNASeq, but also with this dataset I am using at the moment. which is 157 samples of RNA Seq.

Enhancement: Experiment metadata / text description

It would be useful to have a text box where we can include a full description of the processing done on the dataset - a materials and methods if you like. Useful to keep track of documentation within degust for easy access.

Automatic filename templates for saving images

It would be useful when saving MDS plots to png/svg, for the information used in making the plot to be saved at the same time:

This could be:
Dataset name + params for mds + dimension. (or at least for that option to be the default).

For gene expression this could be:
Dataset name + Gene info cols + cpm/count

R code - use input_file param?

Defining/using these at the start of the R code would be helpful for people reproducing things in R.

input_file
output_dir
output_file

docker installation

--Hi,

after complete installation with docker, i can't reach authentification with google_oauth2 from localhost:8001
is there a solution to use degust without auth step ?
thank you --

Request: export gene list to clipboard (and/or gmt)

Some possible "nice to have" ideas

Add option for:

  1. Exporting gene list as text to clipboard (for pasting into enrichr or other tools).

  2. Create .gmt file (using gene id) to create a molsigdb style .gmt gene set for up/downregulated genes

  • Name = dataset name_Up (maybe incuding filter information?)
  • Gene ids logFC > 0 (tab delimited)
  • Name = dataset name_Down
  • Gene ids logFC < 0 (tab delimited)
  1. Add filter for upregulated only / downregulated only

Show and Hide heatmap on production v3.1

Hi there, love the ability to show/hide heatmap from the tab views, just noticed this in the MDS tab. possible to have this across the tabs, and then also a hyperlink to hide the heatmap again (at the moment not obvious you have to right click on the heatmap to reveal the hide toggle).

chur!

Trailing space in sample name breaks the R script

Not sure if it's worth fixing but I came across a case where someone had a trailing space in one of their sample names. This ended up breaking the R script when it tried to subset the data using a non-existent colname.

Attempting to save 3D MDS view causes browser freeze

When viewing a 3D MDS plot, it is possible to right click to save as PNG/SVG. This causes a session crash with a unresponsive box - clicking the menu options will escape but it does loose memory of what samples were selected e.t.c.

I suspect this shouldn't be an option at all.

Also occurs if animation has been toggled off.

R code export usability improvements (for integration into EGSEA workflow)

Ideally the output from degust/R code would pipe directly into the EGSEA sample example (or something very similar to it for downstream pathway/gene set analysis), so updating the y object so that it contains all the necessary parts would be quite useful. (https://bioconductor.org/packages/release/bioc/html/EGSEA.html)

This may not be complete but I found these two to start with.

  1. y$E does not contain rownames (needed for EGSEA) (Entrez IDs - but should be whatever the input into degust was would be ideal)
  2. y$targets could also contain $group - again from config showing the sample groups used.
  3. y$genes to contain annotation information as input into degust as description fields.

3D Mds legend possible?

More a feature request, possible to present a legend for the 3D mds plot? (again on behalf of an interested end-user).

Download of original input data

It would be handy to be able to download the input data as it was originally input, as it makes the receiver of the degust link more autonomous.

This could be from the datasets / your data page, or from the configure page (or both).

duplicate rownames not allowed => change error notification?

duplicate rownames not allowed caused by input with Gene ID, name not in first columns. Would be more robust to not rely on the order being correct, but find Gene ID column and put it first.

Alternatively, a better error message thown: "Gene ID expected as first column of input".

I expect a bunch of additional format checks of the input would be useful including parts to strip potentially malicious code from the input file.

Login/password based authentication

We would like to use login/password based authentication.

Please integrate the OmniAuth Identity Provider as an alternative to Twitter/Google.

Allow deletion of dataset from config page

Minor useful addition:

While deleting a page from the list of uploads is great, sometimes you want to do it from within the view dataset part (because you are reviewing what you wish to delete).

Can't validate p-values

Hi I have been trying to replicate the edgeR quasi-likelihood analysis with Degust. I managed to get the same values for logFC and logCPM but not the p-values. R-code looks alright to me. Any possible explanation to that?

Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.