paularthurm / pitviper Goto Github PK
View Code? Open in Web Editor NEWSnakemake pipeline for meta-analysis of functional screening data
License: GNU General Public License v3.0
Snakemake pipeline for meta-analysis of functional screening data
License: GNU General Public License v3.0
Add cell lines sub-selection
Allow to select all options at once
Alignment statistics should be save in a results sub-directory, such as results/{token}/bowtie2/statistics.txt
.
Replace Jupyter Lab automatic opening with Jupyter Notebook after PitViper execution when using GUI.
Add bowtie2=2.2.4 and samtools=1.14 to env.
To do:
--very-fast
Same as: -D 5 -R 1 -N 0 -L 25 -i S,1,2.00
--fast
Same as: -D 10 -R 2 -N 0 -L 22 -i S,1,1.75
--sensitive
Same as: -D 15 -R 2 -N 0 -L 20 -i S,1,0.75 (default in --local mode)
--very-sensitive
Same as: -D 20 -R 3 -N 0 -L 20 -i S,1,0.50
Min. reads cut-off in GUI.
Remove:
and
Move in GUI:
A refactoring of the previous script is necessary to make the purpose of the script clear.
Add a check button for union selection instead of default intersection in integration module.
Allow user to give as input a BED file for automatic annotation of targeted elements, i.e. super-enhancers.
To improve the function, we could use Python's subprocess library to execute the command instead of using os.system to run the command. This would allow us to access the output of the command and take action based on it. Additionally, we could use string formatters to make the command string more readable and easier to debug. Here is an example of how the function could be improved:
import subprocess
def run_pitviper(configfile, dry_run, jobs, notebook):
if notebook != "":
nb_opt = f"--edit-notebook {notebook}"
else:
nb_opt = ""
if dry_run:
cmd = f"snakemake -s workflow/Snakefile -n --configfile {configfile} --use-conda --cores {jobs}"
elif not dry_run:
cmd = f"snakemake -s workflow/Snakefile --configfile {configfile} --use-conda --cores {jobs} {nb_opt}"
print("Command:", cmd)
result = subprocess.run(cmd, shell=True, capture_output=True)
if result.returncode == 0:
# command succeeded
else:
# command failed
To do:
Our script seems to add a column in the design matrix that is not useful.
Improve visualization by:
Add description and explanations of how to use jupyter notebbok report and export to HTML.
Let users define negatives controls in GUI.
Add (MM10,MM9,MM8,HG18,HG19,HG38) to GUI.
Add an option to choose between "per cell line" or "per primary tissue" for depmap mutations heatmap.
Validate form in GUI.
Example: SFPQ gene return an error when using show_sgRNA_counts(token)
:
ValueError: The condensed distance matrix must contain only finite values.
Should add a non-zero value to all cells before using Clustergrammer2.
Add cut-off in GUI for IH method
Should be possible to enter multiple elements separated a comma in TextInput widgets.
Extensibility was one of the primary goal of PitViper. However, the current implementation of the report make it very difficult and tedious to add new tools.
We should think to a manner to generalize how all functions are used.
Creating a tool agnostic class as a common interface for all results could be a solution. This class should have several characteristics:
To create a new method, two features are mandatory: the name of the elements and at least one numerical score to rank the value (FDR, Bayesian Factor, etc.)
Find a better and more representative name for "in-house method". Then rename it in all scripts... :(
Some references are missing in GUI and report.
Use depmap_release() from http://bioconductor.org/packages/release/data/experiment/manuals/depmap/man/depmap.pdf to get data version and use it in file name to check if user have the last version.
"in-house" method default cut-off is different between modules in report.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.