Giter VIP home page Giter VIP logo

screenhca's Introduction

ScreenHCA (SHCA)

Python package

Prerequisites

Getting Started

Install modules for python with pip:

cd python
pip install -r requirements.txt

Run the following command to test your installation:

cd python
python ./main.py ./test_input.json

Usage

To find similar screens to a new screen, you can create a new JSON file that contains basic informations about the screening. Below is an example that lists the informations needed (if you see strings that are devided by a | it means that you have to choose one of the supported strings, feel free to open an issue if there is a missing choice):

{
  "SCREEN_ID": "<integer (if it is a new screen you can type '-1' here)>",
  "SCORES_SIZE": "<integer>",
  "FULL_SIZE": "<integer>",
  "NUMBER_OF_HITS": "<integer>",
  "SCREEN_TYPE": "Negative Selection | Positive Selection | Phenotype Screen",
  "DURATION": "<integer> Days",
  "METHODOLOGY": "Knockout | Inhibition | Activation",
  "ENZYME": "CAS9 | d-Cas9-KRAB | SAM (NLS-dCas9-VP64/MS2-p65-HSF1) | sunCas9"
}

Once you created the JSON file you can run the script with the path to the file as a parameter:

# From the root folder of this repo
cd python
python ./main.py ./your-file.json

From there you can see the clustering visualized and you can find CSV files in the results folder that show the separate clusters as well as a PNG file of the diagram for later use.

Configuration

The file ./python/config/config.yaml can be edited to change behaviours and wordings. The access_key field must be set to a valid key in order for this script to work! You can generate a new access key here: https://orcsws.thebiogrid.org

orcs:
  access_key: "<enter secret here or set BIOGRID_ACCESSKEY as environment variable>"
  base_url: "https://orcsws.thebiogrid.org"

clustering:
  pruning: 4
  max_distance: 11

results:
  folder_path: "./results"
  diagram_file_name: "diagram.png"
  plot:
    title: "Agglomerative Clustering with pruning = 4 and max. distance threshold = 11"
    x_label: "Number of points in node (or index of point if no parenthesis)"
    y_label: "Distance"
  input_data_csv_name: "input_data.csv"
  cluster_data_txt_name: "cluster_data.txt"
  cluster_data_csv_folder: "./results/clusters"
  cluster_data_csv_prefix: "cluster-"

screenhca's People

Contributors

czolbem avatar wujood avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.