Giter VIP home page Giter VIP logo

spark-saturday-advanced-hail's Introduction

Spark Saturday Advanced - Hail 0.2 on Databricks

Instructions to Register for Free Databricks Community Edition

Instructions for Creating Hail Resource Libraries

The Hail resources necessary for running this tutorial are located in the resources folder.

(Based on Hail-0.2-7a98b6a65d44. For other builds, please see resources in Hail's artifact bucket located at gs://hail-common/builds)

After downloading them, navigate to the Workspace/Users section of the notebook, and then to your user folder.

workspace

Within your user folder, right click and select create library.

createlibrary

On the next screen, create a library for the hail-all-spark.jar (name it whatever you want). Select the option to automatically have this library attached to all clusters (or manually attach it at a later point). Repeat the process for the .egg file.

Instructions for Creating a Cluster

Navigate to "Clusters" on the left-hand side of the screen and select "Create Cluster" on the top of the page that appears.

cluster

Name your cluster and then fill in the Spark Config like below.

sparkconfig

spark.serializer org.apache.spark.serializer.KryoSerializer
spark.kryo.registrator is.hail.kryo.HailKryoRegistrator
spark.databricks.delta.preview.enabled true
spark.driver.extraClassPath ./hail-all-spark.jar
spark.executor.extraClassPath ./hail-all-spark.jar

Finally, click create cluster.

Importing the Hail Notebook

Please note that this notebook was adapted from https://hail.is/docs/stable/tutorials/01-genome-wide-association-study.html for use on Databricks.

Useful Links

spark-saturday-advanced-hail's People

Contributors

mptrepanier avatar

Stargazers

 avatar Gregory Grubbs avatar Trinadh Gupta avatar Japneet Singh avatar Maurice Nsabimana avatar  avatar Freeman Wang avatar Wendy Wong avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.