Giter VIP home page Giter VIP logo

strata-sanjose-2017's Introduction

Strata-2017-SAN-JOSE

This tutorial can either be run in spark-shell or in an IDE (IntelliJ or Scala IDE for Eclipse)

Below are the steps for the setup.

Pre-requisites for Installation

Java/JDK 1.7+ has to be installed on the laptop before proceeding with the steps below.

Running in spark-shell

Download Spark 2.1.0

Download Spark 2.1.0 from here : http://spark.apache.org/downloads.html

Direct Download link : http://d3kbcqa49mib13.cloudfront.net/spark-2.1.0-bin-hadoop2.7.tgz

Install Spark 2.1.0 on Mac

tar -zxvf spark-2.1.0-bin-hadoop2.7.tgz

export PATH=$PATH:/Users/path_to_downloaded_spark/spark-2.1.0-bin-hadoop2.7/bin

Running spark-shell on mac

  • spark-shell

Install Spark 2.1.0 on Windows

Unzip spark-2.1.0-bin-hadoop2.7.tgz

Add the spark bin directory to Path : ...\spark-2.1.0-bin-hadoop2.7\bin

Set up winutils.exe on Windows (not needed on mac)

  • download winutils.exe from https://github.com/steveloughran/winutils/tree/master/hadoop-2.6.0/bin
  • move it to c:\hadoop\bin
  • set HADOOP_HOME in your environment variables
    • HADOOP_HOME = C:\hadoop
  • run from command prompt:
    • C:\hadoop\bin\winutils.exe chmod 777 /tmp/hive
  • run spark-shell from command prompt with extra conf parameter
    • spark-shell --driver-memory 2G --executor-memory 3G --executor-cores 2 -conf spark.sql.warehouse.dir=file:///c:/tmp/spark-warehouse

Pasting code in spark-shell

When pasting larger sections of the code in spark-shell, use the below:

scala> :paste

Running in IDE

If you prefer to use IDE over spark-shell, below are the steps.

You can either use IntelliJ or Scala IDE for Eclipse.

IntelliJ

Scala IDE for Eclipse

Summary of Downloads needed

Have the following downloaded before the session

Git

Nice to have

strata-sanjose-2017's People

Contributors

whitefangbuck avatar vartikas avatar jshmain avatar jayantshekhar avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.