sagemakerwithfedb's Introduction

SageMakerWithFEDB

make a realtime ml application on sage maker with fedb

执行特征处理任务

创建emr集群并且登陆到master节点

在aws上面选择Spark2.4.4集群创建，然后通过ssh登陆到集群的master节点, 如何登陆到master请参考aws ssh 登陆到emr master节点 emr 版本请选择emr-6.0.0

提交任务

git clone https://github.com/4paradigm/SageMakerWithFEDB.git
cd SageMakerWithFEDB
# 将s3上面的包下载下来
aws s3 cp s3://xxxxx/fesql-spark-0.0.1-SNAPSHOT-with-dependencies.jar .
#下载jar包
wget https://storage.4paradigm.com/api/public/dl/euvK52oV/fesql-spark-0.0.1-SNAPSHOT-with-dependencies.jar
wget https://storage.4paradigm.com/api/public/dl/EZ55KLMS/train_data.snappy.parquet
aws s3 cp train_data.snappy.parquet {你的s3目录}

更新fe.py 里面fe_input 变量为 train_data.snappy.parquet 在s3上面的路径和更新fe_output 变量为你的一个s3目录，用于保存特征输出结果

# 通过spark-submit提交任务到yarn集群
sh fe.sh

执行完成后在输出s3目录看到如下文件

_SUCCESS
partxxxxx.csv
...

则表示正确产生了特征输出

入群交流

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.

Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

TensorFlow

An Open Source Machine Learning Framework for Everyone

Django

The Web framework for perfectionists with deadlines.

Laravel

A PHP framework for web artisans

D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

web

Some thing interesting about web. New door for the world.

server

A server is a program made to process requests and deliver data to clients.

Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

Visualization

Some thing interesting about visualization, use data art

Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Microsoft

Open source projects and samples from Microsoft.

Google

Google ❤️ Open Source for everyone.

Alibaba

Alibaba Open Source for everyone

D3

Data-Driven Documents codes.

Tencent

China tencent open source team.

cece00 / sagemakerwithfedb Goto Github PK