Topic: hadoop-docker Goto Github
Some thing interesting about hadoop-docker
Some thing interesting about hadoop-docker
hadoop-docker,A Spark/Hadoop-Docker Cluster template for working with Big Data
User: adisve
hadoop-docker,Apache Hadoop docker image
Organization: big-data-europe
hadoop-docker,
User: christinali91
hadoop-docker,Experiments with Hadoop cluster setups in Docker
User: codito
hadoop-docker,
User: fredrikhgrelland
hadoop-docker,Bigdata stack with Hadoop + Hive +Spark + Zeppelin + Hue + Superset
User: hoangnv2001
hadoop-docker,Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
User: hyeonsangjeon
hadoop-docker,Hadoop-Cluster
User: imdeepanshugpt
hadoop-docker,Build Hadoop with Docker for Ubuntu. See releases for different architectures such as armv7l
User: jbw
hadoop-docker,Hadoop cluster on Docker (single host)
User: jbw
hadoop-docker,based Docker
User: jinho-yoo-jack
hadoop-docker,Hadoop deployment on docker and Docker Swarm
User: juancasado
Home Page: http://www.mrblissfulgrin.com
hadoop-docker,Compile hadoop in docker container
User: kevin85421
hadoop-docker,Run Hadoop Cluster within Docker Containers
User: lyingbo
hadoop-docker,Data processing using docker containers, kafka, spark, and hadoop
User: marycboardman
hadoop-docker,Hadoop in docker cluster, created by docker-compose. Create Hadoop cluster in less than 5mins.
User: mengmsun
hadoop-docker,We explore data by using Big Data Analysis and Visualization skills. To obtain this, we perform 3 main operations. i.e. i)Data Aggregation through different sources. ii) Big Data Analysis using MapReduce and iii) Visualization through Tableau. Data Analysis is very critical in understanding the data, and what we can do with the data. For small datasets it is easier to process and obtain the results. But as for big companies, it becomes crucial for them to obtain the trends of the company for any changes need to be made. Hence we introduce Big Data Analysis to solve this problem. In this lab, we collect close to 20000 tweets, 500 articles on New York Times and 500 articles on Common Crawl Data about Entertainment, which is our main topic of discussion. Using this data, we perform preprocessing and feed it to a MapReduce to find the Word Count and Word Co-Occurrence. Using this, we find the trend of the data collected in this topic. We have used Python to perform Data Analysis.Data Analysis is very critical in understanding the data, and what we can do with the data. For small datasets it is easier to process and obtain the results. But as for big companies, it becomes crucial for them to obtain the trends of the company for any changes need to be made. Hence we introduce Big Data Analysis to solve this problem. In this lab, we collect close to 20000 tweets, 500 articles on New York Times and 500 articles on Common Crawl Data about Entertainment, which is our main topic of discussion. Using this data, we perform preprocessing and feed it to a MapReduce to find the Word Count and Word Co-Occurrence. Using this, we find the trend of the data collected in this topic. We have used Python to perform Data Analysis.
User: mgosi
hadoop-docker,Run Apache Hadoop 2.7 inside docker container in Multi-Node Cluster mode
User: mjaglan
hadoop-docker,Run Apache Hadoop 2.7 inside docker container in pseudo-distributed mode
User: mjaglan
hadoop-docker,This is an automated hadoop cluster building tool,which implements distributed computing for creating the cluster over the network. This is implemented in python 2.7
User: mr-ravin
hadoop-docker,Apache Pig Latin script to count letters in multiple input text files, using the HortonWorks Hadoop Sandbox or Google Cloud Platform
User: rishabhindoria
hadoop-docker,Setup hadoop cluster manually and automatically
User: rohit9314
hadoop-docker,基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark
User: ruoyu-chen
hadoop-docker,Hive 3 In Docker container
Organization: sharpdata
Home Page: https://hub.docker.com/repository/docker/sharpetl/hive3
hadoop-docker,Построение рекомендательной системы на основе алгоритма коллаборативной фильтрации и технологии Hadoop Streaming
User: sidl419
hadoop-docker,
User: simonprewo
hadoop-docker,Exercise files for Apache Hadoop Big Data Training
Organization: tertiarycourses
Home Page: https://www.tertiarycourses.com.sg/big-data-training-courses-singapore.html
hadoop-docker,Apache Hadoop Cluster Docker images
User: vietanh85
hadoop-docker,Toy Hadoop cluster combining various SQL-on-Hadoop variants
User: waltherg
hadoop-docker,Dockerizing an Apache Spark Standalone Cluster
User: wittline
Home Page: https://wittline.github.io/apache-spark-docker/
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.