Topic: hadoop-docker Goto Github

Some thing interesting about hadoop-docker

👇 Here are 31 public repositories matching this topic...

adisve / hadoop-spark-cluster

hadoop-docker,A Spark/Hadoop-Docker Cluster template for working with Big Data

User: adisve

big-data docker docker-hadoop docker-spark hadoop hadoop-docker pyspark spark big-data-docker spark-docker

big-data-europe / docker-hadoop

hadoop-docker,Apache Hadoop docker image

Organization: big-data-europe

hadoop-docker docker-hadoop hadoop hadoop-cluster docker

christinali91 / mapreduceproject_google-search-auto-complete

hadoop-docker,

User: christinali91

hadoop-docker mapreduce

codito / hadoop-expt

hadoop-docker,Experiments with Hadoop cluster setups in Docker

User: codito

docker docker-compose hadoop hadoop-cluster hadoop-docker

fredrikhgrelland / docker-hadoop

hadoop-docker,

User: fredrikhgrelland

hadoop docker hadoop-docker datamesh

hoangnv2001 / docker-hadoop-hive-spark-zeppelin-hue-superset

hadoop-docker,Bigdata stack with Hadoop + Hive +Spark + Zeppelin + Hue + Superset

User: hoangnv2001

docker docker-hadoop hadoop hadoop-docker

hyeonsangjeon / dataplatform

hadoop-docker,Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.

User: hyeonsangjeon

hadoop hadoop-cluster hadoop-docker hadoop-mapreduce hadoop-ecosystem hive pyspark-notebook zeppelin-notebook

imdeepanshugpt / hadoop

hadoop-docker,Hadoop-Cluster

User: imdeepanshugpt

hadoop hadoop-mapreduce hadoop-filesystem hadoop-cluster hadoop-docker hadoop-streaming hadoop-framework docker docker-compose docker-container

jbw / build-hadoop

hadoop-docker,Build Hadoop with Docker for Ubuntu. See releases for different architectures such as armv7l

User: jbw

armv7 docker hadoop hadoop-docker raspberry-pi

jbw / hadoop-docker-cluster

hadoop-docker,Hadoop cluster on Docker (single host)

User: jbw

docker hadoop hadoop-cluster hadoop-docker hadoop-mapreduce

jinho-yoo-jack / hadoopcluster

hadoop-docker,based Docker

User: jinho-yoo-jack

docker-compose hadoop hadoop-cluster hadoop-docker

juancasado / hadoop-docker

hadoop-docker,Hadoop deployment on docker and Docker Swarm

User: juancasado

Home Page: http://www.mrblissfulgrin.com

hadoop hadoop-docker pig-latin flume twitter hbase hive postgresql

kevin85421 / docker-compile-hadoop

hadoop-docker,Compile hadoop in docker container

User: kevin85421

hadoop-docker hadoop-compile docker

lyingbo / hadoop-cluster-docker

hadoop-docker,Run Hadoop Cluster within Docker Containers

User: lyingbo

hadoop-cluster hadoop-docker hadoop-3-2-0

marycboardman / assessment-attempts

hadoop-docker,Data processing using docker containers, kafka, spark, and hadoop

User: marycboardman

cloudera cloudera-hadoop cloudera-hadoop-framework digital-ocean digitalocean docker docker-compose docker-container docker-image hadoop hadoop-docker hadoop-hdfs kafka pyspark spark spark-sql sparksql zookeeper

mengmsun / hadoop-in-docker

hadoop-docker,Hadoop in docker cluster, created by docker-compose. Create Hadoop cluster in less than 5mins.

User: mengmsun

docker-compose hadoop-cluster hadoop-docker hdfs-cluster hdfs-docker hadoop hdfs docker

mgosi / big-data-analysis-using-mapreduce-in-hadoop

hadoop-docker,We explore data by using Big Data Analysis and Visualization skills. To obtain this, we perform 3 main operations. i.e. i)Data Aggregation through different sources. ii) Big Data Analysis using MapReduce and iii) Visualization through Tableau. Data Analysis is very critical in understanding the data, and what we can do with the data. For small datasets it is easier to process and obtain the results. But as for big companies, it becomes crucial for them to obtain the trends of the company for any changes need to be made. Hence we introduce Big Data Analysis to solve this problem. In this lab, we collect close to 20000 tweets, 500 articles on New York Times and 500 articles on Common Crawl Data about Entertainment, which is our main topic of discussion. Using this data, we perform preprocessing and feed it to a MapReduce to find the Word Count and Word Co-Occurrence. Using this, we find the trend of the data collected in this topic. We have used Python to perform Data Analysis.Data Analysis is very critical in understanding the data, and what we can do with the data. For small datasets it is easier to process and obtain the results. But as for big companies, it becomes crucial for them to obtain the trends of the company for any changes need to be made. Hence we introduce Big Data Analysis to solve this problem. In this lab, we collect close to 20000 tweets, 500 articles on New York Times and 500 articles on Common Crawl Data about Entertainment, which is our main topic of discussion. Using this data, we perform preprocessing and feed it to a MapReduce to find the Word Count and Word Co-Occurrence. Using this, we find the trend of the data collected in this topic. We have used Python to perform Data Analysis.

User: mgosi

big-data big-data-analytics tableau common-crawl twitter-api tweet-collector data-pipeline hadoop-docker docker hdfs data-processing

mjaglan / docker-hadoop-distributed-mode

hadoop-docker,Run Apache Hadoop 2.7 inside docker container in Multi-Node Cluster mode

User: mjaglan

docker dockerfile hadoop hadoop-docker

mjaglan / docker-hadoop-pseudo-distributed-mode

hadoop-docker,Run Apache Hadoop 2.7 inside docker container in pseudo-distributed mode

User: mjaglan

docker dockerfile hadoop hadoop-docker

mr-ravin / smart-hadoop-cluster-smhacl

hadoop-docker,This is an automated hadoop cluster building tool,which implements distributed computing for creating the cluster over the network. This is implemented in python 2.7

User: mr-ravin

python-2 hadoop-cluster docker automation distributed-systems big-data hadoop hadoop-hdfs hadoop-docker