Topic: apache-hadoop
Something interesting about apache-hadoop
apache-hadoop,Batch processing runtime analytics
User: 0lifr1
apache-hadoop,This repository aims to develop a basic search engine utilizing Hadoop's MapReduce framework to index and process extensive text corpora efficiently. The dataset used for this project is a subset of the English Wikipedia dump, totaling 5.2 GB in size. The project focuses on implementing a naive search algorithm to address challenges in information.
User: aaqib-ahmed-nazir
apache-hadoop,This project aims to establish a data streaming pipeline with storage, processing, and visualization
User: abdelhakim-gh
apache-hadoop,The goal of this project is to learn data processing using Spark with practical examples on datasets and also apply programming with Scala.
User: abdelhakim-gh
apache-hadoop,A BASH script to setup Apache Hadoop and Apache Hive with Derby database on Debian GNU/Linux
User: aquib-sh
apache-hadoop,Big Data pipeline for real-time sensor fusion and predictive analysis.
User: bahaabrougui
apache-hadoop,
User: bdoepf
apache-hadoop,Implementation of Statistical Methods via Hadoop Map-Reduce Library.
User: berksudan
apache-hadoop,Preparation of a development and testing environment for Apache Hadoop.
User: carlosemsantana
apache-hadoop,Kubernetes operator for managing the lifecycle of Apache Hadoop Yarn Tasks on Kubernetes.
User: chriskery
apache-hadoop,Some simple, kinda introductory projects based on Apache Hadoop to be used as guides in order to make the MapReduce model look less weird or boring.
User: coursal
apache-hadoop,The source code developed and used for the purposes of my thesis with the same title under the guidance of my supervisor professor Vasilis Mamalis for the Department of Informatics and Computer Engineering of the University of West Attica.
User: coursal
apache-hadoop,Samples related to data engineering, e.g. spark, embulk, airflow, etc.
User: esakik
apache-hadoop,A project for Advanced Topics in Database Systems course of ECE, NTUA for fall semester of academic year 2020-2021.
User: faystatha
apache-hadoop,An email spam filter using Apache Spark’s ML library
User: felidsche
apache-hadoop,AWS Cloudera Hadoop setup with H2O, Spark, MR
User: gangodu
apache-hadoop,A Spark application to merge small files on Hadoop
User: guru107
apache-hadoop,Set of Input Formats for Hadoop Streaming
User: haodemon
apache-hadoop,Exercises in the Scala programming language with an emphasis on big data programming and applications in Apache Hadoop and Apache Spark.
User: heracliteanflux
apache-hadoop,This repository provides a guide to preprocess and analyze the network intrusion data set using NumPy, Pandas, and matplotlib, and implement a random forest classifier machine learning model using Scikit-learn.
User: jagdish4501
apache-hadoop,Solving simple tasks with Apache Hadoop.
User: jaskier07
apache-hadoop,Instructions for Installing Giraph-1.2.0
User: jordan396
Home Page: https://jordan396.github.io/Giraph-1.2.0-Installation/
apache-hadoop,A Python implementation of Minimal MapReduce Algorithms for Apache Spark
User: kowaalczyk
apache-hadoop,Full-term project for the Parallel Computing exam at the University of Florence. Implementation of Twitter sentiment analysis using Hadoop, Apache Storm, and HBase to achieve parallelization.
User: lorenzogianassi
apache-hadoop,This project implemented a lambda architecture for analyzing domestic flight data in the US from 2009 to 2020. It used Apache Spark for batch processing, Spark Streaming for real-time analysis, and SVM models to predict flight cancellations and delays, with Docker for cluster management and Grafana for real-time visualization.
User: lucass97
apache-hadoop,Hadoop, HBase, Phoenix, and Zookeeper Integration
User: luckyp71
apache-hadoop,Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
User: mahmoudparsian
Home Page: http://mapreduce4hackers.com
apache-hadoop,MapReduce, Spark, Java, and Scala for Data Algorithms Book
User: mahmoudparsian
Home Page: http://mapreduce4hackers.com
apache-hadoop,Projects for a Cloud Computing course
User: mohammadtavakoli78
apache-hadoop,An implementation of Apache Spark (combined with PySpark and Jupyter Notebook) on top of a Hadoop cluster using Docker
User: nghoanglong
apache-hadoop,Export Hadoop YARN (ResourceManager) metrics in Prometheus format
Organization: pbwebmedia
apache-hadoop,WSL2 installation for the ABD practicum
User: rachmanz
apache-hadoop,Apache Hadoop multi-node setup using Ansible
User: rahulinux
apache-hadoop,A fast, scalable and distributed community detection algorithm based on CEIL scoring function.
Organization: rbc-dsai-iitm
apache-hadoop,Containerized Apache Hive Metastore for horizontally scalable Hive Metastore deployments
Organization: realtimedatalake
apache-hadoop,HADOOP 3.1.0 winutils
User: s911415
apache-hadoop,Built a Large Scale Distributed Data Processing system for Streaming Analytics using Hadoop Ecosystem (Apache Spark and HDFS), in Cloud for real-time spatial analytics.
User: saitejavishalj
apache-hadoop,Big Data technologies can be defined as software tools for analyzing, processing, and extracting data from extremely large and complex data sets that traditional management tools cannot handle
User: sawadogosalif
apache-hadoop,🌟Spark Ceph Connector: Implementation of Hadoop Filesystem API for Ceph
User: shuuji3
apache-hadoop,Applying MapReduce in Java on a Twitter dataset using Apache Hadoop
User: smohammadhejazi
apache-hadoop,A small program to validate Census data against Aadhar data
User: surbhitawasthi
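apache-hadoop,hadoop-cos (the CosN file system) provides integration support for big data computing frameworks such as Apache Hadoop, Spark, and Tez, letting them read and write data stored on Tencent Cloud COS just as they would access HDFS. It can also serve as Deep Storage for query and analytics engines such as Druid.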
Organization: tencentyun
Home Page: https://cloud.tencent.com/document/product/436/6884?!preview=true&lang=zh
apache-hadoop,My portfolio | under development
User: trentbrunson
Home Page: https://trentbrunson.github.io/
apache-hadoop,COVID-19 data analysis with MapReduce
User: trisha11r
apache-hadoop,This repository contains all the material related to this big data certification.
User: umer86
apache-hadoop,Learning Apache Hadoop for Big Data. Moreover, exploring Map Reduce, Apache Spark RDD, Distributed Processing and Stream Processing
User: unobatbayar
apache-hadoop,💡 Recommendation system developed during the Carrefour Backend Developer Bootcamp, using Apache Mahout
User: victorpereira01
apache-hadoop,Simplified Hadoop Setup and Configuration Automation
User: whoami-anoint
Home Page: https://github.com/whoami-anoint/EasyHadoop.wiki.git
apache-hadoop,Logback appender for Apache Flume
User: yingzhuo