MapReduce, Spark, Java, and Scala for Data Algorithms Book
Updated Oct 14, 2024 - Java
Cloud Shuffle Service (CSS) is a general-purpose remote shuffle solution for compute engines, including Spark, Flink, and MapReduce.
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Simplified ETL process in Hadoop using Apache Spark. Includes a complete ETL pipeline for a data lake: SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations.
Big data projects implemented by Maniram Yadav
K-Means algorithm implementation with Hadoop and Spark for the Cloud Computing course of the MSc AIDE at the University of Pisa.
A collection of MapReduce problems and solutions
Shows how to install Hadoop on Google Colab and how to run MapReduce jobs in Hadoop
Hadoop MapReduce word counting with Java
Chinese text mining | Public opinion analysis | Hadoop | Java | MapReduce
Projects done in the Cloud Computing course.
Source code for the examples in the book Cloud Computing Solutions Architect: A Hands-On Approach by Arshdeep Bahga and Vijay Madisetti
Data Engineering Course
Student projects in the Big Data field.
Twitter + Flume + Hadoop (HDFS, MapReduce) + Neo4j + Python
Helm chart for Apache Hadoop using multi-arch docker images
Search Engine projects
2021 Spring (Distributed Computing Systems) Distributed Systems and Programming