Skip to content
#

cloudera-hadoop

Here are 40 public repositories matching this topic...

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

  • Updated Jul 23, 2025
  • HiveQL

The goal of this programming assignment is to compute the PageRanks of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the importance of the page. Many web search engines (e.g., Google) use PageRank scores in some form to rank user-submitted queries. The goals of …

  • Updated Apr 22, 2018
  • Java

Improve this page

Add a description, image, and links to the cloudera-hadoop topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cloudera-hadoop topic, visit your repo's landing page and select "manage topics."

Learn more