Author: Yingren Wang
Course: CS 488 Big Data
Semester: Fall 2020
Project: Data Analysis of Iris and Indian Pines Datasets
Summary: The project is designed for data visualization and analysis for Iris and Indian Pines Datasets using Python. The project contains various analysis including linear regression analysis using sklearn function, pre-clustering analysis using Elbow method, unsupervised learning using K-Means, and hierarchical clustering (with Euclidean, and Cosine distances) using certain amount of clusters, and computation of the cluster validity indices (Davies Bouldin and Silhouette index).