Data Analytics Bootcamp - Labs
This repo contains most of the coding labs completed as part of the Data Analytics Bootcamp at Ironhack (full-time cohort, April-June 2022).
The repo is organized following the 3 modules of the bootcamp (see full index below)
Module
Content
Module 1
Introduction to Data analytics and Python (data wrangling & cleaning, API & Web scraping, Git, SQL, Python)
Module 2
Advanced Data Analytics (statistics & probability, inferential statistics, hypothesis testing, Tableau)
Module 3
Machine Learning Fundamentals (supervised & unsupervised learning; build, train, and evaluate ML models)
Labs in this repo - Index
Week
Labs
Language
Description
1
intro-to-pandas
Python
Pandas (Series and DataFrames), how to work with them, how to obtain them from other data structures, and how to perform basic calculations with them.
1
list-comprehension
Python
Constructing list comprehensions and using them to extract and filter information in a variety of scenarios.
1
numpy
Python
Introduction to NumPy array.
1
string-operations
Python
Practice how to manipulate strings, and to use string manipulation techniques to create Bag of Words (BoW).
1
tuple-set-dict
Python
Practice Python native data structures and become proficient at using them.
2
advanced-pandas
Python
Practice advanced functions, changing the index and method chaining in Pandas.
2
data-cleaning
Python
Practice data cleaning techniques.
2
dataframe-calculations
Python
Refining problem-solving process. Breaking down a complex problem into a subset of less complex problems, then tackle each sub problems in a progressive order.
2
import-export
Python
This lab discuss the task of importing and exporting data into pandas using different file formats.
2
sql-first-queries
SQL
Practice SQL queries to answer some questions.
2
mysql-select
SQL
Practice how to use the MySQL SELECT statement.
2
mysql
SQL
Practice how to design, create, and manage a database.
3
advanced-regex
Python
Practice how to put together regular expression.
3
api-scavengers
Python
Practice how to make requests to APIs and parse the JSON responses to extract the information we need.
3
matplotlib-seaborn
Python
Create different types of visualizations using matplotlib and seaborn: bar charts, scatter charts and box plots among many others.
3
pandas-deep-dive
Python
Perform a variety of operations using the Pandas library.
Week
Labs
Language
Description
4
descriptive-stats
Python, Statistics
Practice better understanding of basic descriptive statistics, how to compute the basic descriptive metrics and compare them in different use cases.
4
pivot-table-and-correlation
Python
Practice Pandas pivot table to extract insights from data, and to describe the strength and direction of the relationship between two variables.
4
regression-analysis
Python, Statistics
Apply different types of regressions, and use them to understand the trends in data, and predict future values of the outcome.
4
subsetting-and-descriptive-stats
Python, Statistics
Use Pandas library to extract insights from your data by dividing it into into several subsets, and use Pandas descriptive statistics functions.
5
mini-project
Python, Statistics
Review the concepts of Inferential Statistics. Clean data, do Exploratory Data Analysis, and do hypothesis testing.
5
confidence-intervals
Python, Statistics
Apply the knowledge on confidence intervals, using normal, student's t and chi-squared distributions.
5
hypothesis-testing-1
Python, Statistics
Construct one sample hypothesis test and confidence intervals.
5
hypothesis-testing-2
Python, Statistics
Construct one sample hypothesis test and confidence intervals.
5
intro-probability
Python, Statistics
Tackle some basic probability questions.
5
probability-distribution
Python, Statistics
Practice probability distribution to discover meaningful relationship between events and make better data-driven decision.
Week
Labs
Language
Description
7
intro-to-ml
Python
Practice how to properly prepare the data for ML algorithms.
7
supervised-learning-feature-extraction
Python
Explore the techniques to extract meaningful information from data, by transforming the data using derived columns, grouping the data and using aggregated information, or cleaning and reformatting the data.
7
supervised-learning-sklearn
Python
Explore the scikit-learn library in the context of supervised learning.
7
supervised-learning
Python
Predict malicious vs benign websites using supervised learning.
The content and org of this repo have been inspired by our amazing TA at Ironhack Lisbon Gladys Mawarni and other former bootcamp students.