Skip to content

Koray-afk/SectionC_Group16_Hotel_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

40 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation


πŸ”₯ Hotel Booking Performance & Cancellation Intelligence Dashboard

Industry-Style Analytics Project β€” Analyzing booking data to drive smarter hospitality decisions



This project analyzes hotel booking data to uncover patterns in cancellations, revenue trends, seasonality, and customer behavior. The outcome is an executive-style, interactive dashboard built in Google Sheets to support data-driven decision-making for hotel management teams.
πŸ“ŠOverall booking performance
🚫Cancellation risk drivers
πŸ’°Revenue and pricing trends
πŸ“‘Channel and customer segment performance


Context: The hospitality industry faces high revenue volatility due to booking cancellations and fluctuating seasonal demand.

Core Problem: High cancellation rates and inconsistent booking behavior reduce revenue predictability and operational efficiency.

Objective β€” Use historical booking data to:

  • βœ… Reduce cancellation rates
  • βœ… Improve revenue stability
  • βœ… Optimize pricing and deposit policies
  • βœ… Identify high-value, stable customer segments

❓ Key Business Question

"How can hotel management reduce cancellations and maximize revenue using booking behavior insights?"


Property Details
πŸ“ Dataset NameHotel Booking Dataset
πŸ“ TypeStructured transactional data
πŸ“ Rows~8,700 (post-cleaning)
πŸ“Š Columns30+
πŸ“… Time Period2015–2017
πŸ”— SourceApproved academic dataset (imported into Google Sheets)

Key Attributes:

Category Fields
🏷️ Booking Status is_canceled, reservation_status
πŸ“… Time Features arrival_date_year, arrival_date_month
πŸ‘€ Guest Details adults, children, babies, country
πŸ“‘ Channel Info market_segment, distribution_channel
πŸ’΅ Revenue Proxy adr


All cleaning and preprocessing were performed in Google Sheets as per capstone requirements.

Step Description
πŸ” Duplicate Removal Duplicate booking records removed using built-in deduplication
❓ Missing Values Numeric fields β†’ 0 Β· Categorical fields β†’ "Unknown" Β· "NULL" standardized
πŸ”’ Data Type Standardization Converted numeric fields stored as text into proper numeric format
⚠️ Invalid Values Negative ADR values flagged and handled based on business logic
βœ‚οΈ Text Normalization Trimmed whitespace and standardized category labels
🌍 Country Mapping ISO country codes mapped to full country names for dashboard readability

πŸ“ A detailed Logs/Audit sheet documents each transformation step for traceability.



Derived features created to support KPI and dashboard analysis:

Feature Formula / Logic
πŸ‘₯ Total Guests adults + children + babies
πŸŒ™ Total Stay Length weekday nights + weekend nights
πŸ‘¨β€πŸ‘©β€πŸ‘§ Family Flag Family vs Non-Family bookings
πŸ’° Revenue (Derived) ADR Γ— Total Stay Length
πŸ“… Month Number For chronological sorting of monthly trends


πŸ“‹

Total Bookings

🚫

Total Cancellations

πŸ“‰

Cancellation Rate (%)

πŸ’°

Total Revenue

πŸ’΅

Average Daily Rate

These KPIs provide an executive snapshot of booking performance and revenue stability.



Pivot tables were created in Google Sheets to support dashboard visualizations:

  • πŸ“Š Cancellation Rate by Market Segment
  • πŸ“ˆ Monthly Revenue & Booking Trends
  • πŸ’΅ ADR by Hotel Type and Month
  • ⏱️ Lead Time Group vs Cancellation %
  • 🏷️ Deposit Type vs Cancellation %
  • πŸ‘€ Customer Type Performance

These pivots serve as the data source for all charts in the dashboard.



The final dashboard presents decision-ready insights for non-technical stakeholders.

πŸ”ΉComponentDescription
πŸ“‹KPI CardsBookings, Cancellation Rate, Revenue, ADR
πŸ“ˆLine ChartRevenue trend by month
πŸ“ŠBar ChartsCancellation by market segment, deposit type
πŸ“‰Column ChartLead time vs cancellation
πŸŽ›οΈFilters / SlicersHotel type, year, market segment, customer type

🎨 The dashboard is designed with a clean, executive layout for quick interpretation.



πŸ”΄Group and OTA bookings exhibit the highest cancellation rates.
🟠Long lead-time bookings show higher cancellation risk.
🟒Revenue peaks during specific seasonal periods.
πŸ”΅Repeat guests demonstrate lower cancellation probability.


# Recommendation
1️⃣ Introduce stricter or non-refundable policies for high-risk segments
2️⃣ Adjust pricing dynamically during peak demand months
3️⃣ Promote direct and corporate channels to reduce cancellation risk
4️⃣ Offer loyalty benefits to repeat guests to improve booking stability


⚠️ Limitations πŸš€ Future Scope
Revenue derived using ADR (approximation) Predictive cancellation modeling
No external market or competitor data Time-series forecasting of demand
Static historical analysis (no real-time feed) Real-time dashboard integration
β€” Deeper segmentation using ML models


Resource Link
πŸ“Š Google Sheets (Dashboard & Analysis) Open Spreadsheet
🎀 Presentation (PPT) Open on Canva
πŸ“ Project Report (PDF) View Report


Kasula Lalithendra

Abhiman Singh

Vridhi Chaudhary

Ritik Raj

Anant Singh

Rudraksh Sharma


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors