Skip to content

JinnOppa/Hotel-Booking-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Hotel Booking Data Analysis

This project is my individual endeavor to perform exploratory data analysis (EDA) on a hotel booking dataset. The dataset provides valuable insights into various aspects of bookings, cancellations, customer behavior, pricing, and room types.

Dataset Source

The dataset used for this analysis is obtained from Kaggle. You can find the dataset on the Kaggle website at the following link: Hotel Booking Demand.

Dataset Description

The hotel booking dataset contains information about hotel bookings and includes various details such as:

The dataset contains information about hotel bookings and includes the following details:

  • hotel: Hotel type (Resort Hotel or City Hotel)
  • is_canceled: Binary variable indicating if the booking was canceled (1) or not (0)
  • lead_time: Number of days between booking date and arrival date
  • arrival_date_year: Year of arrival date
  • arrival_date_month: Month of arrival date
  • arrival_date_week_number: Week number of the year for the arrival date
  • arrival_date_day_of_month: Day of the month for the arrival date
  • stays_in_weekend_nights: Number of weekend nights (Saturday or Sunday) the guest stayed
  • stays_in_week_nights: Number of weeknights (Monday to Friday) the guest stayed
  • adults: Number of adults
  • children: Number of children
  • babies: Number of babies
  • meal: Type of meal booked (e.g., Bed & Breakfast, Full board, etc.)
  • country: Country of origin of the guest
  • market_segment: Market segment designation (e.g., Online TA, Offline TA/TO, etc.)
  • distribution_channel: Booking distribution channel (e.g., Direct, Corporate, etc.)
  • is_repeated_guest: Binary variable indicating if the guest is a repeated guest (1) or not (0)
  • previous_cancellations: Number of previous booking cancellations by the guest
  • previous_bookings_not_canceled: Number of previous bookings not canceled by the guest
  • reserved_room_type: Code for the type of room reserved
  • assigned_room_type: Code for the type of room assigned
  • booking_changes: Number of changes made to the booking
  • deposit_type: Type of deposit made for the booking (Non-refundable, Refundable, No Deposit)
  • days_in_waiting_list: Number of days the booking was on the waiting list
  • customer_type: Type of booking (Transient, Contract, Group, Transient-Party)
  • ADR: Average Daily Rate (Price per room, not per person)
  • required_car_parking_spaces: Number of car parking spaces required by the guest
  • total_of_special_requests: Number of special requests made by the guest
  • reservation_status: Current status of the reservation (e.g., Check-Out, Canceled, No-Show)
  • reservation_status_date: Date of the last reservation status update

Key Features

  1. Data Exploration: Load the dataset, examine its size, and check for missing values.
  2. Data Preprocessing: Handle missing values and perform feature engineering.
  3. Correlation Analysis: Visualize the correlation between numerical features.
  4. Cancelation Analysis: Analyze the cancelation rate by hotel and market segment.
  5. Customer Analysis: Explore customer types, number of adults, children, and babies, and customer behavior based on market segment and hotel type.
  6. Pricing Analysis: Examine the average daily rate (ADR) by hotel and the distribution of the average total price per booking.
  7. Room Analysis: Analyze the room type distribution and the relationship between reserved and assigned room types.
  8. Behavior Analysis: Investigate the distribution of bookings with different deposit types, the number of special requests, and the car parking space requirement.
  9. Predictive Model: Build time series forecasting models for cancelations and future bookings.

Requirements

  • Python 3
  • Libraries: pandas, numpy, matplotlib, seaborn, statsmodels, scikit-learn

Getting Started

To start exploring the hotel booking dataset and performing the analysis, follow these steps:

  1. Clone the Repository: Clone this GitHub repository to your local machine using the command git clone https://github.com/yourusername/hotel-booking-analysis.git.
  2. Install Required Libraries: Install the necessary libraries by running pip install -r requirements.txt.
  3. Open the Notebook: Open the Jupyter Notebook hotel_booking_analysis.ipynb in your preferred Python environment.
  4. Run the Notebook: Execute the notebook cells one by one to perform the data analysis and visualize the results.

Contact Information

For any further inquiries or information regarding this project, please feel free to contact me at eugene.winata@gmail.com.

Enjoy exploring the Hotel Booking Data Analysis project!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published