Giter VIP home page Giter VIP logo

helping_your_hosts's Introduction

Helping Your Hosts - Airbnb Reviews and Ratings

Author: Ben McCarty


Overview

In a post-COVID world, hospitality faces challenges as travel restrictions are imposed and lifted (and then re-imposed). Travel and tourism came to a crashing halt in 2020 and still face challenges in returning to pre-2020 business levels.

As restless travelers look to escape the confines of their homes, they expect the same high-quality services and experiences as pre-COVID. Competition within the hospitality industry is stronger than ever, putting more pressure on businesses to regain, retain, and expand their guest ledgers.

The main performance metric for every company involved in hospitality is guest satisfaction. If a guest isn't satisfied, they are less likely to return for another visit and may share their experience with others, pushing away potential business.

Airbnb hosts face the same challenges as traditional hotels in these aggressive and challenging market conditions. In order to maximize their profitability and to distinguish themselves from traditional hotels, Airbnb needs to know which aspects of a host property are the strongest predictors of whether a guest will give a satisfaction score of 4.8 or higher (out of 5).


Data:

With the question of how to predict high satisfaction scores in mind, I obtained data about Airbnb host properties from the Inside Airbnb project for the Washington, D.C. area.

The dataset includes:

  • Details about the hosts (when they started hosting and host-provided bios)
  • Property details (bedrooms, bathrooms, and room/property types)
  • Booking details (price, availability, and min/max number of nights)

Methods:

I started my analysis with basic statistical overviews, identifying several columns of irrelevant data (URLs; details about how/when the data was collected; etc.). Additionally, I found large percentages of property listings that lacked 6 or more entries and several features that were missing 25%+ entries. Due to the substantial amount of missing data, I chose to drop those rows and columns.

Additional steps in the cleaning and preparation process included:

  • Converting "t"/"f" letters into 1/0 values to represent True/False values.

  • Converting "price" from text into a number.

  • Creating a new feature, "Years Hosting," to indicate number of years of hosting experience for each listing.

  • Splitting the long text strings of host verification methods, neighborhoods, and amenities into separate features for each method, neighborhood, and amenity, respectively.

After preparing the data, I used machine learning modeling techniques to determine my most important features for the region.


Results:

The "Random Forest"-style classification model showed the strongest performance in terms of properly identifying whether a property would meet the threshold of 4.8 or higher. The model is correct 67% of the time, which is better than my baseline model of 50%.

Potential factors for the inaccuracy may include:

  • Extreme values in the data
  • Non-standardized amenity names (unique amenity names are treated individually vs. grouped with similar amenities)
  • A slight class imbalance (roughly 40% of the data fell below the threshold, with 60% above).

The 5 top predictors are:

  1. Superhost Status
  2. Regional Listings
  3. Overall Listings
  4. Price
  5. Minimum Average Nights

Visualizing Results

visual_1


Recommendations:

Based on my model results, I would recommend for Airbnb to focus its efforts on converting hosts into "Superhosts:"

  • "Superhosts are experienced hosts who provide a shining example for other hosts, and extraordinary experiences for their guests." Source

Superhosts must meet the following criteria:

  • Complete at least 10 trips OR completed 3 reservations that total at least 100 nights
  • Maintain a 90% response rate or higher
  • Maintain a 1% cancellation rate (1 cancellation per 100 reservations) or lower
  • Maintain a 4.8 overall rating

These requirements are more readily achievable than having a host offer additional listings, which would often require purchasing new properties. Furthermore, these requirements directly translate to high guest satisfaction. In my experience as a hotelier, being responsive and welcoming to guests are two of the most critical traits to ensure satisfaction.


For More Information

Please review the full analysis in the Jupyter Notebook or the presentation.

For any additional questions, please contact:

Ben McCarty

Repository Structure

├── README.md                           
├── Helping_Your_Hosts.ipynb
├── DS_Project_Presentation.pdf
├── data
├── clf_functions
└── img

helping_your_hosts's People

Contributors

benjmccarty avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.