sayamalt / flight-price-prediction


A machine learning model that predicts the price of a flight in India from features such as duration, days left, arrival time, and departure time.

Home Page: https://flight-price-prediction-r3uj.onrender.com/

Languages: Jupyter Notebook 99.13%, HTML 0.71%, Python 0.16%
Topics: data-visualization, exploratory-data-analysis, feature-engineering, hyperparameter-tuning, machine-learning, model-deployment, model-training-and-evaluation, regression-models

flight-price-prediction's Introduction

About Dataset

INTRODUCTION

The objective of this study is to analyze the flight booking dataset obtained from the “Ease My Trip” website and to conduct statistical hypothesis tests that extract meaningful information from it. Linear regression is used to fit a model to the dataset and predict a continuous target variable. 'Easemytrip' is an online platform for booking flight tickets, and hence one that potential passengers use to buy tickets. A thorough study of the data will help uncover insights that are valuable to passengers.
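As a rough illustration of what fitting a linear regression to a continuous target means, the sketch below fits a one-feature least-squares line in plain Python. The function name `fit_simple_ols` and the sample numbers are made up for this example and are not taken from the repository or the dataset; the actual notebook presumably uses more features and a library implementation.

```python
from statistics import mean


def fit_simple_ols(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of squared deviations of x; zero means every x is identical.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    # Sum of products of the deviations of x and y from their means.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical values: days left before departure and ticket price.
days_left = [1, 2, 5, 10, 20]
price = [10000, 9500, 8000, 6500, 5000]

slope, intercept = fit_simple_ols(days_left, price)
predicted_price = slope * 7 + intercept  # prediction for a booking 7 days ahead
```

A negative slope on data like this would indicate that prices fall as the number of days left grows. Categorical features such as airline or class would first need to be encoded numerically before a linear model can use them.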

Research Questions

The aim of this study is to answer the following research questions:

a) Does the price vary with Airlines?
b) How is the price affected when tickets are bought just 1 or 2 days before departure?
c) Does the ticket price change based on the departure time and arrival time?
d) How does the price change with changes in Source and Destination?
e) How does the ticket price vary between Economy and Business class?
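Most of these questions reduce to comparing average prices across groups. The sketch below shows one way to do that in plain Python; the helper `mean_price_by`, the field names, and the sample rows are assumptions made for illustration, not values from the dataset. In a notebook, a dataframe group-by would be the natural equivalent.

```python
from collections import defaultdict
from statistics import mean


def mean_price_by(rows, key):
    """Average the 'price' field of the rows, grouped by the value stored under `key`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row["price"])
    return {label: mean(prices) for label, prices in groups.items()}


# Hypothetical rows; airline names and prices are placeholders.
rows = [
    {"airline": "Airline A", "class": "Economy", "days_left": 1, "price": 9000},
    {"airline": "Airline A", "class": "Business", "days_left": 1, "price": 52000},
    {"airline": "Airline B", "class": "Economy", "days_left": 30, "price": 4000},
    {"airline": "Airline B", "class": "Business", "days_left": 30, "price": 44000},
    {"airline": "Airline A", "class": "Economy", "days_left": 30, "price": 5000},
]

by_class = mean_price_by(rows, "class")      # question (e)
by_airline = mean_price_by(rows, "airline")  # question (a)

# Question (b): compare last-minute bookings (1 or 2 days left) with the rest.
last_minute = mean(r["price"] for r in rows if r["days_left"] <= 2)
earlier = mean(r["price"] for r in rows if r["days_left"] > 2)
```

Group averages only describe the sample; deciding whether a difference is significant is the job of the hypothesis tests mentioned in the introduction.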

DATA COLLECTION AND METHODOLOGY

The Octoparse scraping tool was used to extract data from the website. Data was collected in two parts: one for economy class tickets and another for business class tickets. A total of 300,261 distinct flight booking options were extracted from the site. Data was collected for 50 days, from February 11th to March 31st, 2022. The data is secondary data, collected from the Ease My Trip website.
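Because the data arrives in two parts, the parts have to be combined and each row tagged with its seat class before analysis. A minimal sketch of that step follows, using in-memory CSV text in place of the real files; the column names, the `load_part` helper, and the sample rows are hypothetical.

```python
import csv
import io

# Hypothetical stand-ins for the two scraped files; column names are assumptions.
ECONOMY_CSV = "airline,price\nAirline A,5000\nAirline B,4200\n"
BUSINESS_CSV = "airline,price\nAirline A,48000\n"


def load_part(text, seat_class):
    """Parse one CSV part and tag every row with its seat class."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        row["class"] = seat_class
        row["price"] = int(row["price"])  # CSV values are read as strings
        rows.append(row)
    return rows


# One combined list with an explicit class column.
combined = load_part(ECONOMY_CSV, "Economy") + load_part(BUSINESS_CSV, "Business")
```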

DATASET

The dataset contains information about flight booking options from the website Easemytrip for flight travel between India's top six metro cities. There are 300,261 data points and 11 features in the dataset.

FEATURES

The various features of the dataset are explained below:

| Feature | Description |
|---|---|
| Airline | The name of the airline company. A categorical feature with six different airlines. |
| Flight | The plane's flight code. A categorical feature. |
| Source City | The city from which the flight takes off. A categorical feature with six unique cities. |
| Departure Time | A derived categorical feature, obtained by grouping departure times into bins. It has six unique time labels. |
| Stops | The number of stops between the source and destination cities. A categorical feature with three distinct values. |
| Arrival Time | A derived categorical feature, obtained by grouping arrival times into bins. It has six distinct time labels. |
| Destination City | The city where the flight lands. A categorical feature with six unique cities. |
| Class | The seat class. A categorical feature with two distinct values: Business and Economy. |
| Duration | The total travel time between the cities, in hours. A continuous feature. |
| Days Left | A derived feature, calculated by subtracting the booking date from the trip date. |
| Price | The ticket price. This is the target variable. |
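The two kinds of derived feature described above can be sketched as follows. The label names and the four-hour bin boundaries are assumptions chosen for illustration, since the source only says that there are six time labels. Days Left is taken here as the trip date minus the booking date, so that it is non-negative for a future trip.

```python
from datetime import date, time

# Assumed labels, one per four-hour block starting at midnight.
TIME_LABELS = ["Late Night", "Early Morning", "Morning", "Afternoon", "Evening", "Night"]


def time_of_day_label(t):
    """Map a clock time to one of six labels using four-hour bins (assumed boundaries)."""
    return TIME_LABELS[t.hour // 4]


def days_left(booking_date, trip_date):
    """Number of days between the booking date and the trip date."""
    return (trip_date - booking_date).days


label = time_of_day_label(time(9, 30))
remaining = days_left(date(2022, 2, 10), date(2022, 2, 11))
```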
