Giter VIP home page Giter VIP logo

ie0005-dsai-group-barley's Introduction

Contributors Forks Stargazers Issues MIT License


Logo

Flight Price Prediction Project


Explore the docs »

View Demo · Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. Contributing
  6. License
  7. Contact
  8. Acknowledgments

About The Project

Product Name Screen Shot

The objective of the study is to analyse the flight booking dataset obtained from “Ease My Trip” website and to conduct various statistical hypothesis tests in order to get meaningful information from it. The 'Linear Regression' statistical algorithm would be used to train the dataset and predict a continuous target variable. 'Easemytrip' is an internet platform for booking flight tickets, and hence a platform that potential passengers use to buy tickets. A thorough study of the data will aid in the discovery of valuable insights that will be of enormous value to passengers.

Research Questions

  • Does price vary with Airlines?
  • How is the price affected when tickets are bought in just 1 or 2 days before departure?
  • Does ticket price change based on the departure time and arrival time?
  • How the price changes with change in Source and Destination?
  • How does the ticket price vary between Economy and Business class?

DATA COLLECTION AND METHODOLOGY

Octoparse scraping tool was used to extract data from the website. Data was collected in two parts: one for economy class tickets and another for business class tickets. A total of 300261 distinct flight booking options was extracted from the site. Data was collected for 50 days, from February 11th to March 31st, 2022. Data source was secondary data and was collected from Ease my trip website.

Dataset

Dataset contains information about flight booking options from the website Easemytrip for flight travel between India's top 6 metro cities. There are 300261 datapoints and 11 features in the cleaned dataset.

FEATURES

The various features of the cleaned dataset are explained below:

  • Airline: The name of the airline company is stored in the airline column. It is a categorical feature having 6 different airlines.
  • Flight: Flight stores information regarding the plane's flight code. It is a categorical feature.
  • Source City: City from which the flight takes off. It is a categorical feature having 6 unique cities.
  • Departure Time: This is a derived categorical feature obtained created by grouping time periods into bins. It stores information about the departure time and have 6 unique time labels.
  • Stops: A categorical feature with 3 distinct values that stores the number of stops between the source and destination cities.
  • Arrival Time: This is a derived categorical feature created by grouping time intervals into bins. It has six distinct time labels and keeps information about the arrival time.
  • Destination City: City where the flight will land. It is a categorical feature having 6 unique cities.
  • Class: A categorical feature that contains information on seat class; it has two distinct values: Business and Economy.
  • Duration: A continuous feature that displays the overall amount of time it takes to travel between cities in hours.
  • Days Left: This is a derived characteristic that is calculated by subtracting the trip date by the booking date.
  • Price: Target variable stores information of the ticket price.

(back to top)

Built With

This section should list any major frameworks/libraries used to bootstrap your project. Leave any add-ons/plugins for the acknowledgements section. Here are a few examples.

  • Python
  • Jupyter
  • Numpy

(back to top)

Getting Started

This is an example of how you may give instructions on setting up your project locally. To get a local copy up and running follow these simple example steps.

Prerequisites

Have python install in your local system. (Select python version 3.10)

  1. Python
https://www.python.org/
  1. After installation, Open the Run dialog box by pressing the Windows key + R.

    Image Alt Text
  2. Type cmd and hit Enter.

    Image Alt Text
  3. For a list of all the locations added to your PATH variable, type echo %PATH% into the command prompt, then press Enter.

    Image Alt Text
  4. If you find a path like C:\Python39\Scripts (depending on your Python version), the path was added to the PATH variable. If pip hasn’t been added, try the next fix.

Fix 2: Add Pip to the PATH Environment Variable

Here’s how to manually add pip to the PATH environment using the Windows GUI and command prompt. Once the path has been added, open a new command window and try to install a pip package to see whether the problem was solved.

Add pip to the PATH using Windows GUI:

  1. Open the Run dialog box by pressing the Windows key + R.

    Image Alt Text
  2. Type in sysdm.cpl and press Enter to access System Properties.

    Image Alt Text
  3. Select the Advanced tab, then Environment Variables.

    Image Alt Text
  4. Go to System variables and select Path.

    Image Alt Text
  5. Click the Edit button.

    Image Alt Text
  6. Click on New to add the pip installation path. The default location is: C:\users\“your-username“\AppData\Programs\Python\Python39 for Python 3.9.

    Image Alt Text

Installation

  1. Clone the repo
    git clone https://github.com/chongwei49/IE0005-DSAI-Group-Barley.git
    
  2. Install Jupyter Notebook
    pip install notebook
    
  3. To run the notebook
    jupyter notebook
    

(back to top)

Usage

Use this space to show useful examples of how a project can be used. Additional screenshots, code examples and demos work well in this space. You may also link to more resources.

For more examples, please refer to the Documentation

(back to top)

Roadmap

  • Add Changelog
  • Add back to top links
  • Add Additional Templates w/ Examples
  • Add "components" document to easily copy & paste sections of the readme
  • Multi-language Support
    • Chinese
    • Spanish

See the open issues for a full list of proposed features (and known issues).

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

(back to top)

License

Distributed under the MIT License. See LICENSE.txt for more information.

(back to top)

Contact

Chongwei - [email protected]

Project Link: https://github.com/your_username/repo_name

(back to top)

Acknowledgments

Use this space to list resources you find helpful and would like to give credit to. I've included a few of my favorites to kick things off!

(back to top)

ie0005-dsai-group-barley's People

Contributors

chongwei49 avatar tkm321 avatar zhaoyiiii avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.