Create a classification model, Database Management System

Assignment Help:

Use again the GE Flight Quest Data and build on the work done for homework 1 and 2. The goal is to create a classification model which classifies flights into "on-time" (you can define how many minuteslate you still accept) and "delayed". Use all available data (there is weather information available). Notealso that you have data for several days which you can use for training and testing.

Then, add 4 columns for the delay. These delay are:

  • gate_departure_delays =>(actual_gate_departure-scheduled_gate_departure )
  • gate_arrival_delays =>(actual_gate_arrival-scheduled_gate_arrival)
  • runway_departure_delays =>(actual_runway_departure-scheduled_runway_departure)
  • runway_arrival_delays =>(actual_runway_arrival-scheduled_runway_arrival)

Then work on this file.

Follow the CRISP-DM framework

3. Data Preparation

  • Construct variables as needed.
  • Deal with missing data.

Depend on the model that you use, how it deals with messing data, such as replace them with the mean values or find out what values should be, or anything you comfortable with, and provide a short description for that.

  • Combine the flight history data with additional data (weather, etc.)

Add column for weather to the file history that we have. You can find weather data in the data files.

4. Modeling

  • Create at least 3 different classification models (different techniques) and discuss theadvantages of each model for this classification task.

They could be 3 different trees, and describe each model. For example, put the tree or part of it then explain everything "what does it mean? What it tell the airport? What does it mean for them? And what can they do?

  • Assess how well each model performs (use training/test data, cross validation, etc. asappropriate).

Which of these models work better?

5. Evaluation

  • How useful is your best model for the airline industry? How would you measure the model'svalue?

6. Deployment

  • How would you deploy the model?
  • Write a report covering in detail all steps of the project. The results have to be reproducibleusing this documentation.

Related Discussions:- Create a classification model

Failures and recovery, Failures and Recovery In practice various things...

Failures and Recovery In practice various things might occur to prevent a transaction from completing. Recovery methods are used to bring database, which does not satisfy consi

What is meant by physical blocks, What is meant by Physical blocks? The...

What is meant by Physical blocks? The input and output operations are completed in block units. The blocks residing on the disk are referred to as physical blocks.

Serialisability, Serialisability: Any schedule that makes the similar resul...

Serialisability: Any schedule that makes the similar results as a serial schedule is known as a serialisable schedule. But how can a schedule are determined to be serialisable or n

Failure controlling methods, Failure Controlling Methods: Failures can be h...

Failure Controlling Methods: Failures can be handled using different recovery methods that are talk about later in the unit. But the first question is doing we really require recov

What are the three classes of sql expression, What are the three classes of...

What are the three classes of SQL expression? SQL expression consists of three clauses:  Select  From  Where

What is bankers algorithm, What is banker's algorithm?  Banker's algori...

What is banker's algorithm?  Banker's algorithm is a deadlock avoidance algorithm that is applicable to a resource-allocation system with multiple instances of each resource ty

Explain the exists and unique functions of sql, Explain the EXISTS and UNIQ...

Explain the EXISTS and UNIQUE functions of SQL. Give an example for each? EXISTS: The EXISTS function takes one parameter that is a SQL statement. If any records exist which

Lack of standards and experience, Lack of standards and experience: The la...

Lack of standards and experience: The lack of standards has considerably limited the potential of distributed DBMSs. As well, there are no tools or methodologies to help users cha

What is distributed system, What is Distributed system ? Distributed S...

What is Distributed system ? Distributed System - A distributed system consists of a number of processing parts in which are interconnected through a computer network and in

Dbms built for online transaction processing, Problem: A DBMS built for...

Problem: A DBMS built for Online Transaction Processing (OLTP) is usually regarded as unsuitable for data warehousing. Each system is designed with a different set of requireme

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd