Create a classification model, Database Management System

Assignment Help:

Use the GE Flight Quest data again and build on the work done for homework 1 and 2. The goal is to create a classification model that classifies flights as "on-time" (you can define how many minutes late still counts as on time) or "delayed". Use all available data (weather information is available). Note also that you have data for several days, which you can use for training and testing.

Then add 4 columns for the delays. These delays are:

  • gate_departure_delays => (actual_gate_departure - scheduled_gate_departure)
  • gate_arrival_delays => (actual_gate_arrival - scheduled_gate_arrival)
  • runway_departure_delays => (actual_runway_departure - scheduled_runway_departure)
  • runway_arrival_delays => (actual_runway_arrival - scheduled_runway_arrival)

Then work on this file.
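
As a minimal sketch of the four delay columns and the on-time/delayed label, the example below uses pandas on two made-up rows. The column names follow the assignment text; the sample timestamps, the 15-minute threshold, the choice of gate arrival delay as the basis for the label, and the `status` column name are all assumptions for illustration.

```python
import pandas as pd

# Hypothetical sample rows; the real flight history file has many more columns.
flights = pd.DataFrame({
    "scheduled_gate_departure":   ["2012-11-12 08:00", "2012-11-12 09:30"],
    "actual_gate_departure":      ["2012-11-12 08:05", "2012-11-12 10:10"],
    "scheduled_runway_departure": ["2012-11-12 08:15", "2012-11-12 09:45"],
    "actual_runway_departure":    ["2012-11-12 08:18", "2012-11-12 10:30"],
    "scheduled_runway_arrival":   ["2012-11-12 09:50", "2012-11-12 11:35"],
    "actual_runway_arrival":      ["2012-11-12 09:49", "2012-11-12 12:20"],
    "scheduled_gate_arrival":     ["2012-11-12 10:00", "2012-11-12 11:45"],
    "actual_gate_arrival":        ["2012-11-12 09:58", "2012-11-12 12:30"],
})

# Assumed cut-off: flights up to 15 minutes late still count as on-time.
ON_TIME_THRESHOLD_MIN = 15


def add_delay_columns(df):
    """Return a copy of df with the four delay columns, in minutes."""
    out = df.copy()
    events = ("gate_departure", "gate_arrival",
              "runway_departure", "runway_arrival")
    for event in events:
        actual = pd.to_datetime(out[f"actual_{event}"])
        scheduled = pd.to_datetime(out[f"scheduled_{event}"])
        out[f"{event}_delays"] = (actual - scheduled).dt.total_seconds() / 60.0
    return out


flights = add_delay_columns(flights)

# Label each flight from its gate arrival delay (an assumed choice).
# Rows with a missing delay would compare as False here, so handle
# missing values before labelling real data.
is_delayed = flights["gate_arrival_delays"] > ON_TIME_THRESHOLD_MIN
flights["status"] = is_delayed.map({True: "delayed", False: "on-time"})
```

A negative delay means the flight was early. Changing `ON_TIME_THRESHOLD_MIN` changes the class balance, so the chosen value belongs in the report.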

Follow the CRISP-DM framework

3. Data Preparation

  • Construct variables as needed.
  • Deal with missing data.

How you handle missing data depends on the model you use. For example, you can replace missing values with the mean, work out what the values should be, or use any other approach you are comfortable with. Provide a short description of the approach you chose.
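
One simple option, sketched here under assumptions, is to fill numeric gaps with the column mean and categorical gaps with the most frequent value. The `airline_code` column and the sample values are hypothetical, and `impute_simple` is an illustrative helper, not part of the assignment.

```python
import numpy as np
import pandas as pd

# Hypothetical sample with one missing numeric and one missing categorical value.
raw = pd.DataFrame({
    "gate_departure_delays": [5.0, np.nan, 40.0, 15.0],
    "airline_code": ["AA", "UA", None, "AA"],
})


def impute_simple(df):
    """Fill numeric columns with the mean and other columns with the mode."""
    out = df.copy()
    for col in out.columns:
        if pd.api.types.is_numeric_dtype(out[col]):
            out[col] = out[col].fillna(out[col].mean())
        else:
            # mode() ignores missing values; take the first mode on ties
            out[col] = out[col].fillna(out[col].mode().iloc[0])
    return out


clean = impute_simple(raw)
```

When the data is split into training and test days, the fill values should be computed on the training days only and then reused on the test days, so that no test information leaks into training.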

  • Combine the flight history data with additional data (weather, etc.)

Add weather columns to the flight history file. You can find the weather data in the data files.
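
A possible way to attach weather to each flight, assuming the weather file holds timestamped observations per airport, is a nearest-earlier-timestamp join with `pandas.merge_asof`. The column names `departure_airport`, `airport`, `observed_at`, `wind_speed` and `visibility`, the sample rows, and the two-hour tolerance are all assumptions; the real files may be laid out differently.

```python
import pandas as pd

# Hypothetical flight history rows.
flights = pd.DataFrame({
    "flight_id": [1, 2, 3],
    "departure_airport": ["JFK", "ORD", "JFK"],
    "scheduled_gate_departure": pd.to_datetime(
        ["2012-11-12 08:00", "2012-11-12 09:30", "2012-11-12 12:10"]),
})

# Hypothetical weather observations per airport.
weather = pd.DataFrame({
    "airport": ["JFK", "JFK", "ORD"],
    "observed_at": pd.to_datetime(
        ["2012-11-12 07:50", "2012-11-12 12:00", "2012-11-12 09:00"]),
    "wind_speed": [10, 25, 5],
    "visibility": [10.0, 2.0, 8.0],
})

# For each flight, take the latest observation at the departure airport
# made at or before the scheduled departure, within two hours.
# merge_asof requires both frames to be sorted on their time keys.
merged = pd.merge_asof(
    flights.sort_values("scheduled_gate_departure"),
    weather.sort_values("observed_at"),
    left_on="scheduled_gate_departure",
    right_on="observed_at",
    left_by="departure_airport",
    right_by="airport",
    direction="backward",
    tolerance=pd.Timedelta("2h"),
)
```

Flights with no observation inside the tolerance window get missing weather values, which then fall under the missing-data step.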

4. Modeling

  • Create at least 3 different classification models (different techniques) and discuss the advantages of each model for this classification task.

They could be 3 different trees; describe each model. For example, show the tree or part of it and explain what it means: What does it tell the airport? What does it mean for them? What can they do about it?

  • Assess how well each model performs (use training/test data, cross validation, etc. as appropriate).

Which of these models works best?
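
The comparison can be sketched with scikit-learn as follows. This is an illustration on synthetic data, not a result for the GE Flight Quest data: the three features, the rule that generates the labels, and the choice of a decision tree, logistic regression and random forest are all assumptions.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the prepared data (fixed seed for reproducibility).
rng = np.random.default_rng(0)
n = 400
X = np.column_stack([
    rng.normal(10, 15, n),   # departure delay in minutes (assumed feature)
    rng.uniform(0, 40, n),   # wind speed (assumed feature)
    rng.uniform(0, 10, n),   # visibility (assumed feature)
])
# Made-up labelling rule: 1 = delayed, 0 = on-time.
signal = X[:, 0] + 0.3 * X[:, 1] - 0.5 * X[:, 2] + rng.normal(0, 5, n)
y = (signal > 15).astype(int)

models = {
    "decision_tree": DecisionTreeClassifier(max_depth=4, random_state=0),
    "logistic_regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)),
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

# 5-fold cross-validated accuracy for each model.
scores = {
    name: cross_val_score(model, X, y, cv=5, scoring="accuracy")
    for name, model in models.items()
}
for name, fold_scores in scores.items():
    print(f"{name}: mean accuracy {fold_scores.mean():.3f} "
          f"(std {fold_scores.std():.3f})")
```

Because the data covers several days, holding out whole days for testing is an alternative to random folds and is closer to how the model would be used. If delayed flights are rare, accuracy alone can mislead, so precision and recall for the "delayed" class are worth reporting too.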

5. Evaluation

  • How useful is your best model for the airline industry? How would you measure the model's value?
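
One way to put a number on the model's value, sketched here with invented figures, is to attach a cost to each kind of error and compare the model against a naive baseline that predicts every flight as on-time. The two cost constants and the sample labels are purely hypothetical.

```python
# Hypothetical costs per flight; real figures would come from the airline.
COST_MISSED_DELAY = 500.0  # delayed flight predicted as on-time
COST_FALSE_ALARM = 100.0   # on-time flight predicted as delayed

# 1 = delayed, 0 = on-time (made-up labels and predictions).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]


def total_cost(actual, predicted):
    """Sum the assumed cost of every misclassified flight."""
    missed = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
    false_alarms = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    return missed * COST_MISSED_DELAY + false_alarms * COST_FALSE_ALARM


model_cost = total_cost(y_true, y_pred)
baseline_cost = total_cost(y_true, [0] * len(y_true))  # always "on-time"
savings = baseline_cost - model_cost
print(f"model: {model_cost}, baseline: {baseline_cost}, savings: {savings}")
```

With asymmetric costs like these, the best model by accuracy is not necessarily the best model by cost, which is a useful point to discuss in the evaluation.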

6. Deployment

  • How would you deploy the model?
  • Write a report covering in detail all steps of the project. The results have to be reproducible using this documentation.
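
A very small deployment sketch, assuming the chosen model is a scikit-learn estimator, is to save the fitted model to a file and load it in a separate scoring step. The file name `delay_model.pkl`, the `score_flights` helper, and the one-feature toy training data are hypothetical.

```python
import pickle
import tempfile
from pathlib import Path

import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Toy training data: one feature (departure delay in minutes), 1 = delayed.
X_train = np.array([[0.0], [5.0], [10.0], [20.0], [30.0], [60.0]])
y_train = np.array([0, 0, 0, 1, 1, 1])
model = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X_train, y_train)

# Persist the fitted model so the scoring step does not need to retrain.
model_path = Path(tempfile.gettempdir()) / "delay_model.pkl"
with open(model_path, "wb") as fh:
    pickle.dump(model, fh)


def score_flights(path, features):
    """Load the saved model and return predictions for new flights."""
    with open(path, "rb") as fh:
        loaded = pickle.load(fh)
    return loaded.predict(features)


predictions = score_flights(model_path, np.array([[3.0], [45.0]]))
```

Fixing `random_state`, recording library versions, and keeping the preparation steps in a script rather than doing them by hand all help make the reported results reproducible. Pickle files should only be loaded from trusted sources.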


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd