Reference no: EM132285317
Goals of this assignment:
1. Students will be exposed to a problem where the method of analysis is not given.
2. Students will learn to use Kaggle.
Kaggle is a great way to get more experience with machine learning. The website hosts competitions, provides training, and more. In STAA578 we will use Kaggle through their classroom competitions feature. This week will be a trial run with a simple data set to make sure all of us can get Kaggle working for future assignments. Please bear with me as this is a work in progress!
The basic idea of the assignment:
• First some vocab: the training data set has a response and a set of predictor variables for each observation. The test data set only has the predictor variables.
• I have posted a problem and data for you to model on Kaggle. You'll develop a model to predict the responses using your model. When you think you have a good fit, use the test data set to produce predicted responses for the test data set. You will upload your predictions for the test data set. Kaggle has the true response values for the test data. Kaggle will compute the cost function and will list your ranking on the leaderboard.
• There will be two rankings: one on the public data set and one on the private data set.
The test data have 2 parts - a public subset of the test data set and a private subset. You won't know which is which. One subset produces the public ranking. Another produces a private ranking.
Steps you should take to complete the assignment:
1. Log into Kaggle. Your first time there you'll have to create a Kaggle account.
2. Go to our classroom Kaggle competition link for Homework 3
3. Go to our classroom competition for Homework
4. Download the data.
5. Develop a method to predict the data.
6. Upload your answer in Kaggle and see your standing on the leaderboard
7. If desired, improve your model to improve your standing on the leaderboard.
8. To turn in:
(1) When you are happy with your model, upload your final predictions on Kaggle.
(2) Turn in on Canvas a summary of your final model. Write up a one page summary describing the model you fit. Provide a plot or some useful summary of the model. Attach your final code.
As part of your summary you may also provide a short summary of other methods.
Attachment:- Homework.rar