Examine the dataset and eliminate mistakes and bad records

Assignment Help Applied Statistics
Reference no: EM131097329

Assignment- Exploratory Data Analysis (EDA) using Watson Analytics

This is an individual assignment. Each student will complete the assignment outlined below and post his/her written results to the appropriate assignment. Please note that only 1 document is allowed to be submitted. See content on p.2.

Activities

1. Select from the dataset provided (or ones designated by your instructor). Provide a brief description of the datasets to include the number of cases, description of the inputs, description of the variables that could be used to develop predictive models, etc.

2. Examine the dataset and eliminate mistakes, bad records, data entry errors, and outliers.

Using Watson Analytics:

3. Explore the dataset, including:

a) Examine the initial set of questions posed by Watson Analytics. Provide any insights gained from this initial dataset.

b) Develop new specific questions which provide additional insights into and answer specific questions from the dataset. Discuss how these insights could be useful. Did Watson Analytics provide the answers necessary? Discuss how you would improve the relevancy.

c) Experiment with the available filters and visualization options at the bottom of the screen and summarize the results. Create and explain at least one insightful global and one local filter for your dataset.

d) Create and explain at least one insightful calculation. Discuss why this would be useful.

4. Refine the dataset.

a) Which variables have the highest quality score? Which ones have the lowest quality score and why? Discuss how the quality of the dataset could be improved.

b) Utilize the available grouping, filtering and hierarchical functionalities to refine the data. Summarize the approach you took and the outcome. What suggestions or insights are gained?

Submission

Each student will submit a single document conforming to the guidelines and standards outlined below.

Document format:

- limited to 5 pages (excludingtitle page, references, and appendix),
- Double-spaced, 12 point Times New Roman font, 1" margins, Bottom-right page numbering.

Note: Submitted report must be either in MS Word or PDF format and titled:

"Assignment_LastName".

Only one document will be allowed to submit.

Content(note that the document must have clearly marked sections for the items listed below)

1) Title page (1 page limit): course number and term, assignment number and project title, student name and contact information, instructor's name. Format it so it looks pleasant and presentable. Follow formatting guidelines above.

2) Introduction. Provide a brief outline of the dataset you are using for this assignment. Briefly explain the content of the data. Include a screenshot of the data (not all, but partial as far as all relevant variables are visible).

3) Discuss the data exploration process followed and the results. Include any specific ideas or suggestions as to how this could be used in your organization.

4) Discuss the data refinement process followed and the results.

5) References (1 page limit): List all references in APA format used in preparing this report. It is strongly recommended to use outside knowledge in setting-up the analysis or discussing the results where possible.

6) Appendix (4 page limit):

a) Appendix A: Include any appropriate workbooks and/or screenshots (figures, tables, diagrams) used in this assignment. Make sure all tables, figures, or diagrams are properly numbered and titled. For example, "Table 1. Model Results". Make sure all tables or figures or diagrams are easily readable and visually presentable.

https://www.dropbox.com/s/zpv4cq2nswvmw89/data_files.zip?dl=0

Reference no: EM131097329

Questions Cloud

Summary on:the political dynamics of higher education policy : Write a summary of the following Article: The Political Dynamics of Higher Education Policy.
Determine the equivalent stress in the cylinder : Determine the maximum shear stress at the outer surface of the cylinder.
Expected utitily of income and utility of expected income : John has a risk asset worth Y and derives utility from the consumption of his wealth. Suppose W=100, L=36, P=0.5 , Find John's expected Utitily of income and Utility of expected income and explain that intuition behind each. Now suppose that he has a..
Determine the moment of this force about point o : 2/43 As a trailer is towed in the forward direction, the force F = 120 lb is applied as shown to the ball of the trailer hitch. Determine the moment of this force about point O.
Examine the dataset and eliminate mistakes and bad records : Examine the initial set of questions posed by Watson Analytics. Provide any insights gained from this initial dataset. Examine the dataset and eliminate mistakes, bad records, data entry errors, and outliers.
Determine the corresponding velocity and acceleration : Determine the corresponding velocity and acceleration of block A.
Do university faculty who prepare teachers model technology : What types of technology use, if any, are demonstrated by teacher candidates in those courses. Although both were found to be dependent on the type of technology integration, overall results indicated (a) faculty do not model most technology integ..
Write an essay about pros and cons of tariffs : Write a minimum of a five-page essay, using proper APA format, on the topic of pros and cons of tariffs. Use a minimum of three scholarly sources. You have the freedom to take any aspect of unemployment that you desire to research.
How you perceive the availability of fresh water : what are your own personal, specific recommendations for wisely managing this water "crisis" in the Southwest? Be sure to justify your answer, and include your own perspective on urban growth as well.

Reviews

Write a Review

Applied Statistics Questions & Answers

  Lengths of chromosomes in a certain species are uniformly

Lengths of chromosomes in a certain species are uniformly

  Us residents with a college degree and us residents

The General Social Survey described in Exercise 5.8 included random samples from two groups: US residents with a college degree and US residents without a college degree. For the 505 sampled US residents with a college degree, the average numbe..

  Involving the selection of two numbers

Rework problem 31 in section 4.1 of your text, involving the selection of two numbers. Assume that you select two 2-digit numbers at random from the set of consecutive integers from 00 through 99. The selections are made with replacement and are inde..

  Conduct a hypothesis test to determine

Conduct a hypothesis test to determine if chewing gum makes you less accurate while target shooting.  Each of the following participants shot at a target (maximum score of 100) while chewing gum and without.  Assume that all conditions of the test ha..

  Compute the direct materials allowed per bunny in ounces

Compute the direct materials allowed per bunny in ounces. Compute the direct labor hours allowed per bunny in hours. Set up a standard cost card for the prime cost of one chocolate bunny.

  How large sample of sales figures is need to make confident

How large a sample of sales figures is needed to make us 95 percent confident that x, the sample mean sales dollars per square foot?

  Write the formula for the exponential probability curve of x

Write the formula for the exponential probability curve of x. Assuming that the maintenance department's claim is true, find the probability that the time between successive breakdowns is at most five hours.

  Took for 8 participants to solve a puzzle.

Recorded the time in seconds it took for 8 participants to solve a puzzle. These times appear below. However, when the data was entered into the statistical program, the score that was supposed to be 22.1 was entered as 21.2.

  Calculate the mean, median and mode

State the statistical assumptions of this test and using the data set and variables you have selected, use SPSS to calculate the Mean and Median.

  Captain jack sparrow is stranded on an island awaiting rescu

Captain Jack Sparrow is stranded on an island, awaiting rescue. Suppose that the probability for a ship to come by in a particular day is 20%. Assuming each day is independent of the next, for at least how many days should he wait (including the day ..

  Compute a five-year weighted moving average using weights

IListed below is the number of movie tickets sold at the Library Cinema-Complex, in thousands, for the period from 2001 to 2013. Compute a five-year weighted moving average using weights of 0.1, 0.15, 0.25, 0.12, and 0.38, respectively. Describe the ..

  An independent-measures study produces

An independent-measures study produces t(21)=3.00, p

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd