Summarize and interpret statistical findings

Reference no: EM131301146

Data:

Searching for "General Social Survey Data 2006" returns many links to the GSS 2006 data. For example, you can use the following link:

https://www.thearda.com/Archive/Files/Descriptions/GSS2006.asp

You can then download the dataset, for example from the section called "Microsoft Excel File", and save it on your computer. You may rename the data file, for example to "gss06".

You can download the codebook for all the variables in the survey as well.

You will use only a few variables (columns) of the data for the final exam, so you can delete as many of the other variables (columns) as you like from the original data file. This reduces the memory needed when you analyze the data in R or SAS.

Problem 1:

Please work on this problem using SAS.

Import the data into SAS (referred to as gss06), and report the number of subjects and the number of variables.

Create a dataset (referred to as gss06_sub) that includes only the following 8 variables: AGEWED, AGEKDBRN, WRKSTAT, AGE, CHILDS, EDUC, SEX, and RACE. Read the codebook for the details of these 8 variables.

Print the first 20 observations of the above data named gss06_sub.

Carry out a descriptive analysis of the two variables "AGEWED" and "AGEKDBRN". For each variable, report the number of observations, the number of missing values (treat "Don't know" and "No answer" as missing), the mean and standard deviation, and the five-number summary.

Create two datasets: one named "gss1", excluding subjects with a missing value for "AGEWED", and the other named "gss2", excluding subjects with a missing value for "AGEKDBRN". Export these two datasets as CSV files to be used in R.

Problem 2:

Please work on this problem using R.

Use R to calculate a number based on your initials. For example, the instructor's name is Yixin Fang, so his initials are "yf". The position of "y" in the alphabet is 25 and the position of "f" is 6, so his number is 25 × 10 + 6 = 256. Hint: You may want to use the built-in R object "letters" and the R functions "strsplit" and "which".
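The calculation above can be sketched in R as follows. This is only one possible approach, built on the hinted "letters", "strsplit", and "which"; the function name initials_to_number is hypothetical, and the sketch assumes exactly two lowercase or uppercase English letters.

```r
# Hypothetical helper: convert two-letter initials into the number
# defined in the problem (first position * 10 + second position).
initials_to_number <- function(initials) {
  chars <- strsplit(tolower(initials), "")[[1]]            # "yf" -> c("y", "f")
  pos <- sapply(chars, function(ch) which(letters == ch))  # positions in the alphabet
  unname(pos[1] * 10 + pos[2])
}

initials_to_number("yf")  # 25 * 10 + 6 = 256
```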

Use the number you just calculated as the seed to generate a random number between 0 and 1. If the random number is less than 0.5, use the dataset named "gss1" for the following problems. If the random number is greater than 0.5, use the dataset named "gss2" instead. Name the selected dataset "mygss". Hint: Set your seed number before generating random numbers so that the grader can reproduce your results.
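A minimal sketch of this selection step, assuming the seed 256 from the instructor's example and assuming the CSV files exported from SAS are called gss1.csv and gss2.csv (both file names are hypothetical). The read.csv line is commented out so the sketch runs without the files.

```r
seed <- 256                 # assumption: the instructor's example; replace with your own number
set.seed(seed)              # fix the random number generator state
u <- runif(1)               # one draw from Uniform(0, 1)
chosen <- if (u < 0.5) "gss1" else "gss2"

# Assumed file names; adjust to wherever the SAS export was saved.
# mygss <- read.csv(paste0(chosen, ".csv"))
```

Because the seed is set immediately before the draw, rerunning these lines gives the same value of u and therefore the same dataset choice.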

Your dataset "mygss" contains 8 variables, but you will use only 7 of them. For example, if your dataset was originally gss1, the dependent variable is "AGEWED"; you can ignore "AGEKDBRN" and treat the remaining variables (WRKSTAT, AGE, CHILDS, EDUC, SEX, and RACE) as independent variables. If your dataset was originally gss2, the dependent variable is "AGEKDBRN" and you can ignore "AGEWED". Use R to describe and summarize those 7 variables in "mygss".

Randomly divide your dataset "mygss" into two halves. To do this, use the R function "sample" to randomly sample m subjects from n subjects without replacement. Here n is the sample size of your dataset "mygss" and m is the largest integer less than n/2. Name the dataset consisting of these m subjects "mygss_train" and name the dataset consisting of the remaining subjects "mygss_test".
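The split can be sketched as below. The small data frame is a made-up stand-in for "mygss", the seed 256 is again the instructor's example, and m follows the wording literally (the largest integer strictly less than n/2, so an even n gives n/2 - 1).

```r
# Toy stand-in for mygss (hypothetical data, for illustration only)
mygss <- data.frame(id = 1:11, AGEWED = 20:30)

set.seed(256)                  # assumption: reuse the seed from the previous step
n <- nrow(mygss)
m <- ceiling(n / 2) - 1        # largest integer strictly less than n/2
train_idx <- sample(n, m)      # m row indices from 1..n, without replacement (the default)

mygss_train <- mygss[train_idx, ]
mygss_test  <- mygss[-train_idx, ]   # all remaining rows
```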

Problem 3:

Please work on this problem using SAS.

Use dataset "gss1" to test if "AGEWED" is marginally associated with "WRKSTAT", "AGE", "CHILDS", "EDUC", "SEX", and "RACE", respectively. If one categorical variable has too many categories, you can decide whether or not to dichotomize it.

Use dataset "gss2" to test if "AGEKDBRN" is marginally associated with "WRKSTAT", "AGE", "CHILDS", "EDUC", "SEX", and "RACE", respectively. If one categorical variable has too many categories, you can decide whether or not to dichotomize it.

Summarize and interpret statistical findings you obtained from the above bivariate tests.

Problem 4:

Please work on this problem using R.

Fit a linear regression model using the dataset named "mygss_train", with "AGEWED" or "AGEKDBRN" (whichever is the dependent variable in your dataset) as the dependent variable and the other 6 variables as independent variables.

Summarize and interpret statistical findings you obtain from the above regression analysis. Report the statistic called "adjusted R-square".

Identify those independent variables that are significantly associated with the dependent variable under significance level of 0.05.
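The three steps above might look like the following sketch. It uses simulated toy data with only two predictors rather than the GSS variables, so the numbers it produces say nothing about the real survey; categorical predictors such as WRKSTAT, SEX, and RACE would need to be converted with factor() first and would contribute one coefficient row per level.

```r
# Simulated toy data (hypothetical; not the GSS)
set.seed(1)
toy <- data.frame(AGE  = sample(20:80, 100, replace = TRUE),
                  EDUC = sample(8:20, 100, replace = TRUE))
toy$AGEWED <- 18 + 0.3 * toy$EDUC + rnorm(100, sd = 3)

fit <- lm(AGEWED ~ AGE + EDUC, data = toy)    # linear regression
fit_summary <- summary(fit)

adj_r2 <- fit_summary$adj.r.squared           # adjusted R-square
pvals <- coef(fit_summary)[-1, "Pr(>|t|)"]    # p-values, intercept row dropped
significant <- names(pvals)[pvals < 0.05]     # predictors significant at the 0.05 level
```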

Fit a linear regression model using the dataset named "mygss", with "AGEWED" or "AGEKDBRN" as the dependent variable, but considering only those independent variables that were identified as significant in the preceding step.

Summarize and interpret statistical findings you obtain from the above regression analysis. Report the statistic called "adjusted R-square".


