Create a new variable named ethnicity

Assignment Help Applied Statistics
Reference no: EM132271838

Statistics for Political Science Assignment - STATA Problem Set

Problem 1 - Using the dataset BormannGolder2013.dta, perform the following tasks:

1. Rename the variable tier1_avemag as averagma.

2. Label the variable as "Average district magnitude".

3. Ask STATA for basic summary statistics and indicates the mean, the standard deviation, the median and the relevant quartiles.

4. Look at the key values of this variable. If you think that this variable needs to be "clean", please, do appropriate changes in the variable to accurately reflect the information provided by averagma.

5. Plot a Box plot of averagma and briefly comment the key features of this variable.

6. Plot a histogram of averagma and discuss the shape and spread of this variable.

7. Now, look at the variable mixed_type. Ask STATA to show a frequency table of this variable.

8. Using the codebook of this dataset, create a value label for mixed_type to describe what each category of this variable captures.

9. Create a bar graph of mixed_type and describe this variable.

10. Create a bar graph showing the average district magnitude shown in your averagma variable for each type electoral system as shown in your variable mixed_type.

Problem 2 - Using the dataset AVdataset.dta perform the following tasks:

1. Using variable dq11, create a new variable named fptp_tradition with four categories. Category 1 should contain values in dq11 referring to agreeing. Category 2 should contain values in dq11 referring to neither agree nor disagree. Category 3 should contain values referring to disagreeing. Category 4 should reflect those who don't know. Provide labels for the new categories if appropriate. Label the new variable fptp_tradition as "FPT is part of British tradition". Show a distribution of the new variable.

2. Show the confidence intervals for each category of fptp_tradition explain the meaning of the CI for category1.

3. Rename variable dq20 as like_conservatives and show the 95% confidence interval of the mean of this variable. Explain the confidence interval.

4. Using variable dq104, create a new variable named class with three categories. Category 1 should refer to people with less than £ 20,000 and be labeled as "Lower class". Category 2 should refer to people declaring an income of £20,000 but less than £ 80,000 and be labeled as "Middle class". Category 3 should refer to people declaring an income of £ 80,000 or more and be labeled as "Upper class". The rest of the values of variable dq104 should be treated as missing.

5. Show the mean value of the variable like_conservative for every value of the variable class. Looking at this information, we would like to see how social class determines the likes for the conservative party. To do so we test, for every category of the variable class, the hypothesis that

H0 : μ = 4:4

When is the null hypothesis rejected?

6. We would like to compare the mean of like_conservatives from individuals belonging to the lower class with the mean of the rest of people. The hypotheses we would like to test is the following

H0 : μr = μl

Ha : μr > μl

where μl refers to the population mean of those who belong to the lower class and μr is the population mean of those who belong to the rest of social classes. Is the null-hypotheses rejected?

Problem 3 - Using the dataset world.dta perform the following tasks:

1. Provide descriptive statistics for variables hdi2001 and eth_het.

2. Create a new variable named hdi which indicates the value of hdi2001 multiplied by 100. Add an appropriate label to this variable.

3. Create a new variable named ethnicity which indicates the value of eth_het multiplied by 100. Add an appropriate label to this variable.

4. Draw a scatterplot showing the relationship between hdi and ethnicity. Add to the scatterplot a line assuming that there is a liner relationship between the two variables. Describe this relationship also using the value of the correlation.

5. Estimate the following regression

hdi = β0 + β1ethnicity

6. Discuss the main information of the regression including the meaning of β0 and β1, statistical significance and R-sq. (3 lines max)

N.B. Save the dataset to make sure that your new variables are preserved for later.

Problem 4 - Using the dataset world.dta as saved at the end of Problem 3, perform the following tasks:

1. Create a new variable named dem_area which indicates the value of the variable dem_oth multiplied by 100. Label the new variable dem_area as "% of democracies in same area". Look at the variable rural and create a new variable named rural_cat where 1 indicates that more than 50% of the population is rural and 0, otherwise. Provide relevant labels.

2. Estimate the following regression model

hdi = β0 + β1ethnicity + β2dem_area + δ1rural_cat

3. Discuss the main information of the regression including the meaning of β1, β2 and δ1, statistical significance and R-sq. (5 lines max).

4. Show in a graph how hdi changes as ethnicity increases from 0 to 100 in increments of 10. Briefly describe the graph.

5. Show in a graph how hdi changes as dem_area increases from 0 to 100 in increments of 10. Briefly describe the graph.

Attachment:- Assignment Files.rar

Reference no: EM132271838

Questions Cloud

Rate of change of general price level : What is inflation? It is the rate of change of general price level? A natural question to ask is "What is the general price level"?
How your favorite brand has gained market share : Determine and describe how your favorite brand has gained market share and popularity in terms of its brand recognition, brand strategy, and product positioning
Review the multidisciplinary evaluation team case study : Compose a minimum 500-word analysis that identifies additional specialists and other individuals who should be included as part of the MET team.
Potential rifle out of the market : Develop a plan keep the potential rifle out of the market please feel free to use crafts and numbers as part of your plan.
Create a new variable named ethnicity : Statistics for Political Science Assignment - STATA Problem Set. Create a new variable named ethnicity which indicates the value of eth_het multiplied by 100
Discuss why the pricing strategy is effective : Next, you will determine which pricing strategy is represented by your brand's price and discuss why (or why not) the pricing strategy is effective.
Trade between economically different countries : What are the basis for and gains from trade between economically different countries?
Identify the aspects of english that may prove problematic : Identify the aspects of English that may prove problematic for L2 learners from one linguistic background of your choice.
Create a network diagram that shows sequence of activities : Make a list of assumptions that will be used as the basis for planning the wedding. And no, it is not acceptable to assume that Tony and Peggy Sue will just.

Reviews

len2271838

4/1/2019 3:10:45 AM

You are expected to write a do-file that can be executed in STATA. Such do-file will be printed in pdf and submitted to KEATS. To print your do-file in pdf, please, select print from the file menu in the do-file editor and select a pdf printer. If you do not have a pdf printer installed, you can download one for free from the internet (For example, PDFCreator). In any case, you should submit a file that looks exactly like your original do-file.

len2271838

4/1/2019 3:10:38 AM

The do-file will contain a heading where your student ID should be clearly visible. Structure your do-file following the numbering as shown in the assignment sheet. You will be expected to enter some non-numerical answers like comments as part of your answer. When this is the case, I expect short comments no longer than 2/3 lines, so be very precised. You may also find useful to enter some short descriptive comments to better explain what you are doing. In both of these cases, make sure that STATA treats such content as text. Make sure that you run the do-file before submitting it. A do-file that does not run to the end will be penalised with 10% of the total marks. 7. You are encouraged to work in groups. However, note that all of your answers must be the result of your individual judgment. Questions and queries regarding this assignment will be answered only via KEATS.

Write a Review

Applied Statistics Questions & Answers

  What probability of experiencing such drop in water pressure

The community water system will experience a noticeable drop in water pressure when the daily water consumption exceeds 984,000 gallons. What is the probability of experiencing such a drop in water pressure?

  Is there a difference between married and single officers

Is there a difference between married and single officers on perceptions that their job was stressful - It is hypothesized that married officers would perceive the job as more stressful.

  The owner of a fish market has an assistant who has determin

The owner of a fish market has an assistant who has determined that the weights of catfish are normally distributed, with mean of 3.2 pounds and standard deviation of 0.8 pound. What percentage of samples of 4 fish will have sample means between 3.0 ..

  Get a flu shot

Suppose the number of days it takes you to get a flu shot

  Prepare a report using the numerical methods of statistics

Prepare a report (see below) using the numerical methods of descriptive statistics presented in this module to learn how each of the variables contributes to the success of a motion picture.

  How the other variables affect the survival time in days

Read the description of the data. Then fit a suitable model to understand how the other variables affect the Survival time in days from day of diagnosis of the patients

  A severe storm has an average peak wave height

A severe storm has an average peak wave height of 16.4 feet for waves hitting the shore. Suppose that a storm is in progress with a severe storm class rating. Let us say that we want to set up a statistical test to see if the wave action

  Find the probability that all guests will receive a room

Use the normal approximation to the binomial to find the probability that all guests who arrive on July 1 will receive a room.

  Significant change in the mean length of the bars

Has there been a statistically significant change in the mean length of the bars?

  Define the hypothesis and find the standard error

Define the hypothesis and find the standard error of the difference in the means also find the test statistic - Determine the required sample size to be able to use a 99% confidence interval.

  The distribution of systolic blood pressure measurements

The distribution of systolic blood pressure measurements for women over seventy-five.

  Research and data analysis in health care

HMGT 400 Research and Data Analysis in Health Care-Exercise - Descriptive statistics between hospital Based on your findings in which years hospitals

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd