Create a new variable named ethnicity

Reference no: EM132271838

Statistics for Political Science Assignment - STATA Problem Set

Problem 1 - Using the dataset BormannGolder2013.dta, perform the following tasks:

1. Rename the variable tier1_avemag as averagma.

2. Label the variable as "Average district magnitude".

3. Ask STATA for basic summary statistics and indicate the mean, the standard deviation, the median, and the relevant quartiles.

4. Look at the key values of this variable. If you think that this variable needs to be cleaned, make the appropriate changes so that averagma accurately reflects the information it is meant to capture.

5. Draw a box plot of averagma and briefly comment on the key features of this variable.

6. Plot a histogram of averagma and discuss the shape and spread of this variable.

7. Now, look at the variable mixed_type. Ask STATA to show a frequency table of this variable.

8. Using the codebook of this dataset, create a value label for mixed_type to describe what each category of this variable captures.

9. Create a bar graph of mixed_type and describe this variable.

10. Create a bar graph showing the average district magnitude (your averagma variable) for each type of electoral system (your mixed_type variable).
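The tasks above could be laid out in a do-file roughly as follows. This is only a sketch: it assumes BormannGolder2013.dta sits in the working directory, and the cleaning rule in task 4 (treating negative values as missing-data codes) is an assumption to check against the codebook, not something the assignment states. Task 8 is left out because the category descriptions for mixed_type come from the codebook.

```stata
* Problem 1 sketch; assumes the dataset is in the working directory
use "BormannGolder2013.dta", clear

* Tasks 1-2: rename and label the variable
rename tier1_avemag averagma
label variable averagma "Average district magnitude"

* Task 3: detail reports mean, sd, median (50%) and quartiles (25%, 75%)
summarize averagma, detail

* Task 4: ASSUMPTION - negative values are missing-data codes,
* since a district magnitude cannot be negative; verify in the codebook
count if averagma < 0
replace averagma = . if averagma < 0

* Tasks 5-6: box plot and histogram
graph box averagma, name(box_averagma, replace)
histogram averagma, name(hist_averagma, replace)

* Task 7: frequency table, including missing values
tabulate mixed_type, missing

* Task 9: number of observations in each category of mixed_type
graph bar (count), over(mixed_type) name(bar_mixed, replace)

* Task 10: mean of averagma within each category of mixed_type
graph bar (mean) averagma, over(mixed_type) name(bar_mean, replace)
```

Naming each graph keeps all of them open at once instead of each new graph replacing the previous one.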

Problem 2 - Using the dataset AVdataset.dta perform the following tasks:

1. Using variable dq11, create a new variable named fptp_tradition with four categories. Category 1 should contain values in dq11 referring to agreeing. Category 2 should contain values in dq11 referring to neither agree nor disagree. Category 3 should contain values referring to disagreeing. Category 4 should reflect those who don't know. Provide labels for the new categories if appropriate. Label the new variable fptp_tradition as "FPT is part of British tradition". Show a distribution of the new variable.

2. Show the confidence intervals for each category of fptp_tradition and explain the meaning of the CI for category 1.

3. Rename variable dq20 as like_conservatives and show the 95% confidence interval of the mean of this variable. Explain the confidence interval.

4. Using variable dq104, create a new variable named class with three categories. Category 1 should refer to people with an income of less than £20,000 and be labeled as "Lower class". Category 2 should refer to people declaring an income of at least £20,000 but less than £80,000 and be labeled as "Middle class". Category 3 should refer to people declaring an income of £80,000 or more and be labeled as "Upper class". The rest of the values of variable dq104 should be treated as missing.

5. Show the mean value of the variable like_conservatives for every value of the variable class. Looking at this information, we would like to see how social class determines liking for the Conservative Party. To do so, we test, for every category of the variable class, the hypothesis that

H₀ : μ = 4.4

When is the null hypothesis rejected?

6. We would like to compare the mean of like_conservatives for individuals belonging to the lower class with the mean for everyone else. The hypotheses we would like to test are the following:

H₀ : μ_r = μ_l

H_a : μ_r > μ_l

where μ_l refers to the population mean of those who belong to the lower class and μ_r is the population mean of those who belong to the other social classes. Is the null hypothesis rejected?
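For reference, the test statistics behind tasks 5 and 6 can be written out as below. This is a sketch under stated assumptions: the hypothesised value in task 5 is taken to be 4.4, the alternative in task 5 is taken to be two-sided because the assignment does not state one, and task 6 uses the pooled-variance form, which assumes equal variances in the two groups. The symbols x̄, s, n and α (sample mean, sample standard deviation, sample size and significance level) are introduced here for illustration.

```latex
% Task 5: one-sample t-test within each class k, with \mu_0 = 4.4 (assumed)
\[
t_k = \frac{\bar{x}_k - \mu_0}{s_k / \sqrt{n_k}}, \qquad
\text{reject } H_0 \text{ at level } \alpha \text{ if }
|t_k| > t_{1-\alpha/2,\; n_k - 1}.
\]

% Task 6: one-sided two-sample t-test, rest (r) versus lower class (l)
\[
t = \frac{\bar{x}_r - \bar{x}_l}{s_p \sqrt{\frac{1}{n_r} + \frac{1}{n_l}}}, \qquad
s_p^2 = \frac{(n_r - 1)\,s_r^2 + (n_l - 1)\,s_l^2}{n_r + n_l - 2},
\]
\[
\text{reject } H_0 \text{ in favour of } H_a \text{ if }
t > t_{1-\alpha,\; n_r + n_l - 2}.
\]
```

In task 5, rejecting at level α is equivalent to 4.4 falling outside the (1 − α) confidence interval for that class's mean. If the equal-variance assumption in task 6 looks doubtful, the unequal-variance (Welch) version of the test is the usual alternative.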

Problem 3 - Using the dataset world.dta perform the following tasks:

1. Provide descriptive statistics for variables hdi2001 and eth_het.

2. Create a new variable named hdi which indicates the value of hdi2001 multiplied by 100. Add an appropriate label to this variable.

3. Create a new variable named ethnicity which indicates the value of eth_het multiplied by 100. Add an appropriate label to this variable.

4. Draw a scatterplot showing the relationship between hdi and ethnicity. Add a fitted line to the scatterplot, assuming that there is a linear relationship between the two variables. Describe this relationship, also using the value of the correlation.

5. Estimate the following regression

hdi = β₀ + β₁ethnicity

6. Discuss the main information of the regression including the meaning of β₀ and β₁, statistical significance and R-sq. (3 lines max)

N.B. Save the dataset to make sure that your new variables are preserved for later.
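A do-file for Problem 3 might look like the sketch below. It assumes world.dta is in the working directory; the label wording is illustrative (it guesses that hdi2001 is a human development index and eth_het an ethnic heterogeneity measure) and should be checked against the codebook.

```stata
* Problem 3 sketch; assumes world.dta is in the working directory
use "world.dta", clear

* Task 1: descriptive statistics
summarize hdi2001 eth_het

* Tasks 2-3: rescale both variables by 100 (label text is illustrative)
generate hdi = hdi2001 * 100
label variable hdi "Human development index 2001 (x100)"
generate ethnicity = eth_het * 100
label variable ethnicity "Ethnic heterogeneity (x100)"

* Task 4: scatterplot with a linear fit overlaid, plus the correlation
twoway (scatter hdi ethnicity) (lfit hdi ethnicity)
correlate hdi ethnicity

* Task 5: bivariate regression of hdi on ethnicity
regress hdi ethnicity

* N.B.: overwrite the dataset so the new variables persist for Problem 4
save "world.dta", replace
```

Because `save, replace` overwrites the original file, keeping an untouched copy of world.dta elsewhere is a sensible precaution.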

Problem 4 - Using the dataset world.dta as saved at the end of Problem 3, perform the following tasks:

1. Create a new variable named dem_area which indicates the value of the variable dem_oth multiplied by 100. Label the new variable dem_area as "% of democracies in same area". Look at the variable rural and create a new variable named rural_cat where 1 indicates that more than 50% of the population is rural and 0, otherwise. Provide relevant labels.

2. Estimate the following regression model

hdi = β₀ + β₁ethnicity + β₂dem_area + δ₁rural_cat

3. Discuss the main information of the regression including the meaning of β₁, β₂ and δ₁, statistical significance and R-sq. (5 lines max).

4. Show in a graph how hdi changes as ethnicity increases from 0 to 100 in increments of 10. Briefly describe the graph.

5. Show in a graph how hdi changes as dem_area increases from 0 to 100 in increments of 10. Briefly describe the graph.
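One possible way to approach Problem 4 is sketched below, using `margins` and `marginsplot` for the two graphs. It assumes the world.dta file saved at the end of Problem 3 (so hdi and ethnicity already exist) and that rural is recorded as a percentage from 0 to 100; if rural is a proportion, the cut-off would be 0.5 instead. The label names and wording for rural_cat are illustrative.

```stata
* Problem 4 sketch; assumes world.dta as saved at the end of Problem 3
use "world.dta", clear

* Task 1: rescale dem_oth and build the rural dummy
generate dem_area = dem_oth * 100
label variable dem_area "% of democracies in same area"

* ASSUMPTION: rural is measured in percent (0-100)
generate rural_cat = (rural > 50) if !missing(rural)
label variable rural_cat "Majority of population is rural"
label define rural_lbl 0 "50% or less rural" 1 "More than 50% rural"
label values rural_cat rural_lbl

* Task 2: multiple regression with rural_cat as a factor variable
regress hdi ethnicity dem_area i.rural_cat

* Task 4: predicted hdi as ethnicity goes from 0 to 100 in steps of 10
margins, at(ethnicity = (0(10)100))
marginsplot, name(hdi_by_ethnicity, replace)

* Task 5: predicted hdi as dem_area goes from 0 to 100 in steps of 10
margins, at(dem_area = (0(10)100))
marginsplot, name(hdi_by_dem_area, replace)
```

With this specification, `margins` averages the predictions over the observed values of the other covariates, so each plot shows a straight line whose slope equals the corresponding regression coefficient.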

Attachment:- Assignment Files.rar

Reviews

len2271838

4/1/2019 3:10:45 AM

You are expected to write a do-file that can be executed in STATA. This do-file must be printed to PDF and submitted to KEATS. To print your do-file to PDF, select Print from the File menu in the do-file editor and choose a PDF printer. If you do not have a PDF printer installed, you can download one for free from the internet (for example, PDFCreator). In any case, you should submit a file that looks exactly like your original do-file.

4/1/2019 3:10:38 AM

The do-file must contain a heading where your student ID is clearly visible. Structure your do-file following the numbering shown in the assignment sheet. You will be expected to enter some non-numerical answers, such as comments, as part of your answer. When this is the case, I expect short comments no longer than 2 or 3 lines, so be very precise. You may also find it useful to enter some short descriptive comments to better explain what you are doing. In both of these cases, make sure that STATA treats such content as text. Make sure that you run the do-file before submitting it. A do-file that does not run to the end will be penalised with 10% of the total marks. You are encouraged to work in groups. However, note that all of your answers must be the result of your individual judgment. Questions and queries regarding this assignment will be answered only via KEATS.
