What is the highest accuracy you are able to achieve

Reference no: EM131960409

Assignment

1 Lalonde NSW Data

A. Load the Lalonde experimental dataset with the lalonde_data method from the module causalinference.utils. The outcome variable is earnings in 1978, and the covariates are, in order:

Black       Indicator variable; 1 if Black, 0 otherwise.
Hispanic    Indicator variable; 1 if Hispanic, 0 otherwise.
Age         Age in years.
Married     Marital status; 1 if married, 0 otherwise.
Nodegree    Indicator variable; 1 if no degree, 0 otherwise.
Education   Years of education.
E74         Earnings in 1974.
U74         Unemployment status in 1974; 1 if unemployed, 0 otherwise.
E75         Earnings in 1975.
U75         Unemployment status in 1975; 1 if unemployed, 0 otherwise.

Using CausalModel from the module causalinference, provide summary statistics for the outcome variable and the covariates. Which covariate has the largest normalized difference?
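For orientation, the normalized difference of a covariate is commonly defined as the difference between the treated and control means, divided by the square root of the average of the two group variances. The sketch below computes it with the standard library on made-up numbers; the lists treated and control are hypothetical and are not Lalonde data, and it is an assumption that the package's summary table uses this same definition.

```python
from math import sqrt
from statistics import mean, variance


def normalized_difference(treated, control):
    """(mean_t - mean_c) / sqrt((var_t + var_c) / 2), using sample variances."""
    pooled_sd = sqrt((variance(treated) + variance(control)) / 2)
    return (mean(treated) - mean(control)) / pooled_sd


# Toy values for a single covariate (hypothetical, not taken from the Lalonde data).
treated = [1.0, 2.0, 3.0, 4.0, 5.0]
control = [2.0, 3.0, 4.0, 5.0, 6.0]

nd = normalized_difference(treated, control)
print(round(nd, 4))
```

Because the measure is scaled by the spread of the covariate rather than by the sample size, it can be compared across covariates measured in different units.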

B. Estimate the propensity score using the selection algorithm est_propensity_s. In selecting the basic covariate set, specify E74, U74, E75, and U75. Which additional linear terms and second-order terms does the algorithm select?

C. Trim the sample using trim_s to get rid of observations with extreme propensity score values. What is the cut-off that is selected? How many observations are dropped as a result?
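As a rough illustration of what trimming does, the sketch below keeps only units whose estimated propensity score lies between a cut-off and one minus that cut-off. The scores and the cut-off of 0.1 are made up for the example, and the helper trim is hypothetical; this does not reproduce the data-driven cut-off selection that trim_s performs.

```python
def trim(scores, cutoff):
    """Return the indices of units with cutoff <= score <= 1 - cutoff."""
    return [i for i, p in enumerate(scores) if cutoff <= p <= 1 - cutoff]


# Hypothetical propensity scores for six units.
scores = [0.02, 0.15, 0.48, 0.73, 0.91, 0.97]

kept = trim(scores, cutoff=0.1)
dropped = len(scores) - len(kept)
print(kept, dropped)
```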

D. Stratify the sample using stratify_s. How many propensity bins are created? Report the summary statistics for each bin.

E. Estimate the average treatment effect using OLS, blocking, and matching. For matching, set the number of matches to 2 and adjust for bias. How much do the estimates differ?

2 Document Classification

A. From the module sklearn.datasets, load the training data set using the method fetch_20newsgroups. This dataset comprises around 18,000 newsgroup posts on 20 topics. Print out a couple of sample posts and list all the topic names.

B. Convert the posts (blobs of text) into bag-of-words vectors. What is the dimensionality of these vectors? That is, how many distinct words appear in this data set?
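To make the link between vocabulary size and vector dimensionality concrete, here is a minimal standard-library sketch on three made-up posts. The tokenization is a plain whitespace split, which is an assumption for illustration; a library vectorizer such as sklearn's CountVectorizer applies its own tokenization rules, so the word count it reports for the same text can differ.

```python
from collections import Counter

# Three made-up posts standing in for the newsgroups data.
posts = [
    "the pitcher threw the ball",
    "the rocket reached orbit",
    "the ball left the park",
]

# Vocabulary: every distinct whitespace-separated token, sorted for a stable column order.
vocabulary = sorted({word for post in posts for word in post.split()})


def to_vector(post):
    """Count how often each vocabulary word occurs in the post."""
    counts = Counter(post.split())
    return [counts[word] for word in vocabulary]


vectors = [to_vector(post) for post in posts]
print(len(vocabulary))  # dimensionality shared by every vector
print(vectors[0])
```

Each post becomes a vector with one entry per vocabulary word, so the dimensionality equals the number of distinct words in the whole collection, not in any single post.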

C. Use your favorite dimensionality reduction technique to compress these vectors into ones of K = 30 dimensions.

D. Train your favorite supervised learning model to predict the topic of a post from the vectorized representation of the post you obtained in the previous step.

E. Use the test data to tune your model. Make sure to include K as a hyperparameter as well. Use accuracy_score from sklearn.metrics as your evaluation metric. What is the highest accuracy you are able to achieve?
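One possible way to treat K as a hyperparameter is to wrap the vectorizer, the reduction step, and the classifier in a single pipeline and loop over candidate values of K. The sketch below assumes TruncatedSVD for the reduction and LogisticRegression for the classifier (the assignment leaves both choices open), and it uses a tiny made-up corpus in place of the newsgroups posts so that it runs without a download. The candidate values 2, 3, and 4 are only there because the toy data is small.

```python
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.pipeline import make_pipeline

# Tiny made-up corpus standing in for the newsgroups posts (0 = sports, 1 = space).
train_posts = [
    "the team won the game", "the pitcher threw the ball",
    "a late goal won the match", "the coach praised the team",
    "the rocket reached orbit", "the probe landed on mars",
    "the satellite left orbit", "astronauts boarded the rocket",
]
train_labels = [0, 0, 0, 0, 1, 1, 1, 1]
eval_posts = ["the team lost the game", "the rocket left mars"]
eval_labels = [0, 1]

accuracy_by_k = {}
for k in (2, 3, 4):  # candidate values of K (illustrative only)
    model = make_pipeline(
        CountVectorizer(),                             # posts -> bag-of-words counts
        TruncatedSVD(n_components=k, random_state=0),  # counts -> K dimensions
        LogisticRegression(max_iter=1000),             # K dimensions -> topic
    )
    model.fit(train_posts, train_labels)
    accuracy_by_k[k] = accuracy_score(eval_labels, model.predict(eval_posts))

best_k = max(accuracy_by_k, key=accuracy_by_k.get)
print(accuracy_by_k, best_k)
```

With the real data, train_posts and eval_posts would come from the train and test subsets returned by fetch_20newsgroups, and the candidate list would include K = 30 alongside other values.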

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd