Probability Problems - Do the following problems.
1. Consider the minimax criterion for a two-category classification problem.
(a) Fill in the steps of the derivation of Eq. 23:
R(P(ω1)) = λ22 + (λ12 - λ22) ∫_{R1} p(x|ω2) dx
           + P(ω1) [ (λ11 - λ22) + (λ21 - λ11) ∫_{R2} p(x|ω1) dx - (λ12 - λ22) ∫_{R1} p(x|ω2) dx ].
(b) Explain why overall Bayes risk must be concave down as a function of the prior P(ω1) as shown in Fig. 2.4.

(c) Assume we have one-dimensional Gaussian distributions p(x|ωi) ∼ N(µi, σi²), i = 1, 2, but completely unknown prior probabilities. Use the minimax criterion to find the optimal decision point x* in terms of µi and σi under a zero-one risk. (A numerical sketch for parts (c)-(e) follows part (f).)
(d) For the decision point x* you found in (c), what is the overall minimax risk? Express this risk in terms of an error function erf(·).
(e) Assume p(x|ω1) ∼ N(0, 1) and p(x|ω2) ∼ N(1/2, 1/4), under a zero-one loss. Find x* and the overall minimax loss.
(f) Assume p(x|ω1) ∼ N(5, 1) and p(x|ω2) ∼ N(6, 1). Without performing any explicit calculations, determine x* for the minimax criterion. Explain your reasoning.
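Under a zero-one loss, parts (c)-(e) reduce to finding the threshold at which the two conditional error probabilities are equal. Below is a minimal numerical sketch, assuming µ1 < µ2 and a single decision threshold x* (decide ω1 for x < x*); the helper name minimax_threshold is illustrative, not from the text.

```python
# Minimax threshold for two 1-D Gaussians under zero-one loss.
# Minimax condition with a single threshold: P(error|omega_1) = P(error|omega_2).
from scipy.optimize import brentq
from scipy.stats import norm

def minimax_threshold(mu1, s1, mu2, s2):
    """Solve 1 - Phi((x - mu1)/s1) = Phi((x - mu2)/s2) for x*."""
    f = lambda x: (1.0 - norm.cdf(x, mu1, s1)) - norm.cdf(x, mu2, s2)
    lo = min(mu1, mu2) - 10.0 * max(s1, s2)
    hi = max(mu1, mu2) + 10.0 * max(s1, s2)
    return brentq(f, lo, hi)    # f changes sign between lo and hi

# Part (e): p(x|omega_1) ~ N(0, 1), p(x|omega_2) ~ N(1/2, 1/4), so sigma_2 = 1/2.
x_star = minimax_threshold(0.0, 1.0, 0.5, 0.5)
risk = 1.0 - norm.cdf(x_star, 0.0, 1.0)   # equals norm.cdf(x_star, 0.5, 0.5)
print(f"x* = {x_star:.4f}, minimax risk = {risk:.4f}")
```

For part (e) this returns x* = 1/3, a useful check on the closed-form answer.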
2. Let the conditional densities for a two-category one-dimensional problem be given by the Cauchy distribution
p(x|ωi) = (1/(πb)) · 1/(1 + ((x - ai)/b)²),  i = 1, 2.
(a) By explicit integration, check that the distributions are indeed normalized.
(b) Assuming P(ω1) = P(ω2), show that P(ω1|x) = P(ω2|x) if x = (a1 + a2)/2, i.e., that the minimum-error decision boundary is the point midway between the peaks of the two distributions, regardless of b.
(c) Plot P(ω1|x) for the case a1 = 3, a2 = 5, and b = 1. (A plotting sketch follows part (d).)
(d) How do P(ω1|x) and P(ω2|x) behave as x → -∞? x → ∞? Explain.
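For part (c), a minimal plotting sketch; with equal priors the priors cancel in Bayes' rule, so P(ω1|x) = p(x|ω1)/(p(x|ω1) + p(x|ω2)). The tails of the plot also provide a visual check on part (d).

```python
# Posterior P(omega_1|x) for the Cauchy class conditionals of Problem 2(c):
# a1 = 3, a2 = 5, b = 1, equal priors.
import numpy as np
import matplotlib.pyplot as plt

def cauchy(x, a, b):
    """p(x|omega_i) = (1/(pi*b)) / (1 + ((x - a)/b)^2)"""
    return (1.0 / (np.pi * b)) / (1.0 + ((x - a) / b) ** 2)

x = np.linspace(-10.0, 20.0, 1000)
p1, p2 = cauchy(x, 3.0, 1.0), cauchy(x, 5.0, 1.0)

plt.plot(x, p1 / (p1 + p2))       # P(omega_1|x)
plt.axvline(4.0, linestyle="--")  # boundary at (a1 + a2)/2, per part (b)
plt.xlabel("x")
plt.ylabel("P(omega_1 | x)")
plt.show()
```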
3. (a) Suppose we have two normal distributions with the same covariance matrix but different means: N(µ1, Σ) and N(µ2, Σ). In terms of the prior probabilities P(ω1) and P(ω2), state the condition under which the Bayes decision boundary does not pass between the two means.
(b) Consider the example on pp. 45-46 of Lecture 3, which treats a 3-class problem with general normal distributions. For the given distributions, derive the equations of the boundaries between classes (plots of the boundaries are given in the notes). Compute numerically the coordinates of the two points where the boundaries of all three classes meet.
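The triple points in part (b) satisfy g1(x) = g2(x) = g3(x), two equations in the two coordinates of x, which a root finder can solve directly. A sketch under assumed placeholder parameters (substitute the actual means, covariances, and priors from the lecture notes):

```python
# Numeric search for the two triple points of Problem 3(b).
# All means, covariances, and priors below are PLACEHOLDERS, not the
# distributions from Lecture 3 pp. 45-46.
import numpy as np
from scipy.optimize import fsolve

mus    = [np.array([0.0, 0.0]), np.array([3.0, 1.0]), np.array([1.0, 3.0])]
covs   = [np.eye(2), np.diag([2.0, 0.5]), np.array([[1.0, 0.5], [0.5, 1.0]])]
priors = [1/3, 1/3, 1/3]

def g(x, i):
    # Quadratic discriminant of Eq. 49; the -(d/2) ln 2pi term is common
    # to all classes and therefore dropped.
    d = x - mus[i]
    return (-0.5 * d @ np.linalg.inv(covs[i]) @ d
            - 0.5 * np.log(np.linalg.det(covs[i])) + np.log(priors[i]))

def triple(x):
    return [g(x, 0) - g(x, 1), g(x, 0) - g(x, 2)]

for x0 in ([0.0, 2.0], [4.0, 4.0]):   # several starts to catch both points
    print(fsolve(triple, np.array(x0)))
```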
4. Consider a two-category classification problem in two dimensions with p(x|ω1) ∼ N(0, I), p(x|ω2) ∼ N((1, 1)^t, I), and P(ω1) = P(ω2) = 1/2.
(a) Calculate the Bayes decision boundary.
(b) Calculate the Bhattacharyya error bound.
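For part (b), a sketch of the Bhattacharyya computation, assuming the distributions stated above; bhattacharyya_bound is an illustrative helper, not a library function.

```python
# Bhattacharyya bound for two Gaussians:
#   P(error) <= sqrt(P1 * P2) * exp(-k(1/2)), with
#   k(1/2) = (1/8) dm^t [(S1 + S2)/2]^{-1} dm
#            + (1/2) ln( det((S1 + S2)/2) / sqrt(det(S1) det(S2)) ).
import numpy as np

def bhattacharyya_bound(mu1, S1, mu2, S2, P1=0.5, P2=0.5):
    dm = mu2 - mu1
    Sm = 0.5 * (S1 + S2)
    k = (0.125 * dm @ np.linalg.inv(Sm) @ dm
         + 0.5 * np.log(np.linalg.det(Sm)
                        / np.sqrt(np.linalg.det(S1) * np.linalg.det(S2))))
    return np.sqrt(P1 * P2) * np.exp(-k)

# Identity covariances: the log-determinant term vanishes and k(1/2) = 1/4.
print(bhattacharyya_bound(np.zeros(2), np.eye(2), np.ones(2), np.eye(2)))
```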
5. Use the classifier given by Eq. 49:
gi(x) = -½ (x - µi)^t Σi^{-1} (x - µi) - (d/2) ln 2π - ½ ln|Σi| + ln P(ωi)
to classify the following 10 samples from the table
Sample |        ω1             |        ω2             |        ω3
       |    x1     x2     x3   |    x1     x2     x3   |    x1     x2     x3
   1   |  -5.01  -8.12  -3.68  |  -0.91  -0.18  -0.05  |   5.35   2.26   8.13
   2   |  -5.43  -3.48  -3.54  |   1.30  -2.06  -3.53  |   5.12   3.22  -2.66
   3   |   1.08  -5.52   1.66  |  -7.75  -4.54  -0.95  |  -1.34  -5.31  -9.87
   4   |   0.86  -3.78  -4.11  |  -5.47   0.50   3.92  |   4.48   3.42   5.19
   5   |  -2.67   0.63   7.39  |   6.14   5.72  -4.85  |   7.11   2.39   9.21
   6   |   4.94   3.29   2.08  |   3.60   1.26   4.36  |   7.17   4.33  -0.98
   7   |  -2.51   2.09  -2.59  |   5.37  -4.63  -3.65  |   5.75   3.97   6.65
   8   |  -2.25  -2.13  -6.94  |   7.18   1.46  -6.66  |   0.77   0.27   2.41
   9   |   5.56   2.86   2.26  |  -7.39   1.17   6.30  |   0.90  -0.43  -8.71
  10   |   1.03  -3.33   4.33  |  -7.50  -6.32  -0.31  |   3.52  -0.36   6.43
in the following way. Assume that the underlying distributions are normal.
(a) Assume that the prior probabilities for the first two categories are equal, P(ω1) = P(ω2) = ½, and that P(ω3) = 0, and design a dichotomizer for those two categories using only the x1 feature value. (A code sketch for parts (a) and (b) follows part (f).)
(b) Determine the empirical training error on your samples, i.e., the percentage of points misclassified.
(c) Use the Bhattacharyya bound to bound the error you will get on novel patterns drawn from the distributions.
(d) Repeat all of the above, but now use two feature values, x1 and x2.
(e) Repeat, but use all three feature values.
(f) Discuss your results. In particular, is it ever possible, for a finite set of data, that the empirical error is larger when more feature dimensions are used?
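A minimal sketch for parts (a) and (b): fit each class-conditional by maximum likelihood from the x1 column of the table, plug the estimates into Eq. 49, and count misclassified training points. Extending the arrays to (x1, x2) and to all three features covers parts (d) and (e).

```python
# Gaussian dichotomizer on the x1 feature (Problem 5(a)-(b)), with
# P(omega_1) = P(omega_2) = 1/2.  Data are the x1 columns of the table.
import numpy as np

w1 = np.array([-5.01, -5.43, 1.08, 0.86, -2.67, 4.94, -2.51, -2.25, 5.56, 1.03])
w2 = np.array([-0.91, 1.30, -7.75, -5.47, 6.14, 3.60, 5.37, 7.18, -7.39, -7.50])

def g(x, mu, var):
    # Eq. 49 for d = 1; equal priors, so ln P(omega_i) and the ln 2pi
    # term drop out of the comparison.
    return -0.5 * (x - mu) ** 2 / var - 0.5 * np.log(var)

mu1, v1 = w1.mean(), w1.var()   # ML estimates (np.var defaults to the
mu2, v2 = w2.mean(), w2.var()   # biased ML form, ddof=0)

errors = (np.sum(g(w1, mu1, v1) < g(w1, mu2, v2)) +
          np.sum(g(w2, mu2, v2) < g(w2, mu1, v1)))
print(f"empirical training error: {errors / 20:.0%}")
```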
Textbook - Duda, R. O., Hart, P. E., and Stork, D. G., Pattern Classification, 2nd edition, Wiley, 2001.