DATA MINING, Basic Statistics

Assignment Help:
please break this problem down to laymen term so that I understand how you arrived at the answer.

1. AllElectronics caries 1000 products, P1, … P1000. Consider customers Ada, Bob, and Cathy such that Ada and Bob purchase three products in common, P1, P2, and P3. For the other 997 products, Ada and Bob independently purchase seven of them randomly. Cathy purchases 10 products, randomly selected from the 1000 products. In Euclidean distance, what is the probability that dist(Ada, Bob) > dist(Ada, Cathy)? What if Jaccard similarity (Chapter 2) is used? What can you learn from this example? (Problem 11.2, Page 539-)

Book:
Data Mining: Concepts and Techniques, 3rd Edition
Problem 11.2, Page 539

Related Discussions:- DATA MINING

Simulation for structural equation modeling, I want to simulate observed va...

I want to simulate observed variables for structural equation modeling. In real data it is assumed that observed variables are not error free variables, so should i also simulate e

Survey statistics, #qa national poll of 1836 respondents indicated that 36%...

#qa national poll of 1836 respondents indicated that 36% support the NDP. Test whether this is sufficient evidence to show that the NDP support has increased since the election. Us

Interpolation and extrapolation, What is meant by interpolation and extrapo...

What is meant by interpolation and extrapolation. State the assumptions used for interpolation and extrapolation

What is a valuation account, What is a valuation account? In accounting...

What is a valuation account? In accounting, an assessment consideration is usually a stability sheet consideration that is used in combination with another stability sheet cons

Cost accounting , is direct costing variable, relevant or prime costing

is direct costing variable, relevant or prime costing

Quantitative techniques and decision making, Let the national income model ...

Let the national income model be: Y= c+1+G C=20+0.6y I=0.2y G=20 Where y= income, C= consumption, I= investment and G=government expenditure find y, C and I from the model. By quan

F2 test distributed populations, What is F2 Test, These tests were based o...

What is F2 Test, These tests were based on the assumption that the samples were drawn from normally distributed populations, or more accurately that the sample means were normally

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd