DATA MINING, Basic Statistics

please break this problem down to laymen term so that I understand how you arrived at the answer.

1. AllElectronics caries 1000 products, P1, … P1000. Consider customers Ada, Bob, and Cathy such that Ada and Bob purchase three products in common, P1, P2, and P3. For the other 997 products, Ada and Bob independently purchase seven of them randomly. Cathy purchases 10 products, randomly selected from the 1000 products. In Euclidean distance, what is the probability that dist(Ada, Bob) > dist(Ada, Cathy)? What if Jaccard similarity (Chapter 2) is used? What can you learn from this example? (Problem 11.2, Page 539-)

Book:
Data Mining: Concepts and Techniques, 3rd Edition
Problem 11.2, Page 539
Posted Date: 2/19/2013 3:51:34 PM | Location : United States







Related Discussions:- DATA MINING, Assignment Help, Ask Question on DATA MINING, Get Answer, Expert's Help, DATA MINING Discussions

Write discussion on DATA MINING
Your posts are moderated
Related Questions
Q.2. (ii) A company making lamps has drawn a sample from its production line and measured the light output from each. The results, in microamps, are as follows: 9.1 9.8 9.5

I would just like the formula to find the y-axis of a centroid.

A car moves with constant velocity along a straight road. Its position is x1 = 0m at t1 = 0 seconds and is x2 = 56 m at t2 = 5.0 s. what is the cars position at t=2.5 seconds and

Modern hotels and certain establishments make use of an electronic door lock system. To open a door an electronic card is inserted into a slot. A green light indicates that the doo

any of your writer able to use the database given to generate the null, alternative..etc.. into a power point presentation

what is the electric field at point x if q=7*10^-6 C d=6m

Armitage-Dollmodel The model of carcinogenesis in which the basic idea is that the essential variable determining the change in the risk is not age, but time period. The model

importance of time series analysis?

identify a research report published by reputable agencies and evaluate the following ;the problem that was addressed

Which of the following does Utts consider a disaster in sampling?sk question #Minimum 100 words accepted#