DATA MINING, Basic Statistics

Assignment Help:
please break this problem down to laymen term so that I understand how you arrived at the answer.

1. AllElectronics caries 1000 products, P1, … P1000. Consider customers Ada, Bob, and Cathy such that Ada and Bob purchase three products in common, P1, P2, and P3. For the other 997 products, Ada and Bob independently purchase seven of them randomly. Cathy purchases 10 products, randomly selected from the 1000 products. In Euclidean distance, what is the probability that dist(Ada, Bob) > dist(Ada, Cathy)? What if Jaccard similarity (Chapter 2) is used? What can you learn from this example? (Problem 11.2, Page 539-)

Book:
Data Mining: Concepts and Techniques, 3rd Edition
Problem 11.2, Page 539

Related Discussions:- DATA MINING

Discuss single control chart , Question 1 A courier company conducted ...

Question 1 A courier company conducted a brainstorming session amongst drivers to ascertain the reasons why it was unable to deliver items to households, always right first ti

Determine the simple linear regression model, You have collected data from ...

You have collected data from the factory on a critical to quality attribute. The attached Excel spreadsheet lists the response, Y and four potential predictors. You would like to m

Calculate the transition probabilities, Consider a person who repeatedly pl...

Consider a person who repeatedly plays a game of chance (gambling)with two results possible (win or lose) with a probability p = 0, 3 to win. If the person has bet x amount and if

Derive the pure strategy nash equilibrium, There are two firms competing in...

There are two firms competing in quantity. Firm 1 and 2 set their quantities supplied, q1 and q2, respectively. The production costs are zero. The market price is given by

DIFFARENCE , Differentiate between Historigrams and Histogram

Differentiate between Historigrams and Histogram

Estimate the parameters of the normal distribution, The sizes of 15 Califor...

The sizes of 15 California earthquakes are given below. 6.8   6.6  7.5  6.2  6.5  7.1  8.3  5.9  6.1  6.9  7.0  6.2  5.9  6.3  7.3 (a)  Assuming normal distribution for the s

Agglomerative hierarchical clustering methods/procedures, Agglomerative hie...

Agglomerative hierarchical clustering methods/procedures Methods of cluster analysis that start with each individual in the separate cluster and then, in the series of steps, c

Historigrams, Difference between historigram and histogram

Difference between historigram and histogram

.two functions of accounting, Two functions of Accounting. Accounting Pu...

Two functions of Accounting. Accounting Purchase only:  In the guides of Records only a transaction which is relevant to some cash value can be registered.  Posting: The Expl

Depression screening measures , A study of a new anti-depressant drug took ...

A study of a new anti-depressant drug took a sample of 10 individuals with high depression screening measures (DSM) and gave them the drug for three months.  At the end of the thre

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd