DATA MINING, Basic Statistics

Assignment Help:
Please break this problem down into layman's terms so that I understand how you arrived at the answer.

1. AllElectronics carries 1000 products, P1, …, P1000. Consider customers Ada, Bob, and Cathy such that Ada and Bob purchase three products in common, P1, P2, and P3. For the other 997 products, Ada and Bob independently purchase seven of them randomly. Cathy purchases 10 products, randomly selected from the 1000 products. In Euclidean distance, what is the probability that dist(Ada, Bob) > dist(Ada, Cathy)? What if Jaccard similarity (Chapter 2) is used? What can you learn from this example? (Problem 11.2, Page 539-)

Book:
Data Mining: Concepts and Techniques, 3rd Edition
Problem 11.2, Page 539
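
One possible way to set the problem up, offered as a sketch and not as the book's official solution: assume each customer is a 0/1 vector over the 1000 products (1 = purchased). Every customer then has exactly 10 ones, so the squared Euclidean distance between two customers is 20 - 2 × (number of shared products), and the Jaccard similarity is shared / (20 - shared). Both depend only on how many products the two customers share, so under this encoding "Cathy is closer to Ada than Bob is" means the same thing for both measures: Cathy shares more products with Ada than Bob does. Bob shares 3 + X products with Ada, where X counts how many of his 7 random picks land on Ada's 7 random picks among the 997 other products. Cathy shares Y products with Ada, where Y counts how many of her 10 random picks land on Ada's 10 products. Both X and Y are hypergeometric. The function names below are illustrative only.

```python
from fractions import Fraction
from math import comb


def hypergeom_pmf(k, population, successes, draws):
    """P(exactly k successes) when drawing `draws` items without replacement."""
    if k < 0 or k > min(successes, draws):
        return Fraction(0)
    if draws - k > population - successes:
        return Fraction(0)
    ways = comb(successes, k) * comb(population - successes, draws - k)
    return Fraction(ways, comb(population, draws))


def squared_euclidean(shared):
    """Squared distance between two 0/1 baskets of 10 items sharing `shared` items."""
    return 20 - 2 * shared


def jaccard(shared):
    """Jaccard similarity of two 10-item baskets sharing `shared` items."""
    return Fraction(shared, 20 - shared)


def prob_cathy_closer_than_bob():
    """P(dist(Ada, Bob) > dist(Ada, Cathy)) under the 0/1 encoding assumed above.

    The event is: Cathy's overlap Y is strictly greater than Bob's overlap 3 + X.
    """
    total = Fraction(0)
    for x in range(0, 8):  # extra overlap between Ada's and Bob's 7 random picks
        p_x = hypergeom_pmf(x, 997, 7, 7)
        # Cathy needs at least 4 + x products in common with Ada.
        p_y_tail = sum(
            (hypergeom_pmf(y, 1000, 10, 10) for y in range(4 + x, 11)),
            Fraction(0),
        )
        total += p_x * p_y_tail
    return total


if __name__ == "__main__":
    p = prob_cathy_closer_than_bob()
    print("exact probability:", p)
    print("as a decimal:     ", float(p))
```

Under these assumptions the probability comes out very small, on the order of one in a million, and it is the same whether the comparison uses Euclidean distance or Jaccard similarity, because both only look at the size of the overlap when all baskets have 10 items. In plain terms: Bob is guaranteed 3 shared products with Ada, while Cathy almost never shares even 4 out of 1000 by chance, so Bob is nearly always the closer customer. If the intended encoding is different (for example, purchase counts instead of 0/1 flags), the setup would need to be adjusted.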



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd