DATA MINING, Basic Statistics

Assignment Help:
please break this problem down to laymen term so that I understand how you arrived at the answer.

1. AllElectronics caries 1000 products, P1, … P1000. Consider customers Ada, Bob, and Cathy such that Ada and Bob purchase three products in common, P1, P2, and P3. For the other 997 products, Ada and Bob independently purchase seven of them randomly. Cathy purchases 10 products, randomly selected from the 1000 products. In Euclidean distance, what is the probability that dist(Ada, Bob) > dist(Ada, Cathy)? What if Jaccard similarity (Chapter 2) is used? What can you learn from this example? (Problem 11.2, Page 539-)

Book:
Data Mining: Concepts and Techniques, 3rd Edition
Problem 11.2, Page 539

Related Discussions:- DATA MINING

Spatial ability test and musical ability test, This question has multiple p...

This question has multiple parts. Use the following information to answer the questions below: Spatial Ability Test Musical Ability Test Mean 80 74 Standard Deviation 6

Population, Explain what is meant by population and sample

Explain what is meant by population and sample

The impact of the shift to ifrs, How do you think the banking sector is goi...

How do you think the banking sector is going to get impacted?   Manager 1: The impact of the shift to IFRS is going to be much higher in the banking sector. This impact woul

Standard error of the differnce between means., IN two separate studies, th...

IN two separate studies, the actual difference between the means of a treated group and untreated group is 3 points. However, in one study sm1-m2 is very large and so the 3 points

Marginal and absorption costing, the guinegog is a trader in portable cd-ma...

the guinegog is a trader in portable cd-man. His budgeted output is 5000 units per quarter. The following data was available for the year 1998: Direct labour @ $6 Direct material @

Tchebycheffs theorem, Tchebycheffs theorem As we all know standard deviati...

Tchebycheffs theorem As we all know standard deviation is a most widely used measure of variation. It has certain mathematical properties that facilitate development of statistica

INDEX NUMBER, FUNCTION SIMPLE AGGREGATIVE, SIMPLE RELATIVE

FUNCTION SIMPLE AGGREGATIVE, SIMPLE RELATIVE

Current development project and presentation , review the financial disclos...

review the financial disclosures for two publicly traded companies. Identify recently promulgated (or proposed) accounting pronouncements that have an impact on the companies. In a

Statistical analysis of hedge funds returns, In this problem set we are goi...

In this problem set we are going to analyze returns of indices for three hedge funds strategies (market neutral, risky arbitrage, long/short). The indices are constructed by CSFB/T

Statistic, difference between histogram and historigram

difference between histogram and historigram

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd