What is the type of the kinds of attributes

Assignment Help Basic Statistics
Reference no: EM133245287

1) What is the type of the following kinds of attributes (a) age (in years), (b) salary, (c) ZIP code, (e) height, and (f) intensity of rain? Classify them as continuous or discrete, and as qualitative (nominal or ordinal) or quantitative (interval or ratio).

2)An analyst sets up a sensor network in order to measure the temperature of different locations over a time period. What is the type of attributes collected (temperature)? What is the type of the dataset?

3) It is desired to partition customers into similar groups on the basis of their demographic profile.

a. What features could we use? Provide 3 examples. Would you describe such data as heterogeneous?

b. Which data mining problem is best suited to this task?

4)Suppose that you had a set of arbitrary objects, each representing different characteristics of gadgets. A domain expert gave you the similarity value between every pair of objects. How would you convert these objects into a multidimensional data set for clustering the gadgets ?

5)Suppose that you had a data set, such that each data point corresponds to sea-surface temperatures over a square mile of resolution 10×10. In other words, each data record contains a 10×10 grid of temperature values with spatial locations. You also have some text associated with each 10×10 grid. How would you convert this data into a multidimensional data set? How many features will each data point have?

6) Compute the cosine similarity, Jaccard coefficient (if possible, for binary vectors), Euclidean distance, correlation coefficient for the following vectors, x, y:

a. x = (0, -1, 1, 2,-2), y = (0, -2, 2, 4, -4)

b. x = (0, 1, 0, 0, 0), y = (0, 1, 0, 0, 1)

c. x = (-1, -1, -1, -1, -1), y = (1, 1, 1, 1, 1)

7) Compute the cosine similarity and the Jaccard coefficient, between the two sets {A, B, C} and {A, C, D, E}. Hint: how will you represent each set?

8) Create three documents, A, B, and C such that the Euclidean distance between A and B is smaller than the Euclidean distance between A and C, even though documents A and B have no common words whereas documents A and C have some common words.

9) Are the following similarity measures good or bad for finding similarity in document-term data? Provide a one-line justification for each answer you provide.

a. correlation

b. cosine

c. Euclidean

Reference no: EM133245287

Questions Cloud

What proportion of scores are higher than : Which z-score has approximately 20% of scores in the tail of the distribution?
What is the impact of globalization on the transmission : What are some of the emerging issues in national and international health inequities, and how are health systems attempting to address them?
Identifying ways to improve fuel efficiency : You have been hired to conduct business research for the purpose of identifying ways to improve fuel efficiency without disturbing consumer preference.
Identify the expected stool consistency for ostomies : Stool consistency ranges from liquid to formed, depending on the location of the ostomy. Identify the expected stool consistency for ostomies.
What is the type of the kinds of attributes : 1) What is the type of the following kinds of attributes (a) age (in years), (b) salary, (c) ZIP code, (e) height, and (f) intensity of rain? Classify them as c
What is the value of the test statistic : In order to know whether there is a significant difference between the average yearly incomes of marketing managers in the East and West of the United States, t
True proportion of orange candies : For Mr. p's birthday, Mr. l bought Mr. p a huge Reese's candy machine filled with Reese's Pieces. Mr. l promised Mr. p that 40% of the candies in the machine we
Relationships between variables or differences between group : Think of some challenges you have faced in your current or previous employment. Summarize the problem, develop a research question, and state the null and alter
Mean body weight of a population : 50 years ago, the mean body weight of a population of penguins was 23 kg. Researchers are concerned that the mean body weight is decreasing, so they took a rand

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd