Reference no: EM131372443
Introduction to Data Mining Problems
Use R to devise a book recommendation system for the data uploaded to Blackboard. In particular, develop a system that can recommend up to three books for an arbitrary user that can be entered into R after sourcing your code. Develop such a system using both a:
(a) User-based collaborative filtering approach. Use the Euclidean, Manhattan, correlation, and cosine measures of distance or similarity. What problems (if any) do you run into?
(b) Item-based collaborative filtering approach. Use an adjusted cosine similarity approach as discussed in class. How does this approach compare to the user-based approach?
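For part (a), the four measures can be compared on a toy example. The sketch below assumes a ratings matrix with one row per user and one column per book, with NA for unrated books. The matrix, the user and book names, and the function user_measure are all made up for illustration and are not taken from the Blackboard data.

```r
# Toy ratings: rows are users, columns are books, NA means "not rated".
# All names and values are invented for illustration.
ratings <- rbind(
  alice = c(bookA = 5, bookB = 3, bookC = NA, bookD = 1),
  bob   = c(bookA = 4, bookB = NA, bookC = 2, bookD = 1),
  carol = c(bookA = 1, bookB = 5, bookC = 4, bookD = NA)
)

# Compare two users using only the books that both have rated.
user_measure <- function(ratings, u, v, method = "euclidean") {
  x <- ratings[u, ]
  y <- ratings[v, ]
  both <- !is.na(x) & !is.na(y)
  if (!any(both)) return(NA_real_)  # no overlap: nothing to compare
  x <- x[both]
  y <- y[both]
  switch(method,
    euclidean = sqrt(sum((x - y)^2)),  # distance: smaller means closer
    manhattan = sum(abs(x - y)),       # distance: smaller means closer
    # Pearson correlation is undefined with < 2 points or zero variance.
    pearson   = if (length(x) < 2 || sd(x) == 0 || sd(y) == 0) NA_real_ else cor(x, y),
    cosine    = sum(x * y) / (sqrt(sum(x^2)) * sqrt(sum(y^2))),
    stop("unknown method: ", method)
  )
}

user_measure(ratings, "alice", "bob", "euclidean")  # 1
user_measure(ratings, "alice", "bob", "pearson")    # 1
```

Euclidean and Manhattan are distances (smaller means more alike), while correlation and cosine are similarities (larger means more alike), so the ranking direction has to be flipped when switching between them. In this toy data alice and bob share only two books, and a Pearson correlation over two points is always +1 or -1, which illustrates one kind of problem that sparse overlap can cause.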
To load the data into R, use the read.csv function, e.g. read.csv(filename, header=TRUE). Type ?read.csv into the R console for further information on the function's syntax.
Write your programs as functions, so that the name of a user can be entered at the R prompt.
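A minimal sketch of how loading the data and wrapping the recommender in a function might fit together. The CSV layout (one row per user, one column per book, blank cells for unrated books), the inline sample data, and the function name recommend are assumptions; the actual Blackboard file may be organised differently. The sketch uses a mean Manhattan distance to the single nearest user, which is only one of several possible choices.

```r
# Hypothetical CSV layout; inline text stands in for the real file.
csv_text <- "user,bookA,bookB,bookC,bookD
alice,5,3,,1
bob,4,,2,1
carol,1,5,4,
dave,5,4,1,2"

# With a real file this would be read.csv(filename, header = TRUE).
raw <- read.csv(text = csv_text, header = TRUE)
ratings <- as.matrix(raw[, -1])   # drop the user column, keep the ratings
rownames(ratings) <- raw$user

# Recommend up to n books that the nearest user rated and `user` has not.
recommend <- function(ratings, user, n = 3) {
  if (!user %in% rownames(ratings)) stop("unknown user: ", user)
  mine <- ratings[user, ]
  others <- setdiff(rownames(ratings), user)
  # Mean absolute difference over co-rated books (NA if no overlap).
  dist_to <- sapply(others, function(v) {
    theirs <- ratings[v, ]
    both <- !is.na(mine) & !is.na(theirs)
    if (!any(both)) return(NA_real_)
    mean(abs(mine[both] - theirs[both]))
  })
  if (all(is.na(dist_to))) return(character(0))
  nearest <- names(which.min(dist_to))
  theirs <- ratings[nearest, ]
  unseen <- theirs[is.na(mine) & !is.na(theirs)]
  # Highest-rated unseen books first; head() returns fewer than n if needed.
  as.character(head(names(sort(unseen, decreasing = TRUE)), n))
}

recommend(ratings, "alice")  # "bookC"
```

After sourcing a file containing these definitions, a call such as recommend(ratings, "bob") can be typed directly at the R prompt.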
(c) What are some general problems with both approaches? Conceptually speaking, how can these issues be ameliorated?
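For the item-based approach in part (b), one common formulation of adjusted cosine similarity subtracts each user's mean rating before taking the cosine between two book columns, using only the users who rated both books. The sketch below follows that formulation; the version discussed in class may differ in detail, and the toy matrix and the function name adjusted_cosine are invented for illustration.

```r
# Toy ratings: rows are users, columns are books, NA means "not rated".
ratings <- rbind(
  alice = c(bookA = 5, bookB = 3, bookC = 4, bookD = 1),
  bob   = c(bookA = 4, bookB = NA, bookC = 2, bookD = 1),
  carol = c(bookA = 1, bookB = 5, bookC = 4, bookD = NA)
)

# Adjusted cosine similarity between books i and j.
adjusted_cosine <- function(ratings, i, j) {
  # Subtract each user's mean rating (the vector recycles down the rows).
  centered <- ratings - rowMeans(ratings, na.rm = TRUE)
  x <- centered[, i]
  y <- centered[, j]
  both <- !is.na(x) & !is.na(y)  # users who rated both books
  denom <- sqrt(sum(x[both]^2)) * sqrt(sum(y[both]^2))
  if (denom == 0) return(NA_real_)  # no overlap or no variation
  sum(x[both] * y[both]) / denom
}

adjusted_cosine(ratings, "bookA", "bookD")
```

Centering by user mean is what distinguishes this from plain cosine similarity: it is intended to compensate for users who rate everything high or everything low.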