What problems if any do you run into

Reference no: EM131372443

DATS 6103: Introduction to Data Mining Problems

Use R to devise a book recommendation system for the data uploaded to Blackboard. In particular, develop a system that can recommend up to three books for an arbitrary user whose name can be entered into R after sourcing your code. Develop such a system using both of the following approaches:

(a) User-based collaborative filtering approach. Use the Euclidean, Manhattan, correlation, and cosine similarity measures. What problems (if any) do you run into?

(b) Item-based collaborative filtering approach. Use an adjusted cosine similarity approach as discussed in class. How does this approach compare to the user-based approach?
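As a rough illustration of the measures named in (a) and (b), here is a minimal sketch in R. It assumes a ratings matrix with users as rows and books as columns, where 0 means "not rated"; the toy data and the function names user_measure and adjusted_cosine are hypothetical, and the correlation measure is assumed to be Pearson correlation. The adjusted cosine shown here is the common textbook form (each rating is centred on that user's mean rating), which may differ in detail from the version discussed in class.

```r
# Toy ratings: rows are users, columns are books, 0 means "not rated".
# Names and values are made up for illustration only.
ratings <- matrix(
  c(8, 6, 0, 9,
    7, 0, 5, 8,
    0, 9, 4, 2),
  nrow = 3, byrow = TRUE,
  dimnames = list(c("ann", "bob", "cy"), c("b1", "b2", "b3", "b4"))
)

# User-based: compare two users on the books that both have rated.
user_measure <- function(x, y, method = "euclidean") {
  both <- x > 0 & y > 0
  if (sum(both) < 2) return(NA_real_)  # too little overlap to compare
  a <- x[both]
  b <- y[both]
  switch(method,
    euclidean = sqrt(sum((a - b)^2)),
    manhattan = sum(abs(a - b)),
    pearson   = if (sd(a) == 0 || sd(b) == 0) NA_real_ else cor(a, b),
    cosine    = sum(a * b) / sqrt(sum(a^2) * sum(b^2)),
    stop("unknown method: ", method)
  )
}

# Item-based: adjusted cosine between books i and j, using only users who
# rated both and subtracting each user's mean over the books they rated.
adjusted_cosine <- function(ratings, i, j) {
  rated <- ratings > 0
  user_mean <- rowSums(ratings) / pmax(rowSums(rated), 1)
  both <- rated[, i] & rated[, j]
  if (!any(both)) return(NA_real_)
  di <- ratings[both, i] - user_mean[both]
  dj <- ratings[both, j] - user_mean[both]
  denom <- sqrt(sum(di^2) * sum(dj^2))
  if (denom == 0) NA_real_ else sum(di * dj) / denom
}

user_measure(ratings["ann", ], ratings["bob", ], "manhattan")
adjusted_cosine(ratings, "b3", "b4")
```

Note that Euclidean and Manhattan are distances (smaller means more alike), while correlation and cosine are similarities (larger means more alike), so neighbours must be ranked in opposite directions depending on the measure. The NA returns mark pairs with too little overlap, which is one of the problems sparse rating data tends to produce.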

To load the data into R, use the read.csv function, e.g. read.csv(filename, header=TRUE). Type ?read.csv into the R console for further information on the function's syntax.
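A minimal loading sketch follows. The layout of the Blackboard file is not given here, so this assumes the first column holds user names and every other column holds one book's ratings; the stand-in file and its contents are invented so the example runs on its own.

```r
# Write a tiny stand-in file so the sketch runs end to end. In practice,
# set `path` to the location of the file downloaded from Blackboard.
path <- tempfile(fileext = ".csv")
writeLines(c("user,b1,b2,b3",
             "ann,8,6,0",
             "bob,7,0,5"), path)

raw <- read.csv(path, header = TRUE, stringsAsFactors = FALSE)

# Assumed layout: first column = user names, remaining columns = ratings.
ratings <- as.matrix(raw[, -1])
rownames(ratings) <- raw[[1]]

ratings["bob", "b3"]
```

Converting to a matrix with user names as row names makes it easy to look up a user by name later. If the real file stores books as rows instead, transposing with t() would give the same shape.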

Write your programs as functions, so that user names can be entered at the R prompt.

(c) What are some general problems with both approaches? Conceptually speaking, how can these issues be ameliorated?



Reviews

len1372443

1/27/2017 1:06:22 AM

There is some flexibility with respect to how you construct the details of your recommendation system beyond your nearest neighbor algorithm. For example, you may use more than one nearest neighbor to make your algorithm better and you can weight the distances appropriately as discussed in class. Please feel free to discuss what your code is doing in a Word document or PDF and submit that along with your assignment. This will make it easier for the grader to understand the logic behind your algorithm. Make sure your program ignores zero values for the purposes of computing distances. Otherwise your recommendation system will be influenced by unrated books. Use an estimated rating of above 5 as a threshold for the recommendation system. If your model cannot provide any recommendations for a particular individual, then please have it say so. You can discuss this in (c).
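The points above (several weighted neighbours, zeros ignored, a threshold above 5, and an explicit message when nothing can be recommended) might be combined as in the following sketch. Everything here is an assumption for illustration: the function name recommend, the toy data, the choice of mean Manhattan distance over co-rated books, and the 1 / (1 + distance) weighting are one possible design, not the scheme from class.

```r
# Toy ratings: rows are users, columns are books, 0 means "not rated".
ratings <- matrix(
  c(8, 6, 0, 9,
    7, 0, 5, 8,
    0, 9, 4, 2),
  nrow = 3, byrow = TRUE,
  dimnames = list(c("ann", "bob", "cy"), c("b1", "b2", "b3", "b4"))
)

recommend <- function(ratings, user, k = 2, threshold = 5, n_max = 3) {
  if (!user %in% rownames(ratings)) stop("unknown user: ", user)
  target <- ratings[user, ]
  others <- setdiff(rownames(ratings), user)

  # Mean Manhattan distance over co-rated books; unrated (zero) entries are skipped.
  dist_to <- vapply(others, function(o) {
    both <- target > 0 & ratings[o, ] > 0
    if (!any(both)) NA_real_ else mean(abs(target[both] - ratings[o, both]))
  }, numeric(1))
  neighbours <- head(sort(dist_to[!is.na(dist_to)]), k)
  if (length(neighbours) == 0) return("No recommendations available.")

  weights <- 1 / (1 + neighbours)  # closer neighbours count more
  unread <- colnames(ratings)[target == 0]

  # Weighted average of the neighbours' ratings for each unread book.
  est <- vapply(unread, function(b) {
    r <- ratings[names(neighbours), b]
    seen <- r > 0
    if (!any(seen)) NA_real_ else sum(weights[seen] * r[seen]) / sum(weights[seen])
  }, numeric(1))

  # Keep only estimates strictly above the threshold, best first.
  est <- sort(est[!is.na(est) & est > threshold], decreasing = TRUE)
  if (length(est) == 0) return("No recommendations available.")
  head(est, n_max)
}

recommend(ratings, "bob")
recommend(ratings, "ann")
```

With this toy data, "bob" gets one recommendation (b2), while "ann" gets the fallback message because her only unread book has an estimated rating below the threshold. Averaging the distance over co-rated books keeps users with many shared ratings from looking artificially far apart.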


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd