Assessment using mapreduce for processing big data

Assignment Help Other Subject
Reference no: EM133160163

ICT303 Big Data - Crown Institute of Higher Education

Assessment - Using MapReduce for processing big data

LO 1: Design appropriate repository structure for storing big data.

LO 2: Design big data solutions using Map-reduce techniques.

Instructions

The following file is from Movielens dataset which shows user ratings for movies:

You can find more about this attached dataset in file

u.data is the full u data set with 100000 ratings by 943 users on 1682 items. Each user has rated at least 20 movies. Users and items are numbered consecutively from 1. The data is randomly ordered. This is a tab separated list of user id | item id | rating | timestamp. The time stamps are unix seconds since 1/1/1970 UTC. For example, the following line of the file

95 546 2 879196566

Is interpreted as follows: User 95 has rated movie 546, 2/5 (rates are in the range 1-5) at time 879196566 (Monday, November 10, 1997 9:16:06 PM, GMT).

Your task is to use MapReduce programming and find the following information for each movie: the average rating and the number of users who rated this movie. Here is an example of the output:

Movie ID

Average Rating

Number of Users Rated

340

3.78

298

499

4.02

532

You can choose the output format. However, the required information must be included in the output.

Hint: You can change the WordCount program such that it ignores all tokens in a line except the third one (rating value in the file exists in the third column).

The program must also print the name of group members on the screen.

Deliverable

You need to submit an MS Word or a PDF file which includes the following items:

- The source code for map and reduce function (copied/pasted into the MS Word or PDF file; no separate file is needed).

- Enough screenshots on the steps taken to get the program running.

- Screenshots for the output generated by the program. The name of group members must be also part of the printed information. Annotate all screenshots with brief descriptions (one line or two is enough).

- A section for discussion on your experience with MapReduce programming. To solve the given problem, what other tools and techniques are available? Compare MapReduce programming with the tools and techniques you mentioned. You can mention several factors like simplicity, scalability, reliability, etc.

Attachment:- MapReduce Programming.rar

Reference no: EM133160163

Questions Cloud

What ways does e-marketing differ from traditional marketing : In what ways does e-marketing differ from traditional marketing? Why are social networks becoming an increasingly important marketing tool?
Find a story from a mainstream news organization : Are you able to find a story from a mainstream news organization about ordinary life that involves people of color?
What values of x are within two standard deviations : Of the 10 DVDs, 9 are expected to last a minimum of 3 years. What values of x are within two standard deviations of the mean
Ksaos in recruitment and selection : What are the different selection methods that are relevant to KSAOs In recruitment and selection?
Assessment using mapreduce for processing big data : Assessment Using MapReduce for processing big data - Design appropriate repository structure for storing big data and Design big data solutions
Calculate bb current cash conversion cycle : Cost of goods sold was 57% of that total. Accounts receivable was $3,240,222, inventory was $842,020, Calculate BB's current cash conversion cycle
Redesign the job of the front desk receptionist : A large meat processing company calls you and says they need help developing ways to redesign the job of the front desk receptionist.
Discussing systemic discrimination : What are the key words you need to understand when discussing systemic discrimination?
Define the elements a leader needs : You are the proud Owner of Centennial's SLT Emporium, a specialty cold treats store. You are taking a course at College and have become aware that there are a v

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd