Design algorithmic models for the application

Assignment Help Other Engineering
Reference no: EM133707674 , Length: word count:2000

Machine Learning Applications

Assessment - Design a Text Retrieval System

Coding and Presentation

Your Task

Design a text retrieval system to find similar movies/shows based on the descriptions.

Assessment Description

We humans communicate using different languages, either by speaking or writing. Text data is abundant in the real world. It's a challenging task to work with natural languages. Your team lead has assigned you one such task of recommending movies based on the movie description.

Data

A movies/shows dataset with description is curated by pre-processing the Kaggle IMDb Movies/Shows with Descriptions dataset and is provided to you in MyKBS. You are encouraged to explore the original source.

The original dataset is pre-processed and is provided in 2 files - train.csv and test.csv. MyKBS provides you these files each containing following columns:

title: Title of the movie/show.
description: Description of the movie/show.

You are required to train a text retrieval system using the train.csv file. And test the system using the test.csv file.

Problem Statement

As an individual, you are required to download the data sets, i.e., train.csv and test.csv files from MyKBS. You must build a text retrieval system to find similar movies/shows based on the descriptions. You should systematically approach the problem by addressing the below tasks:

Load the data sets and pre-process them to fit your requirements. You must use at least two pre-processing techniques. (5 marks)

Design a text retrieval system using TF-IDF (with inverted file) algorithm. (10 marks)

Find the top 3 movies/shows matches in the train.csv based on the descriptions provided in the test.csv. (5 marks)

You are to record a 5-minute video accompanying PowerPoint slides to elaborate the approach and performance of the system using relevant metric(s). In recording this video, you will need to prepare accompanying PowerPoint slides thar are clear, concise, of the required quality and references in accordance with the Kaplan Harvard Referencing style. (20 marks)

Learning Objective 1: Explore programming functions to source, store and prepare data for machine learning applications.

Learning Objective 2: Design algorithmic models for the application of machine learning in information technology.

Learning Objective 3: Create advanced insights of strategic organisational value with the aid of machine learning.

You are required to follow the below guidelines:

You should write your Text Retrieval System code using Python 3 programming language.

The use of any Python third-party package(s) is restricted to the following tasks:
Loading the datasets. E.g., Pandas.
Any necessary text pre-processing steps. E.g., Natural Language Toolkit, etc.
Performing necessary calculations during the building of the system. E.g., NumPy.
Calculating the performance of the system. E.g., Scikit Learn, Matplotlib, Plotly, etc.

Reference no: EM133707674

Questions Cloud

Designing comprehensive and ethical framework : Designing a comprehensive and ethical framework for end-of-life care is undoubtedly difficult.
What contributed to the success of allies in world war ii : What contributed to the success of the allies in World War II?
How has immigration shaped the american story : Since 1877, how has immigration shaped the American story?
Victim of explosion is transported : A victim of an explosion is transported to the emergency department for evaluation and treatment.
Design algorithmic models for the application : Machine Learning Applications - Explore programming functions to source, store and prepare data for machine learning applications
What evidence expect to find from sexual assault examination : Write a paper detailing what evidence you might expect to find from the sexual assault examination and other aspects of the examination of the sexual assault vi
Discuss the crisis you experienced : Discuss the crisis you experienced (i.e., victim of a hostage situation). You should also use outside research as applicable.
How much has changed in american life in past half-century : What do these documents suggest about how much has changed in American life in the past half-century and how much has not changed?
Traumatic Brain Injury Model Systems : Does this make sense I will use the Traumatic Brain Injury Model Systems (TBIMS) National Database, the largest national longitudinal TBI database,

Reviews

Write a Review

Other Engineering Questions & Answers

  Characterization technology for nanomaterials

Calculate the reciprocal lattice of the body-centred cubic and Show that the reciprocal of the face-centred cubic (fcc) structure is itself a bcc structure.

  Calculate the gasoline savings

How much gasoline do vehicles with the following fuel efficiencies consume in one year? Calculate the gasoline savings, in gallons per year, created by the following two options. Show all your work, and draw boxes around your answers.

  Design and modelling of adsorption chromatography

Design and modelling of adsorption chromatography based on isotherm data

  Application of mechatronics engineering

Write an essay on Application of Mechatronics Engineering

  Growth chracteristics of the organism

To examine the relationship between fermenter design and operating conditions, oxygen transfer capability and microbial growth.

  Block diagram, system performance and responses

Questions based on Block Diagram, System Performance and Responses.

  Explain the difference in a technical performance measure

good understanding of Mil-Std-499 and Mil-Std-499A

  Electrode impedances

How did this procedure affect the signal observed from the electrode and the electrode impedances?

  Write a report on environmental companies

Write a report on environmental companies

  Scanning electron microscopy

Prepare a schematic diagram below of the major parts of the SEM

  Design a pumping and piping system

creating the pumping and piping system to supply cool water to the condenser

  A repulsive potential energy should be a positive one

Using the data provided on the webvista site in the file marked vdw.txt, try to develop a mathematical equation for the vdW potential we discussed in class, U(x), that best fits the data

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd