Define about build a neural network

Assignment Help MATLAB Programming
Reference no: EM13855349

Plagiarism is the submission of somebody else's work in a manner that gives the impression that the work is your own. The Department of Computer Science and Information Technology at La Trobe University treats plagiarism very seriously. When it is detected, penalties are strictly imposed.

1. In this question, we are going to build a neural network (NN) classifier to predict red wine quality (represented by an integer ranging from 0 to 10, higher means better) using a set of chemical properties. These properties are presented as attributes below:

fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulfur dioxide, total sulfur dioxide, density, pH, sulphates, alcohol

The last attribute quality is the class label.
The dataset needs to be split into training and testing datasets. Download the program "DataSplit2.exe" and execute it. Enter your student ID, specify the locations of the red wine dataset file, and the destination folder.

The dataset will be split for you by clicking the "OK" button. Note that your training and testing datasets are unique to others. Make sure the student ID is entered correctly. You are required to submit both training and testing datasets generated, or no marks will be given to the answer of this question otherwise.

a. Load both datasets into the MATLAB workspace. It is recommended to separate the class label (i.e. the attribute quality) from other attributes such that all the class labels of a dataset are stored in a matrix. As a result, there are four matrices after the import process, two for the attribute values from both datasets, and the other two for the class labels.

The class labels require encoding before they can be used for training and evaluating the NN classifier. Since there are 11 distinct class values (0 - 10), each class label is encoded into a column vector of 11 × 1. For a class value k, the k + 1 th row of the column vector is set to 1, while the others are zero. For example, if the class label is 4, then it is encoded into a column vector:
0
F 0 1
I 0 I
I 0 I
I 1 I
I 0 I
I 0 I
I 0 I
I 0 I
I 0 I
L 0 I

Therefore, if the dataset has N samples, then the class labels are encoded into an 11 × N matrix.
Implement this encoding as a MATLAB function. The function source codes are submitted as a MATLAB function file. (.m file). In your written answer, specify clearly what input argument(s) is/are expected, and the expected return from this function. (2 marks)

b. The NN classifier is created using the following parameters: Number of hidden layers: 1
Number of neurons: 10

Use default settings for other parameters. Train the classifier using the training dataset. Show the training performance by pasting the performance curve in your answer. Submit your MATLAB script file for this training.
Hint: Check carefully the dimension arrangement of the NN classifier, i.e. whether it considers a row or a column as a tuple.

c. Use the NN classifier to predict the qualities of the samples in the testing dataset. Obtain and show the confusion matrix. What is the accuracy of the classifier? Submit your MATLAB script file for this testing and evaluation.

Please submit your MATLAB source codes for parts (a) - (c) in separate MATLAB function/script files. No marks will be given to your answer unless the relevant source codes are submitted. Remember to submit the training and testing datasets as well.

2. We are going to mine some association rules from the supermarket transactions using WEKA.

Download the program "TransactionDataGenerator.exe" and execute it. Enter your student ID and specify the location of destination folder. The dataset will be generated for you by clicking the "OK" button. A transaction file will then be generated in CSV format. Each line row represents a single transaction, the first item is the transaction ID and the others are the goods bought by the customer. You are required to submit the generated transaction dataset, or no marks will be given to the answer of this question otherwise.

a. The transaction file generated must be converted to an attribute format (see appendix) that can be imported to WEKA. For example, a transaction file consists of five transactions as follows:

T001, jam
T002, bread, jam T003, bread, butter T004, jam
T005, bread

The converted format is shown below:

t_id

bread

butter

jam

T001

 

 

t

T002

t

 

t

T003

t

t

 

T004

 

 

t

T005

t

 

 

The converted transactions can be saved in CSV format. The content of the above converted format in CSV is like this:

tid,bread,butter,jam T001,,,t
T002,t,,t
T003,t,t,
T004,,,t
T005,t,,

Write a MATLAB conversion program for this task. Submit your MATLAB script file for this conversion, or no marks will be given to this part otherwise. The list of all items is available at the Appendix.

Hints:
i. Since the transactions consist of different number of items, it is recommended to read the whole transaction as a string, i.e. all the N transactions are put in an N × 1 cell array. You may find functions such as textscan or importdata useful.

ii. Following (i), it is then necessary to separate the transactions Id and every item in a single transaction. The delimiter is a comma (","). You may find the regular expression function regexp useful.

iii. A transaction schema (i.e. all possible transaction items in the header line of the above converted format) is needed. You transactions might not cover all the items, but this does not affect the final results.

iv. The transaction schema should be implemented as an array in your source codes. Also, the item order in the array should be identical to the item order in the header line. This helps determining which column to put a ‘t' label for a transaction. You may find the function ismember useful.

b. Mine the association rules from the transactions using WEKA. Specify which algorithm you select and the related parameters such as minimum support and confidence. List the best 10 rules discovered with highest possible support and confidence.

c. Suggest a potential problem you might have when inspecting the association rules mining results.

3. A training dataset is provided as follows:

Weather outlook

Temperature

Wind

Sports

Sunny

20

Strong

Outdoor

Cloudy

7

Weak

Indoor

Cloudy

15

Mild

Outdoor

Sunny

33

Mild

Outdoor

Rainy

10

Mild

Indoor

Cloudy

27

Weak

Outdoor

Rainy

15

Strong

Indoor

Sunny

9

Mild

Outdoor

Sunny

30

Strong

Indoor

Rainy

25

Weak

Outdoor

The class label is sports. Predict the class labels (i.e. play indoor sports or outdoor sports) for the following 4 tuples (a - d) using Naïve Bayesian classification. Show your calculations.

 

Weather outlook

Temperature

Wind

a

Sunny

32

Strong

b

Rainy

28

Mild

c

Cloudy

10

Weak

d

Sunny

16

Mild

1. a. Describe minimum spanning tree (MST) in hierarchical clustering and illustrate its construction using at least five unique 2D data points (e.g. (2, 1), (3, 3), etc.).

b. Suggest a way to generate MST from a set of data points without using the MST building algorithm in the lecture notes. Explain why it is so

(Hint: An alternative way has been covered in the lecture notes)


Attachment:- New WinRAR archive.rar

Reference no: EM13855349

Questions Cloud

Is gdp the best measure of growth? : Is GDP the best measure of growth?
Baseball complex construction project : Baseball Complex Construction Project-The final project for this course is the creation of a complete Project Plan, consisting of a project topic, project charter, seven subsidiary project management plans, and a Final Report
Identify the minimum risk portfolio : Explain in your own words why the risk of a portfolio is often measured by the standard deviation of past returns on that portfolio. Based on holding periods during this time period for up to 10 years, are stocks ever less risky than bonds are bills
Descriptions of and corporate social responsibility issues : Full descriptions of any ethical and corporate social responsibility issues that you see, to include an identification of stakeholders and how they are implicated by the facts in this problem
Define about build a neural network : build a neural network
Evidence of a faulty planning process : 1. In this case do you see evidence of a faulty planning process? Explain why or why not.
Proposal for a dissertation in a management related field : Demonstrate knowledge of relevant literature by identifying key debates: concepts and theories within the topic area and how the literature will be utilised to inform your research.
Is derivatives program at link technologies reducing risk : Is the derivatives program at Link Technologies reducing risk? Are Ms. Cohen's arguments correct, or is the program performing as expected?
How quickly climate change occurs and the locations : After reading "Global Trends 2025: A Transformed World" choose one "Relative Certainty" and one "Key Uncertainty" and express your opinion on the topic and whether or not you agree. How quickly climate change occurs and the locations where its impa..

Reviews

Write a Review

MATLAB Programming Questions & Answers

  Fingerprint watermarking techniques

Need to investigate the best method to embed watermark image into fingerprint image.

  Find the corresponding equilibrium value for the air speed

Find the corresponding equilibrium value for the airspeed. There are several solutions for the airspeed, we take the higher value and Find the two operating points in terms of Pm and other variables - Find the linearized system near each of the oper..

  Coefficients of the least squares fit

Write a Sa/Mkrus function that takes as its input the data and plots the data points and the least squares linear fit to the data. The function should also return the coefficients of the least squares fit.

  Generate a sphere of diameter 3. create 3 vectors

Generate a sphere of diameter 3. Create 3 vectors representing the translation of this sphere along the x, y, and z axes. Generate the correct vectors given the description below: The sphere should be translated to (-10, -10, -10).

  Write a program that examines student in basic arithmetic

Write a program that examines the student in the basic arithmetic operations (summation, subtraction and multiplications).

  Recognition of colour

MATLAB for recognition of colour randomly using webcam realtime for RGB without external trigger in GUI.

  Image segmentation by matlab hi therewhat i need in this

hi there ltbrgt ltbrgtwhat i need in this order is that quotimage segmentationquot. choose any two obvious photos and

  Program the rank order cluster algorithm with a matrix

Program the rank order cluster algorithm with a matrix with x rows and x columns in any plataform (that the person inputs). I prefer visual studio or visual basic

  Use regression algorithms

The proposal which are two pages and here is the demands - Use Regression Algorithms or any type to achieve the target In Data Mining matter dealing with E-Learning Students' Data.

  Matlab function to perform gauss-jordan elimination

Write a matlab function to perform gauss-jordan elimination with pivoting. Modify the pivoting so that it is using the row with the highest absolute value rather than the first non-zero row.

  Consider the predator-prey models

Consider the predator-prey models developed early part of the 20 th  century in which the number of predators and preys may be predicted using the pair of ODEs

  Design a controller which regulates flow

Design a controller which regulates flow and compensate pressure to my desire value in simscape.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd