Demonstrate your pca analysis on the continuous features

Assignment Help Other Subject
Reference no: EM133849901

Principal Component Analysis

TASK

For your video presentation, you must demonstrate your PCA analysis on the continuous features of the WACY-COM dataset and interpret the results. Submit the recording via the Panopto link on Canvas. Please ensure you follow the instructions carefully.

Perform PCA and Visualise Data
First, copy the code below to a R script. Enter your student ID into the command set.seed(.) and run the whole code. The code will create a sub-sample of 400 that is unique to you.

Extract only the continuous features and the APT feature from the WACY-COM dataset and store them as a data frame/tibble. Refer to Assignment 1 for the feature description if needed. Get top assignment help at pocket friendly prices!

Clean the extracted data based on the feedback received from Assignment 1.

Remove the incomplete cases to make it usable in "R" for PCA.

Perform PCA using prcomp(.) in R, but only on the numeric features (i.e. ignore APT in this step).

Explain why you believe the data should or should not be scaled, i.e. standardised, when performing PCA.

Display and describe the individual and cumulative proportions of variance (3 decimal places) explained by each of the principal components.
Outline how many principal components are adequate to explain at least 50% of the variability in your data.

Display and interpret the coefficients (or loadings) to 3 decimal places for PC1, PC2 and PC3. Describe which features (based on the loadings) are the key drivers for each of these three principal components.

Create and display the biplot for PC1 vs. PC2 to visualise the PCA results in the first two dimensions. Colour-code the points based on the APT feature. Explain the biplot by commenting on the PCA plot and the loadings plot individually, and then both plots combined (see Slides 28-29 of Module 3 notes). Finally, comment on and justify which (if any) features can help distinguish APT activity.

Based on the results from parts (v) and (vi), describe whether PC1 or PC2 (choose one) best assists in classifying APT. Hint: Project all points in the PCA plot onto the PC1 axis (i.e. consider the PC1 scores only) and assess whether there is a clear separation between known and unknown APT actors. Then, project onto the PC2 axis (i.e. consider the PC2 scores only) and evaluate whether the separation is better than in PC1. You can access the PCA scores for PC1 and PC2 via mypca$x, assuming mypca contains your PCA results from prcomp(.).
the key features in this dimension that can drive this process (Hint: based on your decision above, examine the loadings from part (v) of your chosen PC and choose those whose absolute loading (i.e. disregard the sign) is greater than 0.3).

Video Presentation Checklist
In your video presentation, you must
Run your code corresponding to parts (i) to (vii) above
Display the relevant output
Interpret the output

Your video presentation must include a camera shot of yourself in the video capture, unless there is an exceptional reason and is supported by a Learning Assessment Plan (LAP). 20% is automatically deducted from your final mark if this is not included in your video presentation. If you choose to record with another application, you must make sure that this feature is included.

Your video presentation must be between 4-5 minutes long.

Reference no: EM133849901

Questions Cloud

Summarize leadership style : Your definition of team and what makes a team function successfully. For applicants of Director level positions (or above) summarize your leadership style.
What is the best immediate action : As Geno's condition progresses, he exhibits difficulty breathing. What is the best immediate action?
Determine order of prioritization of the nursing diagnoses : How did you determine the order of prioritization of the nursing diagnoses? Which interventions can you delegate to an unlicensed person?
How are students with the disability identified : How are students with the disability identified? What and how are students with this disability taught? List citations and references.
Demonstrate your pca analysis on the continuous features : Principal Component Analysis - demonstrate your PCA analysis on the continuous features of the WACY-COM dataset and interpret the results.
Explain two ways in which materials handling systems : Identify, define and explain two ways in which materials handling systems impact the efficiency and effectiveness of the warehouse?
Non-verbal communication : Give two examples of non-verbal communication and two examples of verbal communication. Give an example of a communication barrier.
How does program address racial and economic disparities : How does your program address racial and economic disparities that are impacting your target populations' healthcare needs?
Which statement defines integrated product support element : Which statement defines the integrated product support element, support equipment?

Reviews

len3849901

4/3/2025 2:10:26 AM

this is the assignment 2. extended work from assignment 1. need to do it and make a video also. possible to get it with video also ? i will replicate the video later but must need to have with video but make sure i receive 1 recorded video with report

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd