Reference no: EM132366962
Assignment: SDLC for human activity recognition project
Human activity recognition
This project aims to develop a human activity recognition IoT application to evaluate students’ knowledge in SDLC. This is an individual assignment. The project tasks shall be carried out individually.
Overview
The recognition of human activities has become a task of high interest for medical, military, and security applications. For instance, patients with diabetes, obesity, or heart disease are often required to follow a well-defined exercise routine as part of their treatments [5]. Therefore, recognizing activities such as walking, running, or cycling becomes quite useful to provide feedback to the caregiver about the patient’s behavior. Likewise, patients with dementia and other mental pathologies could be monitored to detect abnormal activities and thereby prevent undesirable consequences [6].
In such IoT applications, proper software engineering and data engineering are especially important to manage the software development life cycle and help make data useful for machine learning models. Many software engineers are primarily interested in aggregating raw data and making it into useful, ordered and structured data formats. A typical flowchart of sensor-based human activity recognition as shown in Figure 1.

This assignment involves the following subtasks:
1. Use Agile to manage this IoT application development (e.g., develop backlog, create sprint, and monitor the sprint progress). The backlog
and each sprint along with each week’s sprint progress burndown chart shall be recorded in the final submission document.
2. Based on the given workshop materials, create python code to load data and extract corresponding features from the given dataset.
3. Test and evaluate the two given machine learning models (KNN and SVM) and application in general and record the test results and
evaluation summary in the final submission document.
4. Refactor the source code according to the design pattern lecture and make the code easier to understand and extensible. The code shall
be managed by GitHub and will be reviewed for this along with GitHub version control history.
The sourcing data is from a public dataset (Dalia dataset [1], which contains 6 sensors’ data for 19 activities), refining that data and cleaning them up, and extracting significant features through statistical analysis for use in artificial intelligence and machine learning systems.
An example code is provided for reference. You may need to learn the use of Python libraries Numpy [2] and Pandas [3]. Machine learning modules using Scikit-learn [4] are given though having some understanding of them is recommended (we will only cover the basics of it to avoid course overlapping).