Exploring the potential of natural language processing

Reference no: EM131051755

For Reading Purposes.

EXPLORING THE POTENTIAL OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING IN CHILD LANGUAGE DISORDERS DIAGNOSIS

I. Background

Language development is a critical aspect of neurodevelopment in early childhood, particularly for cognitive and social development. Language disorders are among the major developmental conditions observed in young children. Early detection of language disorders in children is of paramount importance for providing effective treatment and preventing further deterioration. The goal of this project is to explore the potential of natural language processing and machine learning techniques for developing a computational model that could be used for the early diagnosis of language disorders in children.

As part of the assessment of language development or the diagnosis of language disorders amongst young children, analysis of language samples in the form of transcribed narratives is essential for uncovering the linguistic phenomena they exhibit, ranging from morphology to syntax to semantics. These phenomena can serve as indicative markers of a language disorder. For instance, vocabulary richness (such as the mean length of utterance and the total number of different words used) and grammatical errors (such as omission of tense marking and subject-verb or determiner-noun disagreement) were found to be useful measures (Solorio, 2013).
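The two vocabulary measures named above can be illustrated with a minimal Python sketch. It assumes that utterances arrive as plain strings and that length is counted in words; clinical practice often counts morphemes instead. The function names and the sample utterances are invented for illustration and are not taken from the conti-4 data.

```python
import re

def tokenize(utterance):
    """Lowercase an utterance and split it into word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", utterance.lower())

def vocabulary_measures(utterances):
    """Return (mean length of utterance in words, number of different words)."""
    tokenised = [tokenize(u) for u in utterances]
    tokenised = [t for t in tokenised if t]  # skip utterances with no word tokens
    if not tokenised:
        return 0.0, 0
    total_words = sum(len(t) for t in tokenised)
    mlu = total_words / len(tokenised)
    ndw = len({word for t in tokenised for word in t})
    return mlu, ndw

# Invented sample utterances, for illustration only.
sample = ["the dog run away", "he is looking for the frog", "frog gone"]
mlu, ndw = vocabulary_measures(sample)
print(mlu, ndw)  # 12 words over 3 utterances, 10 distinct word forms
```

Counting in words keeps the sketch free of a morphological analyser; a morpheme-based count would need the morphological tier of the transcripts.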

II. Problem Statement

It is hypothesised that various natural language processing (NLP) techniques could be applicable. By training a machine learning algorithm on the indicative linguistic patterns mentioned earlier, a computational model could be deployed as a screening measure in the diagnostic assessment of child language disorders. For the purpose of this project, the specific language disorder explored is Specific Language Impairment (SLI), a widely studied neurodevelopmental condition amongst young children (Leonard, 1991).

III. Main Objectives

The project aims to explore both NLP and machine learning techniques to discover indicative linguistic features that can be useful in identifying children with language disorders. The key questions to be addressed are:

Can children with language disorders be distinguished from children with normal language development through text-based classification?

Which linguistic patterns, ranging from morphology to syntax to semantics, are indicators of a language disorder?

Which machine learning model is best at distinguishing children with language disorders from their counterparts?

Requirements Task:

1. Preprocessing data from conti-4: cleaning the raw dataset into a structured form for subsequent processing and analysis.

To access the conti-4 data online, go to https://childes.psy.cmu.edu/ and, under Database, click "Transcripts and media - XML". Then click "Browse database". On the left side select CLINICAL-MOR, then select conti4 → SLI-narrative.

2. Perform NLP tasks (tagging and parsing).

3. Multiple feature extraction, such as n-grams (word-based or POS-based), grammar rules, and other features.

4. Basic Classification and Evaluation

5. Feature Selection and more classification
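Task 1 in the list above could start from something like the following Python sketch, which pulls the child's utterances out of an XML transcript into a list of token lists. The element names `u` and `w`, the `who` attribute, the `CHI` speaker code, and the namespace are assumptions modelled on the TalkBank XML layout and should be checked against the actual conti-4 files. The toy transcript is invented so that the example runs without the dataset.

```python
import xml.etree.ElementTree as ET

# Invented toy transcript. Tag names, the "who" attribute and the namespace
# are assumptions about the file layout, not verified against conti-4.
TOY_XML = """<CHAT xmlns="http://www.talkbank.org/ns/talkbank">
  <u who="CHI" uID="u0"><w>the</w><w>boy</w><w>falled</w></u>
  <u who="INV" uID="u1"><w>what</w><w>happened</w></u>
  <u who="CHI" uID="u2"><w>he</w><w>look</w><w>for</w><w>frog</w></u>
</CHAT>"""

NS = {"tb": "http://www.talkbank.org/ns/talkbank"}

def child_utterances(xml_text, speaker="CHI"):
    """Return one list of lowercased word tokens per utterance by `speaker`."""
    root = ET.fromstring(xml_text)
    rows = []
    for utterance in root.findall("tb:u", NS):
        if utterance.get("who") != speaker:
            continue  # drop investigator and other non-child turns
        words = [
            w.text.strip().lower()
            for w in utterance.findall(".//tb:w", NS)
            if w.text and w.text.strip()
        ]
        if words:
            rows.append(words)
    return rows

rows = child_utterances(TOY_XML)
print(rows)
```

NLTK also ships a CHILDES corpus reader that may be a better fit once the files are downloaded; the standard-library version is used here only to keep the sketch self-contained.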

I will be carrying out this assignment in Python 3, using the NLTK toolkit for language processing and a machine learning toolkit. An algorithm has to be implemented for the machine to learn.
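The brief does not name the learning algorithm. As one possible illustration, the sketch below implements a small multinomial Naive Bayes classifier with add-one smoothing in plain Python. The class name, the labels `SLI` and `TD` (typically developing), and the training samples are all assumptions made up for this example.

```python
import math
from collections import Counter, defaultdict

class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing over feature counts."""

    def fit(self, samples, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for feats, label in zip(samples, labels):
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, feats):
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)  # log prior
            for f in feats:
                if f in self.vocab:  # ignore features never seen in training
                    score += math.log((counts[f] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label

# Invented training data: each sample is a list of features (here, words).
train_samples = [
    ["he", "go", "home"],
    ["dog", "run", "away"],
    ["he", "goes", "home"],
    ["the", "dog", "runs", "away"],
]
train_labels = ["SLI", "SLI", "TD", "TD"]

model = NaiveBayes().fit(train_samples, train_labels)
print(model.predict(["he", "go"]))
print(model.predict(["she", "goes", "away"]))
```

NLTK's own `NaiveBayesClassifier` could be used instead; writing the algorithm out makes the learning step explicit.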

This assignment requires the expert to produce a program based on the task requirements described in the PDF file "Description Task". I would like to know the expert's approach to this assignment.

The task can be separated into two sections. The first section comprises task requirements 1, 2 and 3, which cover preprocessing, tagging, parsing, and feature extraction.
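For the feature extraction part of this first section, word-based and POS-based n-grams could be collected along the lines of the sketch below. It assumes each utterance is already a list of (word, tag) pairs; the Penn Treebank style tags in the example were assigned by hand, whereas in practice they might come from a tagger such as `nltk.pos_tag`. The function names and the `w:` / `pos:` feature prefixes are illustrative choices.

```python
from collections import Counter

def ngrams(sequence, n):
    """Return all contiguous n-grams of `sequence` as tuples."""
    return [tuple(sequence[i:i + n]) for i in range(len(sequence) - n + 1)]

def ngram_features(tagged_utterance, n=2):
    """Count word n-grams and POS n-grams for one tagged utterance."""
    words = [word for word, _ in tagged_utterance]
    tags = [tag for _, tag in tagged_utterance]
    feats = Counter()
    for gram in ngrams(words, n):
        feats["w:" + " ".join(gram)] += 1
    for gram in ngrams(tags, n):
        feats["pos:" + " ".join(gram)] += 1
    return feats

# Hand-tagged example utterance, invented for illustration.
tagged = [("he", "PRP"), ("look", "VB"), ("for", "IN"), ("frog", "NN")]
features = ngram_features(tagged)
print(sorted(features))
```

POS n-grams abstract away from individual words, so a pattern such as a pronoun followed by an uninflected verb can be counted regardless of which verb the child used.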

I would then like the same expert to continue with the remaining tasks: basic classification and evaluation, and feature selection.
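As a rough illustration of these remaining tasks, the sketch below computes precision, recall and F1 for one class and ranks features by how differently they are distributed across the two groups. The selection criterion (gap in document frequency) is a simple stand-in chosen for this example, not a method prescribed by the brief; the labels `SLI` and `TD` and all data are invented.

```python
from collections import Counter

def precision_recall_f1(gold, predicted, positive="SLI"):
    """Precision, recall and F1 for the `positive` class."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

def select_features(samples, labels, positive="SLI", k=2):
    """Keep the k features whose document frequency differs most between classes."""
    pos_df, neg_df = Counter(), Counter()
    n_pos = n_neg = 0
    for feats, label in zip(samples, labels):
        if label == positive:
            pos_df.update(set(feats))
            n_pos += 1
        else:
            neg_df.update(set(feats))
            n_neg += 1

    def gap(feature):
        return abs(pos_df[feature] / max(n_pos, 1)
                   - neg_df[feature] / max(n_neg, 1))

    # Sort by descending gap, breaking ties alphabetically for stable output.
    ranked = sorted(set(pos_df) | set(neg_df), key=lambda f: (-gap(f), f))
    return ranked[:k]

# Invented labels and samples, for illustration only.
gold = ["SLI", "SLI", "TD", "TD"]
predicted = ["SLI", "TD", "TD", "SLI"]
scores = precision_recall_f1(gold, predicted)

samples = [["he", "go"], ["he", "run"], ["he", "goes"], ["he", "runs"]]
selected = select_features(samples, gold, k=2)
print(scores, selected)
```

With a dataset as small as a single clinical corpus, scores are better estimated with cross-validation than with one train/test split, and feature selection should be repeated inside each fold so that test data does not leak into the choice of features.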

Attachment:- Conti4.zip




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd