Text mining, Database Management System

Text Processing:

Use readLines to read SOU.txt into R. Create a vector called Pres containing the names of the presidents giving each speech. To do this, first identify the lines containing this information, then use the tagging and back-referencing strategy we covered in class. Remove any whitespace at the beginning or end of the strings.
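
A minimal sketch of this step follows. The layout of SOU.txt is not given in the question, so the code assumes a hypothetical one: each speech opens with a line of three asterisks, and the president's name sits three lines below it. The vector sou is a made-up stand-in for the result of readLines("SOU.txt"); adjust the delimiter pattern and the offset to match the real file.

```r
# Made-up stand-in for: sou <- readLines("SOU.txt")
# ASSUMPTION: speeches are delimited by "***" lines and the name is 3 lines below.
sou <- c(
  "***", "", "State of the Union Address", "  George Washington  ",
  "January 8, 1790", "", "Placeholder speech text.",
  "***", "", "State of the Union Address", " John Adams",
  "November 22, 1797", "", "More placeholder speech text.",
  "***"
)

# Locate the delimiter lines; the last one is assumed to close the file.
delim  <- grep("^\\*\\*\\*", sou)
starts <- delim[-length(delim)]

# Lines that hold the names under the assumed layout.
name.lines <- sou[starts + 3]

# Tag the name with parentheses and keep only that part via the
# back-reference \\1, which drops leading and trailing whitespace.
Pres <- sub("^[[:space:]]*(.*[^[:space:]])[[:space:]]*$", "\\1", name.lines)
print(Pres)
```

The tagged group must end in a non-space character, so trailing blanks fall outside the parentheses and are discarded by the replacement.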

Create an empty list using the command:

speech.words <- vector("list", length(Pres))

Note that length(Pres) is the total number of speeches. Now loop over the speeches and fill in the elements of the list as follows. Each element in the list should be a character vector, where each element of the vector is a word in the speech. Hint: For a given speech (one iteration of the loop), first put the text of the speech into one long character vector (where in relation to the delimiters does it start and stop?), then use the function strsplit to break it up. There are more careful ways to do this, but you can consider "word characters" to consist only of letters, so that what defines the breaks between words is one or more "non-word characters".
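
The loop described in the hint could be sketched as below. As before, sou is a made-up stand-in for readLines("SOU.txt"), and the layout is an assumption: "***" delimiter lines, with the speech body starting six lines after the opening delimiter and ending just before the next one. The name n.speech is introduced here for illustration; under this layout it equals length(Pres).

```r
# Made-up stand-in for: sou <- readLines("SOU.txt")
# ASSUMPTION: "***" delimits speeches; the body starts 6 lines after a delimiter.
sou <- c(
  "***", "", "State of the Union Address", "George Washington",
  "January 8, 1790", "",
  "Placeholder text, line one.", "More placeholder text; line two!",
  "***", "", "State of the Union Address", "John Adams",
  "November 22, 1797", "",
  "Another made-up speech.",
  "***"
)

delim    <- grep("^\\*\\*\\*", sou)
n.speech <- length(delim) - 1          # equals length(Pres) under this layout
speech.words <- vector("list", n.speech)

for (i in seq_len(n.speech)) {
  # Lines of speech i: after the header block, up to the next delimiter.
  body <- sou[(delim[i] + 6):(delim[i + 1] - 1)]
  # Collapse the lines into one long string.
  text <- paste(body, collapse = " ")
  # Split on runs of one or more non-letter characters.
  words <- strsplit(text, "[^[:alpha:]]+")[[1]]
  # Drop the empty string that appears if the text starts with a non-letter.
  speech.words[[i]] <- words[words != ""]
}

print(speech.words[[2]])
```

Splitting on non-letters also breaks hyphenated words and contractions apart ("made-up" becomes "made" and "up"), which is the simplification the hint allows.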

Posted Date: 2/23/2013 1:51:10 AM | Location : United States

Related Questions
Define the categories of SQL statements. 1) Statements that don't return a result set. This statement class includes INSERT, DELETE, and UPDATE. 2) Statements that return

The Proliferation of Object-Oriented Methods: The first few years of the 1990s saw the blossoming of around fifty different object-oriented methods. This proliferation is a sign of g

Evaluate, compare and contrast the two methods of relational database design, namely the top-down method and the bottom-up method. What will be your choice if you are asked to select a meth

Construct a generalization-specialization hierarchy for a motor-vehicle sales company. The company sells motorcycles that comprise an engine number and cost; cars that have a chass


What is the use of the rename operation? The rename operation is used to rename both relations and attributes. It uses the as clause, taking the form: old-name as new-name.

What are the categories of SQL commands? SQL commands are separated into the following categories: 1. Data Definition Language 2. Data Manipulation Language 3. Dat

What is another name for a weak entity? Child entity is another name for a weak entity.

What is OLTP (Online Transaction Processing)? In OLTP (online transaction processing) systems, relational database design uses the discipline of data modeling and generally follo

What is known as the fudge factor? The number of partitions is increased by a small value called the fudge factor, which is usually 20 percent of the number of hash partitions compu