State of the union code, Other Subject

Assignment Help:

1)  The average number of words per sentence over time

Do this two ways:

First, using a simple regex that splits on punctuation. Second, using the Natural Language Toolkit's (NLTK) sentence tokenizer.

2)  The average number of unique words per 100 words over time
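
The two splitting approaches could be sketched roughly as follows. This is a minimal sketch, assuming each speech is already loaded as a plain string. The function names are hypothetical, and the NLTK variant assumes `nltk` is installed and its Punkt sentence data has been downloaded. The helper reports words per sentence; take the reciprocal if the ratio is wanted the other way around.

```python
import re

def split_sentences_regex(text):
    """Split on runs of '.', '!' or '?' followed by whitespace or end of text."""
    parts = re.split(r"[.!?]+(?:\s+|$)", text)
    return [p.strip() for p in parts if p.strip()]

def split_sentences_nltk(text):
    """Split with NLTK's sentence tokenizer (third-party; needs Punkt data)."""
    from nltk.tokenize import sent_tokenize  # imported lazily on purpose
    return sent_tokenize(text)

def average_words_per_sentence(sentences):
    """Mean number of whitespace-separated words per sentence."""
    if not sentences:
        return 0.0
    return sum(len(s.split()) for s in sentences) / len(sentences)
```

The regex version also splits after abbreviations such as "Mr.", which is one reason its numbers are likely to differ from the NLTK tokenizer's.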

    - remove common words (see the lecture slide for a list)

Do this two ways:

First: assume that different words are all unique, even if they share a stem (like run and running).

Second: using the stemming code from NLTK.
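
One way to compute this statistic both ways is to make the stemmer an optional argument. This is only a sketch under stated assumptions: the `STOP_WORDS` set is a placeholder (the real list is on the lecture slide), the names `tokenize` and `unique_per_100` are hypothetical, and the 100-word rate is taken over the words that remain after the common words are removed.

```python
import re

# Placeholder list; substitute the common words from the lecture slide.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "that", "it"}

def tokenize(text):
    """Lowercase the text and extract alphabetic words (inner apostrophes kept)."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())

def unique_per_100(text, stem=None, stop_words=STOP_WORDS):
    """Unique words per 100 words, after dropping stop words.

    stem: optional callable that maps a word to its stem.
    """
    words = [w for w in tokenize(text) if w not in stop_words]
    if not words:
        return 0.0
    if stem is not None:
        words = [stem(w) for w in words]
    return 100.0 * len(set(words)) / len(words)
```

For the first variant call `unique_per_100(text)`. For the second, pass a stemmer, for example `stem=PorterStemmer().stem` with `PorterStemmer` imported from `nltk.stem`.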

Make sure that you sort the speeches by their date, and write the data into a text file with two columns: date and statistic.
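
Sorting and writing the output might look like the sketch below. It assumes each result is a `(date, statistic)` pair with a `datetime.date` and a number; the function name `write_stats`, the tab separator, and the three-decimal format are illustrative choices, not requirements from the assignment.

```python
from datetime import date

def write_stats(rows, path):
    """Write (date, statistic) pairs to a two-column text file, oldest first."""
    with open(path, "w", encoding="utf-8") as out:
        for day, value in sorted(rows, key=lambda row: row[0]):
            # ISO dates (YYYY-MM-DD) keep the file sortable as plain text.
            out.write(f"{day.isoformat()}\t{value:.3f}\n")
```

For example, `write_stats([(date(2010, 1, 27), 21.5), (date(1790, 1, 8), 35.25)], "stats.txt")` writes the 1790 row before the 2010 row.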

Extra credit (EC): Build a word cloud, as seen in the slides from lectures 11 and 12.

Part 2: Collect an interesting web statistic.

Use urlweb and a website of your choice to collect a statistic. Write a one-paragraph description of your statistic at the top of the code file.

Examples include sports game data, weather statistics, name statistics, etc.
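
A collection script could be laid out roughly as below. This is a hedged sketch: it guesses that "urlweb" means Python's standard `urllib` module, and the statistic, the `<span class="temp">` markup, and the function names are all made up for illustration. Real pages will need their own pattern or an HTML parser.

```python
"""Hypothetical statistic: the current temperature shown on a weather page.

This top-of-file paragraph is where the one-paragraph description of the
chosen statistic would go.
"""
import re
from urllib.request import urlopen

def fetch_page(url, timeout=10):
    """Download a page and decode it as UTF-8 (undecodable bytes replaced)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")

def extract_temperature(html):
    """Return the first number inside an assumed <span class="temp"> tag."""
    match = re.search(r'<span class="temp">\s*(-?\d+(?:\.\d+)?)', html)
    return float(match.group(1)) if match else None
```

Keeping the download and the parsing in separate functions lets the parsing step be checked on a saved copy of the page without hitting the network.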

