State of the Union code

Assignment Help:

1)  The average number of words per sentence over time

Do this two ways:

First, using a simple regex that splits on punctuation. Second, using the Natural Language Toolkit's (NLTK) sentence tokenizer.
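
The two approaches to item 1 could be sketched roughly as follows, assuming the intended statistic is the average number of words per sentence in one speech. The function names are hypothetical, the regex is only a rough heuristic (abbreviations such as "Mr." will be split incorrectly), and the NLTK path assumes the `nltk` package and its Punkt tokenizer data are installed.

```python
import re


def split_sentences_regex(text):
    """Split after ., ! or ? when followed by whitespace (a rough heuristic)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def split_sentences_nltk(text):
    """Split with NLTK's sentence tokenizer (assumes nltk and Punkt data are installed)."""
    # Imported lazily so the regex path works without NLTK.
    from nltk.tokenize import sent_tokenize
    return sent_tokenize(text)


def average_words_per_sentence(text, splitter=split_sentences_regex):
    """Return the mean number of words per sentence, or 0.0 for empty text."""
    sentences = splitter(text)
    if not sentences:
        return 0.0
    # A "word" here is a run of letters or apostrophes; this is an assumption.
    word_counts = [len(re.findall(r"[A-Za-z']+", s)) for s in sentences]
    return sum(word_counts) / len(sentences)
```

Passing `splitter=split_sentences_nltk` switches to the second approach without changing the rest of the calculation, which makes the two results easy to compare per speech.
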
2)  The average number of unique words per 100 words over time

    - remove common words (see the lecture slide for a list)

Do this two ways:

First: treat every distinct word form as unique, even when two forms share a stem (like run and running).

Second: using the stemming code from NLTK.
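
One possible reading of item 2 is "distinct words divided by total words, times 100, after common words are removed". The sketch below follows that reading. `COMMON_WORDS` is a short placeholder, not the list from the lecture slide, and the function names are hypothetical. The stemming variant assumes NLTK is installed and uses its `PorterStemmer`; other NLTK stemmers would work the same way.

```python
import re

# Placeholder only: the real list is on the lecture slide, which is not reproduced here.
COMMON_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "that", "we"}


def content_words(text):
    """Lowercase the text, extract words, and drop the common words."""
    words = re.findall(r"[a-z']+", text.lower())
    return [w for w in words if w not in COMMON_WORDS]


def unique_per_100(words):
    """Distinct words per 100 words; 0.0 for an empty list."""
    if not words:
        return 0.0
    return 100.0 * len(set(words)) / len(words)


def stem_words(words):
    """Reduce each word to its stem (assumes nltk is installed)."""
    # Imported lazily so the unstemmed path works without NLTK.
    from nltk.stem import PorterStemmer
    stemmer = PorterStemmer()
    return [stemmer.stem(w) for w in words]
```

The first approach is `unique_per_100(content_words(text))`; the second is `unique_per_100(stem_words(content_words(text)))`, which should give an equal or lower value because inflected forms collapse onto one stem.
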

Make sure that you sort the speeches by their date, and write the data into a text file with two columns: date and statistic.
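
The sorting and output step might look like the sketch below. It assumes each speech has already been reduced to a `(date, statistic)` pair, uses a tab between the two columns, and prints three decimal places; the column separator, the precision, and the function names are all assumptions, since the assignment only asks for two columns.

```python
from datetime import date


def format_rows(rows):
    """Return one 'date<TAB>statistic' line per row, sorted by date."""
    ordered = sorted(rows, key=lambda row: row[0])
    return [f"{when.isoformat()}\t{value:.3f}" for when, value in ordered]


def write_statistics(rows, path):
    """Write the sorted two-column table to a text file."""
    with open(path, "w", encoding="utf-8") as out:
        for line in format_rows(rows):
            out.write(line + "\n")
```

Keeping dates as `datetime.date` objects, rather than strings, means the sort is chronological regardless of how the dates were written in the source files.
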

EC) Build a word cloud, as seen in the slides from lectures 11 and 12
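
The lecture slides are not reproduced here, so the following is only a guess at the data a word cloud needs: a frequency count per word and a display size scaled to that count. The function names and the 10 to 60 point size range are hypothetical, and the actual drawing step is left out.

```python
import re
from collections import Counter


def word_frequencies(text, top_n=50):
    """Return the top_n (word, count) pairs, most frequent first."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words).most_common(top_n)


def font_sizes(frequencies, smallest=10, largest=60):
    """Map each word to a size scaled linearly by its count relative to the top count."""
    if not frequencies:
        return {}
    top = max(count for _, count in frequencies)
    return {
        word: smallest + (largest - smallest) * count / top
        for word, count in frequencies
    }
```

Removing common words before counting usually gives a more informative cloud, since otherwise words like "the" dominate.
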

Part 2: Collect an interesting web statistic.

Use urlweb and a website of your choice to collect a statistic. Write a one-paragraph description of your statistic at the top of the code file.

Examples include sports game data, weather statistics, name statistics, etc.
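
A minimal sketch of Part 2 follows. It assumes that "urlweb" refers to Python's standard `urllib` package, which is a guess, and it uses a deliberately simple statistic (the number of links on a page) purely as an illustration. The function names are hypothetical, and a regex is only a rough way to read HTML.

```python
# Statistic: the number of hyperlinks (<a href=...>) on a single web page,
# as a crude measure of how link-heavy the page is.
import re
from urllib.request import urlopen


def fetch_page(url, timeout=10):
    """Download a page and decode it using the declared charset, or UTF-8."""
    with urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def count_links(html):
    """Count anchor tags that carry an href attribute."""
    return len(re.findall(r"<a\s[^>]*href=", html, flags=re.IGNORECASE))
```

Usage would be `count_links(fetch_page(url))` for a URL of your choice. Separating the download from the parsing keeps the statistic testable on a saved copy of the page.
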



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd