Question

Write Python code to find the following in a text file (Letter).

(a) Find the 20 most common words
(b) How many unique words are used?
(c) How many words are used at least 5 times?
(d) Write the 200 most common words, and their counts, to a file.

text file:

Look in thy glass and tell the face thou viewest,
Now is the time that face should form another,
Whose fresh repair if now thou not renewest,
Thou dost beguile the world, unbless some mother.
For where is she so fair whose uneared womb
Disdains the tillage of thy husbandry?
Or who is he so fond will be the tomb,
Of his self-love to stop posterity?
Thou art thy mother's glass and she in thee
Calls back the lovely April of her prime,
So thou through windows of thine age shalt see,
Despite of wrinkles this thy golden time.
But if thou live remembered not to be,
Die single and thine image dies with thee.


4
Unthrifty loveliness why dost thou spend,
Upon thy self thy beauty's legacy?
Nature's bequest gives nothing but doth lend,
And being frank she lends to those are free:
Then beauteous niggard why dost thou abuse,
The bounteous largess given thee to give?
Profitless usurer why dost thou use
So great a sum of sums yet canst not live?
For having traffic with thy self alone,
Thou of thy self thy sweet self dost deceive,
Then how when nature calls thee to be gone,
What acceptable audit canst thou leave?
Thy unused beauty must be tombed with thee,
Which used lives th' executor to be.

5
Those hours that with gentle work did frame
The lovely gaze where every eye doth dwell
Will play the tyrants to the very same,
And that unfair which fairly doth excel:
For never-resting time leads summer on
To hideous winter and confounds him there,
Sap checked with frost and lusty leaves quite gone,
Beauty o'er-snowed and bareness every where:
Then were not summer's distillation left
A liquid prisoner pent in walls of glass,
Beauty's effect with beauty were bereft,
Nor it nor no remembrance what it was.
But flowers distilled though they with winter meet,
Leese but their show, their substance still lives sweet.

Solutions

Expert Solution

import collections

unique = 0
time5 = 0
wc = {}

# Read the whole text file; the context manager closes it automatically.
with open('file1.txt', encoding="utf8") as file:
    b = file.read()

# Count every word, ignoring case and stripping common punctuation.
for w in b.lower().split():
    w = w.replace(".", "")
    w = w.replace(",", "")
    w = w.replace(":", "")
    w = w.replace("\"", "")
    w = w.replace("!", "")
    # The next two literals were garbled ("â??") in the original post.
    # Assumption: they were curly quotation marks.
    w = w.replace("\u2019", "")
    w = w.replace("\u201c", "")
    w = w.replace("*", "")
    if w not in wc:
        wc[w] = 1
    else:
        wc[w] += 1

word_counter = collections.Counter(wc)

# (a) The n most common words (enter 20 at the prompt).
n_print = int(input("How many most appeared words to print: "))
print("\nOK. The {} most counted words in the txt file are as follows\n".format(n_print))
for w, c in word_counter.most_common(n_print):
    print(w, ": ", c)

# (b) NOTE: this counts words that occur exactly once. If "unique" is read
# as "distinct", the answer is len(word_counter) instead.
print("\nOK. all unique words are as follows\n")
for w, c in word_counter.most_common():
    if c == 1:
        print(w, ": ", c)
        unique = unique + 1
print("\nOK. Total number of unique words are ", unique)

# (c) Words used at least 5 times.
print("\nOK. all words with count more than 5 are as follows\n")
for w, c in word_counter.most_common():
    if c >= 5:
        print(w, ": ", c)
        time5 = time5 + 1
print("\nOK. Total number of words appeared equal or more than 5 times are ", time5)

# (d) Write the 200 most common words and their counts to a file.
print("\nOK. The 200 most common words are in the file com200.txt\n")
with open("com200.txt", "w") as f1:
    for w, c in word_counter.most_common(200):
        f1.write("{}: {}\n".format(w, c))
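
The replace() chain above leaves some punctuation attached to words (for example the `?` in `husbandry?`), and the `unique` counter tallies words that occur exactly once rather than distinct words. A minimal alternative sketch for parts (a) to (c), assuming a word is a run of letters with optional internal apostrophes or hyphens. The names `WORD_RE`, `tokenize` and `summarize` are hypothetical and not part of the original answer:

```python
import re
from collections import Counter

# Assumption: a word is a run of letters, optionally joined by
# internal apostrophes or hyphens (e.g. "mother's", "self-love").
WORD_RE = re.compile(r"[a-z]+(?:['-][a-z]+)*")


def tokenize(text):
    """Lower-case the text and return the list of words found in it."""
    return WORD_RE.findall(text.lower())


def summarize(text, top_n=20, min_count=5):
    """Return (top_n most common words, number of distinct words,
    number of words used at least min_count times)."""
    counts = Counter(tokenize(text))
    most_common = counts.most_common(top_n)
    distinct = len(counts)
    frequent = sum(1 for c in counts.values() if c >= min_count)
    return most_common, distinct, frequent


if __name__ == "__main__":
    sample = "Thou art thy mother's glass, and she in thee; thou, thou!"
    top, distinct, frequent = summarize(sample, top_n=3, min_count=2)
    print(top)       # three most common words with their counts
    print(distinct)  # number of distinct words
    print(frequent)  # number of words used at least twice
```

Because this tokenizer treats punctuation differently from the answer above, its counts on the same file may not match the recorded output below.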

Output:

How many most appeared words to print: 20

OK. The 20 most counted words in the txt file are as follows

thou : 11
thy : 10
the : 10
and : 8
of : 8
to : 8
with : 7
dost : 5
be : 5
thee : 5
not : 4
so : 4
but : 4
self : 4
in : 3
glass : 3
is : 3
time : 3
that : 3
for : 3

OK. all unique words are as follows

ok : 1
tell : 1
viewest : 1
should : 1
form : 1
another : 1
fresh : 1
repair : 1
renewest : 1
beguile : 1
world : 1
...

OK. Total number of unique words are 150

OK. all words with count more than 5 are as follows

thou : 11
thy : 10
the : 10
and : 8
of : 8
to : 8
with : 7
dost : 5
be : 5
thee : 5


OK. Total number of words appeared equal or more than 5 times are 10

OK. The 200 most common words are in the file:com200.txt
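
Several words in the listing above are tied (for example `thy` and `the` at 10). `Counter.most_common` keeps tied items in the order they were first counted, so the exact ranking depends on the input text. If a reproducible order is wanted, one option is to break ties alphabetically. A small sketch of that idea, where the helper name `ranked` is hypothetical:

```python
from collections import Counter


def ranked(counts, n=None):
    """Return (word, count) pairs sorted by descending count,
    with ties broken alphabetically by word."""
    items = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return items if n is None else items[:n]


if __name__ == "__main__":
    demo = Counter({"thy": 10, "the": 10, "thou": 11, "and": 8})
    print(ranked(demo, 3))  # [('thou', 11), ('the', 10), ('thy', 10)]
```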

