Develop code in a Scala Maven project to monitor a folder in HDFS in real time such that any new file in the folder will be processed. For each RDD in the stream, the following subtasks are performed concurrently:
(a) Count the word frequency and save the output in HDFS.Note, for each word, make sure space (" "), comma (","), semicolon (";"), colon (":"), period ("."), apostrophe (“’”), quotation marks (“””), exclamation (“!”), question mark (“?”), and brackets ("[", “{”, “(”, “<”,"]", “)”, “}”,”>” ) are trimmed.
(b) Filter out the short words (i.e., < 5 characters) and save the output in HDFS.
(c) Count the co-occurrence of words in each RDD where the context is the same line; and save the output in HDFS.
In: Computer Science
The following data represents number of pictures an individual has posted on Facebook (X) and their level of narcissism(Y) as measured on a 10-point scale, 10 being highest.
| x | y |
| 14 | 3 |
| 26 | 2 |
| 73 | 7 |
| 63 | 6 |
| 64 | 5 |
| 71 | 6 |
| 8 | 1 |
| 91 | 10 |
| 15 | 2 |
| 15 | 1 |
Analyze this data using both the correlational method as well as regression via StatsCrunch. Write the data up using APA guidelines in MS Word. Submit both your output from StatsCrunch as well as your Word document
In: Statistics and Probability
In: Computer Science
1. Explain what the loop is doing and what is the final value of AX after this code has been executed?
.data
word1 WORD 100h, 200h, 300h, 400h, 500h
.code
mov ESI, OFFSET [word1+8]
mov ECX, 2
mov EAX, 0
L1:
mov AX, [ESI]
add AX, 20h
sub ESI, 4
LOOP L1
2. Use the XCHG instruction to reorder the array from [134Fh, 2EA6H, 1234h, F0F0h] to [2EA6H, F0F0h, 1234h, 134Fh]: (Direct Addressing)
.data
array1 WORD 134Fh, 2EA6H, 1234h, F0F0h
.code
In: Computer Science
You are a network administrator in a medium-sized company. The owner has heard of using RIP and OSPF on the routers and that it will help your network. He isn't very computer savvy, so he wants you to explain it to him and tell him what to use and how you are going to implement it. Write a memo using standard memo format that outlines this for your company's owner using either a Microsoft Word Memo template or another example, including diagrams if necessary. You can use any drawing software (Visio, PowerPoint, Word, etc.) to create the diagram.
In: Computer Science
Write a report Approximately 1000 word report (800-1300 words excluding Conclusion) on ONE of the following topics:
1. Quantum Cryptography
2. Quantum Computing and its impact on Cryptography
3. Zero-knowledge protocols (or proofs) in Signature Schemes
4. Secret Sharing (or splitting) in Cryptography
5. P vs. NP
It is VERY important that references are included (Note: Wikipedia is NOT acceptable)
Written Report for required format and marking criteria.
Note: Diagrams and / or images should be included. Any words associated with the diagrams should not be used in the 1000 word count for your report. (e.g. Diagram titles).
In: Computer Science
In java P4.6 Following Section 4.9 develop a program that reads text and displays the average number of words in each sentence. Assume words are separated by spaces, and a sentence ends when a word ends in a period. Start small and just print the first word. Then print the first two words. Then print all words in the first sentence. Then print the number of words in the first sentence. Then print the number of words in the first two sentences. Then print the average number of words in the first two sentences. At this time, you should have gathered enough experience that you can complete the program.
In: Computer Science
In python,
1- Modify your mapper to count words after removing punctuation marks during mapping.
Practice the given tasks in Jupyter notebook first before running them on AWS. If your program fails, check out stderr log file for information about the error.
import sys
sys.path.append('.')
for line in sys.stdin:
line = line.strip() #trim spaces from beginning and
end
keys = line.split() #split line by space
for key in keys:
value = 1
print ("%s\t%d" % (key,value)) #for
each word generate 'word TAB 1' line
In: Computer Science
In python,
Modify your mapper to count the number of occurrences of each character (including punctuation marks) in the file.
Practice the given tasks in Jupyter notebook first before running them on AWS. If your program fails, check out stderr log file for information about the error.
import sys
sys.path.append('.')
for line in sys.stdin:
line = line.strip() #trim spaces from beginning and
end
keys = line.split() #split line by space
for key in keys:
value = 1
print ("%s\t%d" % (key,value)) #for
each word generate 'word TAB 1' line
In: Computer Science
In terms of job loss, North Carolina is one of the states most adversely affected by the United States-Mexico-Canada Agreement (USMCA). Assume hypothetically that North Carolina is considering a 25% tariff (tax) on all foreign-manufactured textiles and furniture items imported into the state. The tariff's purpose will be predominately protective in nature, designed to protect and advance textile and furniture manufacturers in North Carolina and to create jobs. In a 250- to 500-word (double spaced Word document) original paper, advise the lawmakers in North Carolina of the CONSTITUTIONALITY of such a tariff and what other alternative remedies are available, if any.
In: Operations Management