Question

In: Computer Science

The purpose of this problem is to gain familiarity with the dictionary ADT. A homophone is...

The purpose of this problem is to gain familiarity with the dictionary ADT. A homophone is one of two or more words that are pronounced alike but are different in meaning or spelling; for example, the words “two”, “too”, and “to”.

Write a Java program that uses one of the implementations of the dictionary interface (UALDictionary or OALDictionary on Moodle) to find the word that has the most homophones.

Do not use data structures that we have not covered so far in the course. The file “cmudict.0.7a.txt” in the Dictionaries folder on Moodle contains a pronunciation dictionary downloaded from

http://www.speech.cs.cmu.edu/cgi-bin/cmudict

The page also contains a detailed description of the pronunciation dictionary. The file consists of lines of the form

ABUNDANT AH0 B AH1 N D AH0 N T

The first string is the word, which is followed by one or more phonemes (or phones) that describe the pronunciation of the word. There are 39 phonemes occurring in North American English that are used in the dictionary. The collection of 39 symbols is known as the Arpabet, for the Advanced Research Projects Agency (ARPA), which developed it in the 1970’s in connection with research on speech understanding.

The Dictionaries directory on Moodle contains several files you can use. UALDictionary.java is a class that implements the dictionary interface using an unordered array list. The array list stores (key, value) pairs as described in the class KVpair.java. Pronunciation.java is a class that stores and manages access to a (word, phonemes) pair.

There is also a program Homophones.java that shows how you can read in the cmu dictionary (skipping comments and so on). You can modify that program so that when you read in a pronunciation entry you store it in an appropriate dictionary. Call your program MostHomophones.

The output is a first line containing a single integer n, which is the largest number of homophones. The n homophones follow on the next n lines, one word per line.

UAL Dictionary

import java.util.ArrayList;

import java.util.Collections;

class UALDictionary<Key extends Comparable, E> implements Dictionary<Key, E> {

private static final int defaultSize = 10;

private ArrayList<KVpair<Key, E>> list;

// Constructors

UALDictionary() {

this(defaultSize);

}

UALDictionary(int sz) {

list = new ArrayList<KVpair<Key, E>>(sz);

}

public void clear() {

list.clear();

}

/** Insert an element: append to list */

public void insert(Key k, E e) {

KVpair<Key, E> temp = new KVpair<Key, E>(k, e);

list.add(temp);

}

/** Remove element with key k, return element */

public E remove(Key k) {

E temp = find(k);

if (temp != null)

list.remove(new KVpair<Key, E>(k, temp));

return temp;

}

/** Remove any element */

public E removeAny() {

return list.remove(list.size()-1).value();

}

/**

* Find k using sequential search

*

* @return Record with key value k

*/

public E find(Key k) {

for (KVpair<Key, E> t : list)

if (k.compareTo(t.key())==0)

return t.value();

return null; // "k" does not appear in dictionary

}

public Iterable<E> findAll(Key k) {

ArrayList<E> al = new ArrayList<E>();

for (KVpair<Key, E> t : list)

if (k.compareTo(t.key()) == 0)

al.add(t.value());

return al;

}

public int size(){

return list.size();

}

/** Returns an iterable collection of the dictionary. */

public Iterable<E> values() {

ArrayList<E> elements = new ArrayList<E>(list.size());

for(KVpair<Key, E> t : list)

elements.add(t.value());

return elements;

}

Homophones.java

import java.io.File;

import java.io.FileNotFoundException;

import java.util.Scanner;

import java.util.ArrayList;

public class Homophone {

public static void main(String[] args) {

UALDictionary<String, Pronunciation> PDict = new UALDictionary<String, Pronunciation>();

File file = new File("cmudict.0.7a.txt");

final int len = 5; // we start with words of length 5 characters

long start = System.currentTimeMillis();

try {

Scanner scanner = new Scanner(file);

while (scanner.hasNextLine()) {

String line = scanner.nextLine();

if (line.substring(0, 3).equals(";;;"))

continue; // skip comment lines

Pronunciation p = new Pronunciation(line);

if ((p.getWord().length() < len - 1)

|| (p.getWord().length() > len))

continue;

if ((p.getWord().length() == len - 1)

|| (p.getWord().length() == len))

PDict.insert(p.getWord(), p);

}

scanner.close();

} catch (FileNotFoundException e) {

e.printStackTrace();

}

long middle = System.currentTimeMillis();

System.out.println("Loaded dictionary.");

for (Pronunciation p : PDict.values()) {

String w = p.getWord();

if (w.length() == len) {

/* Fill in code to determine if this word

is pronounced the same if we remove the

first letter or if we remove the second.

*/

}

long end = System.currentTimeMillis();

System.out.println("Run times: load dictionary= " + (middle - start)

+ " process= " + (end - middle) + " total= " + (end - start));

}

Please provide comments if possible, I will Thumbs Up for the help!

Expert Solution

:: Solution ::

class Soundex{

private static int getConsonantCode( char ch ){

String codeList[] = { "BFPV", "CGJKQSXZ","DT","L","MN","R" };

int code = 0;

for( int i = 0 ; i < codeList.length ; i++ ){

if( codeList[i].indexOf(ch) >= 0 ) {

code = i+1;

}

return code;

}

private static boolean isVowel( char ch ){

return (new String("AEIOUaeiou")).indexOf(ch) >= 0 ;

}

public static String getSoundexCode( String str ){

str=str.toUpperCase();

String soundexCode = "" + str.charAt(0), temp="";

int length = str.length();

char curr, prev, next;{ }

String dropList = "AEIOUYHW";

for( int i=1 ; i< length ; i++ ){

curr = str.charAt(i);

prev = str.charAt( i-1 );

if( ( curr=='H' || curr == 'W') && i != length-1 ){

if( temp.length() >= 2) temp=temp.substring(1);

next=str.charAt( i+1 );

if( isVowel(curr) && getConsonantCode( prev ) == getConsonantCode( next ) ){

temp += prev+prev;

i=i+1;

}else if( getConsonantCode( prev ) == getConsonantCode( next ) ){

temp += prev;

i=i+1;

}

}else if( getConsonantCode( curr ) != getConsonantCode( prev ) ){

if( dropList.indexOf( curr ) == -1 ){

temp += curr;

}

temp = ( temp + "0000" ).substring( 0, 3 );

for( int i = 0; i<=2 ; i++ ){

soundexCode += getConsonantCode( temp.charAt( i ) );

}

return soundexCode;

}

public static void main( String []args ){

System.out.println( getSoundexCode( "Ashcraft" )+" "+"A261" );

System.out.println( getSoundexCode( "Ashcroft" )+" "+"A261" );

System.out.println( getSoundexCode( "Gauss" )+" "+"G200" );

System.out.println( getSoundexCode( "Ghosh" )+" "+"G200" );

System.out.println( getSoundexCode( "Hilbert" )+" "+"H416" );

System.out.println( getSoundexCode( "Heilbronn" )+" "+"H416" );

System.out.println( getSoundexCode( "Lee" )+" "+"L000" );

System.out.println( getSoundexCode( "Lloyd" )+" "+"L300" );

System.out.println( getSoundexCode( "Moses" )+" "+"M220" );

System.out.println( getSoundexCode( "Pfister" )+" "+"P236" );

System.out.println( getSoundexCode( "Robert" )+" "+"R163" );

System.out.println( getSoundexCode( "Rupert" )+" "+"R163" );

System.out.println( getSoundexCode( "Rubin" )+" "+"R150" );

System.out.println( getSoundexCode( "Tymczak" )+" "+"T522" );

System.out.println( getSoundexCode( "Soundex" )+" "+"S532" );

System.out.println( getSoundexCode( "Example" )+" "+"E251" );

}

Please rate my answer. Thank you...

venereology answered 3 years ago

The purpose of this problem is to gain familiarity with stacks and queues. You have three...

The purpose of this problem is to gain familiarity with stacks and queues. You have three jugs that can hold c1, c2, and c3 liters of water, respectively. Initially, jug 1 is full and the other two jugs are empty. You can repeat the following procedure any number of times: Choose two of the jugs and pour the contents of one into the other until either the first is empty or the second is full. Your goal is to end...

Interpret managed care contract agreements and gain familiarity with language and strategies used by providers and...

Interpret managed care contract agreements and gain familiarity with language and strategies used by providers and managed care organizations. Comparative analysis of managed care contracts

1 Objective The purpose of this assignment is to test your familiarity with Java I/O, and...

1 Objective The purpose of this assignment is to test your familiarity with Java I/O, and if-else statements. Please submit your file as ”Shipping.java” 2 The Backstory Amazon.com wants to try out it’s new ‘Delivery by Drone” service and has recruited you to write a Java console application to calculate the shipping cost. They are going to charge the customer based on various criteria, as shown below: Weight (in kg) Rate per 50 miles 2 or less Over 2 Kg,...

The purpose of this assignment is to test your familiarity with Java I/O statements and if-else statements.

ObjectiveThe purpose of this assignment is to test your familiarity with Java I/O statements andif-else statements. This assignment also tests your understanding of the basics of Javaprogramming and execution, like top-down control flows, translating the logical solutionto a problem into code, and integrating the new concepts you just learned with theolder concepts, including data types and variables, declaration, initialization andassignments, arithmetic operators, etc.Please submit your source code (i.e. your java file) on Canvas under HW1.ProblemIt’s the year 2030 and H&R...

The purpose of this assignment is to gain a better understanding of the code sets used...

The purpose of this assignment is to gain a better understanding of the code sets used for medical billing. These sets can be complicated, but you will learn in the EHR#8 assignment this week, that the EHR practice management functions for billing can use the clinical information to help you with code selection. Coding Classification Sets for Medical Coding and Billing Code Set When is this Code Set used? Format Example Source CPT Category I CPT Category I codes are...

Project Assignment The purpose of this assignment is for you to gain practice in applying the...

Project Assignment The purpose of this assignment is for you to gain practice in applying the concepts and techniques we learned in class. In order to complete the assignment, please do the following: 1. find or create a data set containing values of at least one interval or ratio variable for at least one-hundred cases (n >= 100); 1 2. provide basic descriptive statistics to summarize the central tendency and variability of the data; 3. provide at least one table...

C++ problem - Implement and add the advanced functionalities to the ADT of the BST made...

C++ problem - Implement and add the advanced functionalities to the ADT of the BST made with the fundamental functionalities: Visit - Description: It will display each of the data stored in the BST depending on the input parameter:Preorder Inorder Bidder Level by level Input - An integer (1-4) Exit - Nothing Precondition - A valid BST Postcondition - Nothing Height - Description:Will return the height of the BST Entry - Nothing Output - An integer with which to indicate...

Purpose: This lab will give you experience modifying an existing ADT. Lab Main Task 1: Modify...

Purpose: This lab will give you experience modifying an existing ADT. Lab Main Task 1: Modify the ListInterface Java interface source code given below. Change the name of ListInterface to ComparableListInterface, and have ComparableListInterface inherit from the built-in Java interface Comparable. Note that Comparable has a method compareTo(). compareTo() must be used in programming logic you will write in a method called isInAscendingOrder() (more about isInAscendingOrder() is mentioned further down in the lab description). You can find a brief summary...

In C++, Design and implement an ADT that represents a triangle. The data for the ADT...

In C++, Design and implement an ADT that represents a triangle. The data for the ADT should include the three sides of the triangle but could also include the triangle’s three angles. This data should be in the private section of the class that implements the ADT. Include at least two initialization operations: one that provides default values for the ADT’s data, and another that sets this data to client-supplied values. These operations are the class’s constructors. The ADT also...

Computer Science Design and implement an ADT that represents a triangle. The data for the ADT...

Computer Science Design and implement an ADT that represents a triangle. The data for the ADT should include the three sides of the triangle but could also include the triangle's three angles. The data should be in the private section of the class that implements the ADT. Include at least two initialization operations; one that provides default values for the ADT's data, and another that sets this data to client-supplied values. These operations are the class's constructors. The ADT also...