Question

In: Computer Science

1. You're processing data to be uploaded into a database. In what stage of data preprocessing...

1. You're processing data to be uploaded into a database. In what stage of data preprocessing would you deal with fields where there is no data, or data is missing? A.Data Consolidation B.Data Cleaning C. Data Transformation D. Data Reduction

2. You're company, Outdoor Excursions just acquired another local tour company, Excursions Inc. and you've been tasked with merging Excursions Inc's database with yours. During datapreprocessing, you encounter inconsistencies in the "marital status" column. Some values indicate "married", "single", or "widowed", others are represented by their first letters "m", "s", and "w", and others are left blank. What are some ways you could deal with that data? Select all that apply. Delete the column Recode the existing values Do Nothing Fill in missing values All of the above

3. Explain the term "model fitting". Why is it important in data mining and machine learning?

Solutions

Expert Solution

1.) Choice(B) is correct, i.e., data cleaning. In data cleaning, the raw is scanned for all incorrect,irrelevant, null data, missing data and it is handles accordingly.

2.) Recoding the existing data such as where m or f or w is written, writing code to replace those characters with male, female or widow would match attribute format correctly. It will ensure the process of handling the data of the attribute would become easier.

3.) Model fitting is a measuring factor which tracks how perfectly a machine learning model can be able to generalise and interpret the data which is similar to the data used in training. A good model fitting is determined when output is accurate when some unseen and untrained(data not used in training ML) data is passed as input. Fitting is adjusting the parameter so to improve the accuracy of determining the output with unseen input. This together helps to used data mine in an optimized way which in turn improves machine learning.


Related Solutions

Please describe the steps taken in Data Preprocessing. Give examples and explain what was done in...
Please describe the steps taken in Data Preprocessing. Give examples and explain what was done in each of these steps. PLEASE PROVIDE DETAILED EXPLANATION , SO I CAN UNDERSTAND EASILY COURSE : DATA MINING
1 You're entering records into a database, and you realize you don't have the information for...
1 You're entering records into a database, and you realize you don't have the information for one of the fields. What do you do? Use letters in alphabetical order to select options A In the Field Properties section, change the Required field to Blank so users can skip the field during data entry. B Unless a field's set up to require an entry, you can skip the fields for which you don't have any data. C In order for a...
Data Preprocessing is an important area in order to have quality input data. The four methods...
Data Preprocessing is an important area in order to have quality input data. The four methods discussed are data cleaning, data integration, data transformation and data reduction. a.Explain each of these methods in 2 or 3 paragraphs. (16 points) b.Suppose we were to develop software to implement each of these techniques. Discuss how easy or difficult it would be to develop software for each technique. Give this some thought and write a well-thought answer. Imagine that you had to write...
Need answers for Normalization, Physical Design, Sql, And Security exam. 1. The database you're creating will...
Need answers for Normalization, Physical Design, Sql, And Security exam. 1. The database you're creating will be installed on a group of three servers. What feature of an enterprise RDBMS will allow one server to pick up the processing work if the main server becomes nonoperational? A. Failover B. Business intelligence C. Data warehouse D. Load balancing 2. What type of clause must you always use with DELETE or UPDATE to avoid inadvertently changing data elsewhere in the database? A....
1. Provide brief answers to the following questions: a) What is preprocessing in C programming language?...
1. Provide brief answers to the following questions: a) What is preprocessing in C programming language? Cite 3 examples of preprocessor directives in C. b) What is the main difference between Heap and Stack memory regions? c) What is the difference between Stack and Queue? d) What is the purpose of a compiler? Name the C compiler that you used for this course? e) What is a pointer variable in C? 2. int var; Use this variable to write a...
1. Explain the purpose of resource pooling in regards to cloud computing. 2.When data is uploaded...
1. Explain the purpose of resource pooling in regards to cloud computing. 2.When data is uploaded to a cloud service provider's infrastructure, it is often said the organization loses physical control over their data. Explain how this is not entirely true. 3. Select the cloud technology that best fits the following scenario. A VM is allocated 8GB of memory and averages 3 to 4GB of memory utilization. Another VM is started and needs an additional 4GB of memory. The hypervisor...
You're caring for a mother in the transition stage of labor when she begins crying and...
You're caring for a mother in the transition stage of labor when she begins crying and says, “It hurts so much. I don’t know if I can take this anymore.” As you were admitting her, when contractions were less frequent and intense, the client told you it was very important to her that she deliver the baby without taking any pain medication and that she would feel like a failure if she gave in and took a narcotic. Will you...
1. for 1 molecule of glucose (6 C-atoms), the stage of pyruvate processing generates NADH, CO2...
1. for 1 molecule of glucose (6 C-atoms), the stage of pyruvate processing generates NADH, CO2 and Acetyl CoA ATP, H+, oxaloacetate 2NADH, 2CO2 and 2Acetyl CoA 2ATP, 2H+, 2 oxaloacetate 2. how many reduced electron carriers after glycolysis, pyruvate processing and citric acid cycle are available to make the ET work 5 NADH, 1 FADH2 4 ATP, 5NADH, 1 FADH2 4 ATP, 10NADH, 2 FADH2 10 NADH, 2 FADH2
Define data processing and explain the steps to be followed for data processing
Define data processing and explain the steps to be followed for data processing
What are the similarities and differences between database, data warehouse, and data mining?
What are the similarities and differences between database, data warehouse, and data mining?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT