In: Computer Science
1) Describe a real-world example that uses one of the Data Mining Tasks and why is this task best suited to this example?
PLEASE EXPLAIN IN DETAIL.
Clustering is datamining task.
In real world applications we need to form clusters of entities with similar features.
Resume processing applications: skill set of candidates are considered.
Based on relevant skill set like , known programming languages, previous years of experience in IT field are considered to form clusters.
In marketing applications like customer relationship management requires to group customers based on income ranges.
Income above 1,50,000
Income range 80000-1,50000
income between 50,000 to 80000
income less than 50000
This clusters are used to contact customers to know better about them and to approach them for better services.
—————————-
classification is data mining task to used to classify them into known classes
in loan process
class A - to classify a person as fraudulent
class B - to classify person as legitimate.
based on features, history of payments, valid income sources, person is classified into either of classes.
in clustering , there is no labelling method , number of clusters are unknown, based on the distance between clusters we need to merge them or separate the clusters.