What is Data Science? How can you relate the term ‘e-science’ to data science? A data scientist typically performs 3 tasks. What are they?
Answer: Data science is a field concerned with extracting knowledge from structured and unstructured data. It lets you translate a business problem into a research project and then translate the findings back into a practical solution. Statistics, visualization, machine learning, and deep learning are important parts of data science. The data science process moves through six phases: discovery, data preparation, model planning, model building, operationalization, and communication of results. The wide variety of information and data is one of the biggest challenges in data science.
Various tools are used in data science:
1) SAS
2) MATLAB
3) SQL
4) Python
E-science is a research approach that involves collecting, processing, and using scientific information in the form of data, typically in very large amounts. It relates to data science because it works with immense data sets that require grid computing; the term may also cover the technologies that enable this kind of distributed work. Such data sets are often referred to as big data.
The three tasks a data scientist typically performs are:
1) Identifying the data analytics problems that offer the greatest opportunities to the organization.
2) Determining the correct data sets and variables.
3) Collecting large sets of structured and unstructured data from various sources.
Other tasks include:
1) Cleaning and validating the data to ensure accuracy and completeness.
2) Applying models and algorithms to mine stores of big data.
3) Analyzing the data to identify patterns and trends.
4) Interpreting the data to discover solutions.
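As a rough illustration of the last four tasks (cleaning and validating, applying a model, analyzing a trend, interpreting it), here is a minimal Python sketch. The sample records, the validity rule (no missing or negative values), and the function names `clean`, `fit_line`, and `describe_trend` are all assumptions made up for this example; a real project would more likely use a library such as pandas and a model chosen for the actual problem.

```python
from statistics import mean

# Hypothetical monthly sales records as (month, units_sold).
# None marks a missing value; a negative count is treated as invalid.
raw_records = [(1, 100), (2, 110), (3, None), (4, 135), (5, -5), (6, 150)]


def clean(records):
    """Cleaning and validating: keep only records with a non-negative value."""
    return [(m, v) for m, v in records if v is not None and v >= 0]


def fit_line(records):
    """Applying a model: least-squares line, returns (slope, intercept)."""
    xs = [m for m, _ in records]
    ys = [v for _, v in records]
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def describe_trend(slope):
    """Interpreting: turn the fitted slope into a plain-language finding."""
    if slope > 0:
        return "upward"
    if slope < 0:
        return "downward"
    return "flat"


cleaned = clean(raw_records)
slope, intercept = fit_line(cleaned)
print(len(cleaned), "valid records; trend is", describe_trend(slope))
```

With the sample data above, two records are discarded during cleaning and the fitted slope is positive, so the script reports an upward trend over four valid records. A straight-line fit is only one possible model; it is used here because it is short enough to read in full.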