For this assignment, you will select a current research paper (published since 2016) to review. You may select any research paper that is related to Data Science or Big Data Analytics. I strongly recommend that you start your search at Google Scholar (scholar.google.com). Once you enter your search term(s), select the "Since 2016" link on the left. Feel free to choose ANY relevant paper. (I would recommend that you select one that you can read and summarize in a reasonable amount of time. Don't select a 100-page paper!)
I need a 200-word review of that paper.
In recent years, data have been generated at a dramatic pace. The data are available in structured, semi-structured, and unstructured formats. Formally, big data is characterized by the 3Vs, later extended to 4Vs. The 3Vs are volume, velocity, and variety. Volume refers to the huge amount of data generated every day. Velocity is the rate at which data grow and how fast they are gathered for analysis. Variety describes the types of data: structured, semi-structured, and unstructured. The fourth V, veracity, covers availability and accountability. Analyzing data of this kind is challenging for the average person.
Big data analysis has found a home in industries such as healthcare and public administration, where it has been fruitful. Alongside these advantages, several challenges have emerged: data storage and analysis; knowledge discovery and computational complexity; scalability and visualization of data; and information security.
The techniques used for the analysis include statistical analysis, machine learning, data mining, intelligent analysis, cloud computing, quantum computing, and data stream processing. Various tools have been developed over the years to make big data analysis easier and more efficient. Some of these are Apache Hadoop and MapReduce, Apache Mahout, and Apache Spark.
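To make the MapReduce idea mentioned above concrete, here is a minimal single-machine sketch in plain Python. It is not the Hadoop API and is not taken from the reviewed paper; the function names (map_phase, shuffle_phase, reduce_phase, word_count) and the sample sentences are made up for illustration. It only shows the usual map, shuffle, and reduce steps on a word-count task.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Shuffle step: group the emitted values by key (the word)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce step: sum the grouped counts for each word."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(documents):
    """Run map, shuffle, and reduce in sequence over a list of documents."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    return reduce_phase(shuffle_phase(mapped))


if __name__ == "__main__":
    # Hypothetical sample input, used only to exercise the sketch.
    docs = ["big data needs big tools", "data tools scale"]
    print(word_count(docs))
```

In a real framework such as Hadoop or Spark, the map and reduce steps run in parallel on many machines and the shuffle moves data between them over the network; here everything runs in one process so the flow of data is easy to follow.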
There are some open research issues in big data analysis, and these can be classified into four broad categories: the Internet of Things (IoT), cloud computing, bio-inspired computing, and quantum computing. However, the open issues are not limited to these categories.