In: Computer Science
What are the the three characteristics of Big Data, and what are the main considerations in processing Big Data?
1. Volume: The Big data itself means the volume of data is tremendous, so much that it is not possible to process the data on a single node machine.. Size plays a big role in deciding how to process the big data.
2. Variety: The data can be in various form, mainly known as structured, unstructured and semi-structured data.. It is comparatively easier to parse a structured data than unstructured data (such as logs etc..). It also means the type of data, like text, video, images etc.
3. Velocity: The speed with which the data is created is also very important, While designing any system, we need to know how much data is expected to come everyday, month or year and will our systems be capable of processing that.
Hence, The above points are mainly considered while deciding any architecture in order to process any bigdata... The latency and availability of the system is also something very important, which may vary on case to case basis.. If some service needs to provide the response in real time, then it needs to be very fast, and thus, we may require some caching solution along with processing to serve in real time.