Question

In: Computer Science

Write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This...

Write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation for future discussions by your classmates. Be substantive and clear, and use examples to reinforce your ideas.
Databases have evolved to include various new storage techniques to hold structured and unstructured data. One approach to group them uses the CAP theorem, which states that of the three characteristics—consistency, availability, and partition tolerance—only two can exist simultaneously. Within these new formats, the following databases exist:

Graph


Key-value


Columnar


Document


For this discussion, do the following:

Discuss the CAP theorem and the choose two principle. Provide examples of each.


Research NoSQL databases, including graph, key-value, columnar, and document databases. Provide a summary of each. Use examples as necessary to reinforce the discussion.


cite all of your references using APA format.

you are researching both topics in this assignment. it is only one question with two parts.

Solutions

Expert Solution

CAP Theorem-

In theoretical computer science, the CAP theorem, also named Brewer's theorem after computer scientist Eric Brewer, states that it is impossible for a distributed data store to simultaneously provide more than two out of the following three guarantees:

  • Consistency: Every read receives the most recent write or an error
  • Availability: Every request receives a (non-error) response, without the guarantee that it contains the most recent write
  • Partition tolerance: The system continues to operate despite an arbitrary number of messages being dropped (or delayed) by the network between nodes

When a network partition failure happens should we decide to

  • Cancel the operation and thus decrease the availability but ensure consistency
  • Proceed with the operation and thus provide availability but risk inconsistency

The CAP theorem implies that in the presence of a network partition, one has to choose between consistency and availability. Note that consistency as defined in the CAP theorem is quite different from the consistency guaranteed in ACID database transactions.

No distributed system is safe from network failures, thus network partitioning generally has to be tolerated. In the presence of a partition, one is then left with two options: consistency or availability. When choosing consistency over availability, the system will return an error or a time out if particular information cannot be guaranteed to be up to date due to network partitioning. When choosing availability over consistency, the system will always process the query and try to return the most recent available version of the information, even if it cannot guarantee it is up to date due to network partitioning.

In the absence of network failure – that is, when the distributed system is running normally – both availability and consistency can be satisfied.

CAP is frequently misunderstood as if one has to choose to abandon one of the three guarantees at all times. In fact, the choice is really between consistency and availability only when a network partition or failure happens; at all other times, no trade-off has to be made.

Database systems designed with traditional ACID guarantees in mind such as RDBMS choose consistency over availability, whereas systems designed around the BASE philosophy, common in the NoSQL movement for example, choose availability over consistency.

The PACELC theorem builds on CAP by stating that even in the absence of partitioning, another trade-off between latency and consistency occurs.

NoSQL:

A NoSQL (originally referring to "non-SQL" or "non-relational") database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases.

Motivations for this approach include: simplicity of design, simpler "horizontal" scaling to clusters of machines (which is a problem for relational databases),finer control over availability and limiting the object-relational impedance mismatch.[8] The data structures used by NoSQL databases (e.g. key–value pair, wide column, graph, or document) are different from those used by default in relational databases, making some operations faster in NoSQL. The particular suitability of a given NoSQL database depends on the problem it must solve. Sometimes the data structures used by NoSQL databases are also viewed as "more flexible" than relational database tables.

Many NoSQL stores compromise consistency (in the sense of the CAP theorem) in favor of availability, partition tolerance, and speed. Barriers to the greater adoption of NoSQL stores include the use of low-level query languages (instead of SQL, for instance), lack of ability to perform ad-hoc joins across tables, lack of standardized interfaces, and huge previous investments in existing relational databases. Most NoSQL stores lack true ACID transactions, although a few databases have made them central to their designs.

Instead, most NoSQL databases offer a concept of "eventual consistency", in which database changes are propagated to all nodes "eventually" (typically within milliseconds), so queries for data might not return updated data immediately or might result in reading data that is not accurate, a problem known as stale reads. Additionally, some NoSQL systems may exhibit lost writes and other forms of data loss. Some NoSQL systems provide concepts such as write-ahead logging to avoid data loss. For distributed transaction processing across multiple databases, data consistency is an even bigger challenge that is difficult for both NoSQL and relational databases. Relational databases "do not allow referential integrity constraints to span databases". Few systems maintain both ACID transactions and X/Open XA standards for distributed transaction processing. Interactive relational databases share conformational relay analysis techniques as a common feature. Limitations within the interface environment are overcome using semantic virtualization protocols, such that NoSQL services are accessible to most operating systems.

Over time, four major types of NoSQL databases emerged: document databases, key-value databases, wide-column stores, and graph databases. Let’s examine each type.

  • Document databases store data in documents similar to JSON (JavaScript Object Notation) objects. Each document contains pairs of fields and values. The values can typically be a variety of types including things like strings, numbers, booleans, arrays, or objects, and their structures typically align with objects developers are working with in code. Because of their variety of field value types and powerful query languages, document databases are great for a wide variety of use cases and can be used as a general purpose database. They can horizontally scale-out to accomodate large data volumes. MongoDB is consistently ranked as the world’s most popular NoSQL database according to DB-engines and is an example of a document database. For more on document databases, visit What is a Document Database?.

  • Key-value databases are a simpler type of database where each item contains keys and values. A value can typically only be retrieved by referencing its value, so learning how to query for a specific key-value pair is typically simple. Key-value databases are great for use cases where you need to store large amounts of data but you don’t need to perform complex queries to retrieve it. Common use cases include storing user preferences or caching. Redis and DynanoDB are popular key-value databases.

  • Wide-column stores store data in tables, rows, and dynamic columns. Wide-column stores provide a lot of flexibility over relational databases because each row is not required to have the same columns. Many consider wide-column stores to be two-dimensional key-value databases. Wide-column stores are great for when you need to store large amounts of data and you can predict what your query patterns will be. Wide-column stores are commonly used for storing Internet of Things data and user profile data. Cassandra and HBase are two of the most popular wide-column stores.

  • Graph databases store data in nodes and edges. Nodes typically store information about people, places, and things while edges store information about the relationships between the nodes. Graph databases excel in use cases where you need to traverse relationships to look for patterns such as social networks, fraud detection, and recommendation engines. Neo4j and JanusGraph are examples of graph databases.

Note: Plzzz don' t give dislike.....Plzzz comment if u have any problem i will try to resolve it.......


Related Solutions

write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This...
write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation for future discussions by your classmates. Be substantive and clear, and use examples to reinforce your ideas. In 1944, finance specialists and bankers from around the world met to discuss what the post-WWII monetary system would be. Given the instability of the pre-war period, the goal was to create a new system. The outcome of this was the Bretton Woods...
write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This...
write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation for future discussions by your classmates. Be substantive and clear, and use examples to reinforce your ideas. The importance of trade continues to be a debated topic because the gains are not always quantifiable by those involved. Some would argue that due to a significant difference in wages and regulations, free trade is not always fair trade and that free...
Write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This...
Write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation for future discussions by your classmates. Be substantive and clear, and use examples to reinforce your ideas. In today’s modern world, people constantly create data through their constant desire and need to be connected to the digital world. For this activity, you will research the various types of data that are commonly generated. Examples may include browsing products in an...
write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. Research...
write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. Research the "as-a-service" type of approach to cloud computing while considering Software as a Service (SaaS), Identity as a Service (IDaaS), Platform as a Service (PaaS), and Infrastructure as a Service (IaaS). How can these approaches reduce costs and add value? Specific attention should also be given to the (SaaS) cloud model. How can this model reduce costs and add value to collaboration services like...
Primary Task Response: write 400–600 words that respond to the following questions with your thoughts, ideas,...
Primary Task Response: write 400–600 words that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation for future discussions by your classmates. Be substantive and clear, and use examples to reinforce your ideas. In the Mercantile period, from the 16th through the end of the 18th centuries, trade was driven by a need to accumulate gold and silver. Nations worked to restrict imports and drive exports. With the advent of the Industrial Age,...
Respond to the following scenario with your thoughts, ideas, and comments. Be substantive and clear, and...
Respond to the following scenario with your thoughts, ideas, and comments. Be substantive and clear, and use research to reinforce your ideas. Over lunch, you and Mary meet to discuss next steps with the expansion project. “Do we have everything we need on sales and costs?” you ask. ”It must be time to compute the net present value (NPV) and internal rate of return (IRR) of the Apix expansion project.” “We have the data from James and Luke regarding projected...
Within the Discussion Board area, write 400-500 words that respond to the following questions with your...
Within the Discussion Board area, write 400-500 words that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation for future discussions by your classmates. Be substantive and clear, and use examples to reinforce your ideas. There are many reasons why a company should pay attention to their reputation as it relates to their future vendor relations and product supply costs and availability. Complete the following: Discuss why it is important for organizations to...
Within the Discussion Board area, write 500 words that respond to the following questions with your...
Within the Discussion Board area, write 500 words that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation for future discussions by your classmates. Be substantive and clear, and use examples to reinforce your ideas. Discuss the 2 pros and 2 cons of activity-based costing. Give an example of a situation where activity-based costing could be used effectively. Explain your reason.
Within the Discussion Board area, write 200-300 words that respond to the following questions with your...
Within the Discussion Board area, write 200-300 words that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation for future discussions by your classmates. Be substantive and clear, and use examples to reinforce your ideas. The line managers in the production department have heard a lot of good things about value chain management. They heard that you have done considerable research on this topic. They have come to your office today to discuss...
Respond to and answer the following scenarios and questions: A. In your own words, explain the...
Respond to and answer the following scenarios and questions: A. In your own words, explain the concept of the multiplier. B. Approximately 1,500 out-of-town epidemiologists attended their annual convention in June of 2017. It was the largest convention ever held in the city of Boise. The average amount spent by an out-of-town convention attendee is $280 dollars per day. Assume the convention lasts 3 days, and the marginal propensity to consume is .50. How much will businesses in Boise benefit...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT