Parameters | RDBMS | Hadoop |
---|---|---|
Definition | RDBMS is a Relational Database Management System in which data is stored in the form of tables consisting of rows and columns. | Hadoop is an open source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. |
Use | It is used for OLTP (Online Transaction Processing). | It is used for analytical and especially for big data processing. |
Size of Data | It can handle data upto gigabytes. | It can handle Petabytes and even more large data sets. |
Data Structure | It can work only with structured data. | It can work with all structured, semi structured and unstructured data. |
Manufactures | Sql Server, MySQL, Oracle etc. | Hadoop implementations by Cloudera, Intel and Amazon. |
File system | It rely on OS file system. | It is based on distributed file system - HDFS. |
Integrity | High. It has ACID properties. | Low. |
Data Schema | Static | Dynamic |
Access Method | Batch | Interactive and batch |
Scaling | Non linear | Linear |
Normalization of Data | Required | Not required |
Query Response Time | Can be near immediate. | Has latency. |
Thursday, 12 April 2018
RDBMS Vs Hadoop
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment