Parameters | RDBMS | Hadoop |
Definition | An RDBMS is a relational database management system in which data is stored in tables consisting of rows and columns. | Hadoop is an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models. |
Use | It is used for OLTP (Online Transaction Processing). | It is used for analytics, especially big data processing. |
Size of Data | It is typically used for data in the gigabyte to low-terabyte range. | It can handle petabytes of data and more. |
Data Structure | It works primarily with structured data. | It can work with structured, semi-structured, and unstructured data. |
Products and Vendors | SQL Server, MySQL, Oracle, etc. | Hadoop distributions from Cloudera, Intel, and Amazon. |
File System | It relies on the OS file system. | It is based on a distributed file system, HDFS. |
Integrity | High. It provides ACID properties. | Low. Core Hadoop does not provide ACID transactions. |
Data Schema | Static (schema on write) | Dynamic (schema on read) |
Access Method | Interactive and batch | Batch |
Scaling | Nonlinear | Linear |
Normalization of Data | Required | Not required |
Query Response Time | Can be near immediate. | Has latency. |
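
To make the Integrity and Data Schema rows concrete, here is a minimal sketch in plain Python. It is not Hadoop code: the standard-library `sqlite3` module stands in for an RDBMS, and a small map/reduce-style pair of functions stands in for the kind of batch job Hadoop runs. The `accounts` table, the `transfer` helper, and the `RAW_LOG` format are hypothetical names chosen only for illustration.

```python
import sqlite3
from collections import Counter

# RDBMS side: the schema is fixed up front (schema on write),
# and a transaction either applies fully or not at all.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "id INTEGER PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [(1, 100), (2, 50)])
conn.commit()


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst; both updates commit or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE id = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE id = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint rejected an overdraft; the credit is undone too.
        return False


ok = transfer(conn, 1, 2, 30)       # valid transfer
failed = transfer(conn, 1, 2, 500)  # would overdraw account 1
balances = conn.execute("SELECT balance FROM accounts ORDER BY id").fetchall()

# Hadoop-style side: raw lines are stored as-is and only interpreted
# when a batch job reads them (schema on read).
RAW_LOG = [
    "2024-01-01 GET /index.html 200",
    "malformed line",
    "2024-01-01 GET /missing 404",
    "2024-01-02 POST /login 200",
]


def map_status(line):
    """Map step: emit (status, 1) for each line that parses, skip the rest."""
    parts = line.split()
    if len(parts) == 4 and parts[3].isdigit():
        yield parts[3], 1


def reduce_counts(pairs):
    """Reduce step: sum the emitted values per key."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


status_counts = reduce_counts(
    pair for line in RAW_LOG for pair in map_status(line)
)
```

In this sketch the database rejects the invalid transfer as a whole, whereas the batch job accepts any stored line and decides how to interpret it only at read time. A real Hadoop job would distribute the map and reduce steps across a cluster rather than run them in one process.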