Thursday 12 April 2018

RDBMS Vs Hadoop


Parameters RDBMS Hadoop
Definition RDBMS is a Relational Database Management System in which data is stored in the form of tables consisting of rows and columns. Hadoop is an open source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Use It is used for OLTP (Online Transaction Processing). It is used for analytical and especially for big data processing.
Size of Data It can handle data upto gigabytes. It can handle Petabytes and even more large data sets.
Data Structure It can work only with structured data. It can work with all structured, semi structured and unstructured data.
Manufactures Sql Server, MySQL, Oracle etc. Hadoop implementations by Cloudera, Intel and Amazon.
File system It rely on OS file system. It is based on distributed file system - HDFS.
Integrity High. It has ACID properties. Low.
Data Schema Static Dynamic
Access Method Batch Interactive and batch
Scaling Non linear Linear
Normalization of Data Required Not required
Query Response Time Can be near immediate. Has latency.

No comments:

Post a Comment