| Parameters | RDBMS | Hadoop |
|---|---|---|
| Definition | RDBMS is a Relational Database Management System in which data is stored in the form of tables consisting of rows and columns. | Hadoop is an open source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. |
| Use | It is used for OLTP (Online Transaction Processing). | It is used for analytical and especially for big data processing. |
| Size of Data | It can handle data upto gigabytes. | It can handle Petabytes and even more large data sets. |
| Data Structure | It can work only with structured data. | It can work with all structured, semi structured and unstructured data. |
| Manufactures | Sql Server, MySQL, Oracle etc. | Hadoop implementations by Cloudera, Intel and Amazon. |
| File system | It rely on OS file system. | It is based on distributed file system - HDFS. |
| Integrity | High. It has ACID properties. | Low. |
| Data Schema | Static | Dynamic |
| Access Method | Batch | Interactive and batch |
| Scaling | Non linear | Linear |
| Normalization of Data | Required | Not required |
| Query Response Time | Can be near immediate. | Has latency. |
No comments:
Post a Comment