What is the CAP Theorem? The CAP theorem applies to distributed data stores (collections of interconnected nodes). It is also known as Brewer’s theorem and describes the trade-offs involved in distributed consistency. It is built on the following three properties of distributed systems: C – Consistency, A – Availability, P – Partition Read More …
Difference Between NoSQL Cassandra and Apache Hadoop
What is Cassandra? Cassandra is an open source NoSQL database that handles large amounts of data spread across multiple servers. It serves data to online transactional applications and business intelligence workloads. Cassandra created Read More …
Roles And Responsibilities of Hadoop Administrator
Who is a Hadoop Administrator? A Hadoop Administrator maintains the Hadoop cluster and manages all of its resources. The administrator's job is not related to Hadoop application development. The main job of the administrator is installing the Hadoop cluster to meet the company's needs. Read More …
Apache Hadoop Hive Tutorial
What is Hive? Hive is a data warehousing tool used to process data in Hadoop and HDFS. Hive is similar to SQL because it analyzes and processes data with a query language. Hive runs on top of Hadoop and MapReduce. Hive Read More …
Apache Kafka Architecture and Components
What is Apache Kafka? Kafka is designed for distributed systems. It is mainly used to transfer data from Hadoop using a messaging system. A messaging system transfers data from one application to another, but it does not consider how to transfer data and Read More …
Apache Hadoop Flume Tutorial
What is Apache Flume? Apache Flume is a tool used to move data from one place to another. Flume is a distributed system that transports data in a reliable manner. Flume is an important part of the Hadoop ecosystem. In Apache Flume Read More …
Apache Spark Tutorial
What is Apache Spark? Spark is also an open source framework, mainly used for data analytics. Spark typically runs faster than Hadoop MapReduce and is designed to run on top of Hadoop. Spark does not have a separate file system and it integrates with Read More …
Apache Hadoop Oozie Tutorial
Introduction: Oozie is mainly used to manage Hadoop jobs, and it combines multiple jobs in a particular order to accomplish a larger task. It is an open source framework used to schedule multiple Hadoop jobs. Oozie supports the Read More …
Top Ten Differences Between Apache HBase and Hive
S.No. / Apache Hive / Apache HBase. 1: Hive is a data warehousing tool used to process data in Hadoop and HDFS. Hive is similar to SQL because it analyzes and processes data with a query language. / Apache HBase is open source Read More …
Ten Amazing Big Data Myths
Big Data holds great promise for enterprises of all sizes. It can bring insights that help the business drive revenue and also understand gaps in service and products. Here are some myths about data: 1. Big data is new Read More …