What is HDFS Hadoop comes with a distributed file system called HDFS. In HDFS, data is distributed over several machines and replicated…
Hadoop tutorial
-
RDBMS vs HBase The differences between RDBMS and HBase are given below. A schema/database in RDBMS can be compared to a namespace in HBase.…
-
Pig Latin Pig Latin is a data flow language used by Apache Pig to analyze data in Hadoop. It is…
-
Hive – Alter Table In Hive, we can modify an existing table, for example by changing the table name, column name, comments,…
-
What is Big Data Data that is very large in size is called Big Data. Normally we work on data of size…
-
Apache Pig Run Modes Apache Pig executes in two modes: Local Mode and MapReduce Mode. Local Mode: it executes in a single…
-
Hive Architecture The following architecture explains how a query is submitted to Hive. Hive Client: Hive allows writing applications in various…
-
What is Hadoop Hadoop is an open-source framework from Apache that is used to store, process, and analyze data which are…
-
Pig UDF (User Defined Functions) To specify custom processing, Pig provides support for user-defined functions (UDFs). Thus, Pig allows us to create…
-
Hive – Create Database In Hive, a database is considered a catalog or namespace of tables. So, we can maintain multiple…