Nitin has Published 3 Articles

What is Bucketing in Hive?

Nitin

Nitin

Updated on 25-Aug-2022 12:30:31

5K+ Views

Bucketing is a method in Hive which is used for organizing the data. It is a concept of separating data into ranges known as buckets. Bucketing in hives comes helpful when the use of partitioning becomes hard. A user can determine the range of a specific bucket by the hash ... Read More

RDD Shared Variables In Spark

Nitin

Nitin

Updated on 25-Aug-2022 12:29:12

891 Views

The full name of the RDD is a distributed database. Spark performance is based on this ambiguous set, enabling it to consistently cope with major data processing conditions, including MapReduce, streaming, SQL, machine learning, graphs, etc. Spark supports many programming languages, including Scala, Python, and R. RDD also supports the ... Read More

Sqoop Integration with Hadoop Ecosystem

Nitin

Nitin

Updated on 25-Aug-2022 12:27:12

362 Views

Data was previously stored in relational data management systems when Hadoop and big data concepts were not available. After introducing Big Data concepts, it was essential to store the data more concisely and efficiently. However all data stored in the related data management system needs to be transferred to the ... Read More

1
Advertisements