Month: January 2019

Hadoop vs Spark – Choosing the Right Big Data Software

Hadoop vs Spark – Choosing the Right Big Data Software

Considered competitors or enemies in Big Data space by many, Apache Hadoop and Apache Spark are the most looked-for technologies and platforms for big data analytics. More interestingly, in the present time, companies that have been managing and performing big data analytics using Hadoop have also started implementing Spark in their everyday organizational and business … Continue reading Hadoop vs Spark – Choosing the Right Big Data Software

Advertisements
Comparing Hadoop, MapReduce, Spark, Flink, and Storm

Comparing Hadoop, MapReduce, Spark, Flink, and Storm

Companies that need to work with large sets of data have a range of big data, open-source frameworks and solutions from which to choose. Each solution has a different set of advantages, disadvantages and ideal applications. If you're new to Big Data, you may have heard some of these terms. Below we provide a brief … Continue reading Comparing Hadoop, MapReduce, Spark, Flink, and Storm

Real-time Big Data Pipeline with Hadoop, Spark & Kafka

Real-time Big Data Pipeline with Hadoop, Spark & Kafka

Defined by 3Vs that are velocity, volume, and variety of the data, big data sits in the separate row from the regular data. Though big data was the buzzword since last few years for data analysis, the new fuss about big data analytics is to build up real-time big data pipeline. In a single sentence, … Continue reading Real-time Big Data Pipeline with Hadoop, Spark & Kafka