
Beginner Level Projects on Apache Spark

Apache Spark: Spark, an open-source framework for general data analytics on distributed computing clusters, was originally designed at the University of California, Berkeley, and later donated to the Apache Software Foundation. Spark's real-time data processing capability gives it a substantial lead over Hadoop's MapReduce. Spark is a multi-stage, RAM-capable compute framework with libraries for machine learning, interactive queries, and graph analytics. It can run on a Hadoop cluster with YARN, on Mesos, or in standalone mode, so comparing it directly to Hadoop is apples and oranges, really. An interesting point to note here is that Spark has no distributed filesystem of its own. For distributed storage, it has to use either HDFS or an alternative such as the MapR File System, Cassandra, OpenStack Swift, Amazon S3, or Kudu. Now that we have caught a glimpse of Hadoop and Spark, it's time to talk about the different types of data processing they perform. For beginner's level use cases in Spark ...

PROS & CONS OF APACHE SPARK

What are the pros and cons of Apache Spark? Shwati Kumar, part of the Apache Software Foundation, writes: Apache Spark is a next-gen Big Data tool, a general-purpose and lightning-fast cluster computing platform.

Pros of Spark:

- Spark is easy to program and doesn't require much hand coding, whereas MapReduce is not that easy in terms of programming and requires lots of hand coding.
- Apache Spark processes data in memory, while Hadoop MapReduce persists back to disk after each map or reduce action. (Spark does, however, need a lot of memory.)
- Spark is a general-purpose cluster computation engine with support for streaming, machine learning, and batch processing as well as an interactive mode, whereas Hadoop MapReduce supports only batch processing.
- Spark executes batch processing jobs about 10 to 100 times faster than Hadoop MapReduce.
- Spark uses a variety of abstractions such as RDD, DataFrame, Streaming, and GraphX, which makes Spark feature-rich, whereas...