
Specifically talking the few best points I like about Spark SQL is as follows:
- It is the best choice for big data analytics in collaboration with Hadoop.
- It provides fast access to data in SQL workloads.
- In Spark SQL, many types of data processing can be used together.
- It is easy to pull in multiple data sources - from Spark RDD to external databases.
- Spark SQL supports Map-reduce, SQL queries, Streaming data, Machine learning (ML), and Graph algorithms. Review collected by and hosted on G2.com.
My major dislike is Spark SQL's limitations, including Latency issues, minor files issues, and no real-time data processing. Apache has already resolved some with an alternative solution by Apache Apex. However, These issues need to be determined at Spark SQL as an alternative is okay, but some features that Spark SQL offers aren't available with Apex. Review collected by and hosted on G2.com.
At G2, we prefer fresh reviews and we like to follow up with reviewers. They may not have updated their review text, but have updated their review.
Validated through LinkedIn
This reviewer was offered a nominal gift card as thank you for completing this review.
Invitation from G2. This reviewer was offered a nominal gift card as thank you for completing this review.




