paint-brush
Accelerate Spark and Hive Jobs on AWS S3 by 10x with Alluxio as a Tiered Storage Solutionby@bin-fan
263 reads

Accelerate Spark and Hive Jobs on AWS S3 by 10x with Alluxio as a Tiered Storage Solution

by Bin Fan6mApril 19th, 2020
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Bazaarvoice leverages Alluxio as a caching tier on top of AWS S3 to maximize performance and minimize operating costs on running Big Data analytics on AWS EC2. The company is a software-as-a-service provider that allows retailers and brands to curate, manage, and understand user-generated content such as reviews for their products. The big data platform completely relies on the open source Hadoop ecosystem, utilizing tools such as Apache Hive, Spark for ETLs Kafka, ElasticSearch and HBase for durable datastore.

Companies Mentioned

Mention Thumbnail
Mention Thumbnail
featured image - Accelerate Spark and Hive Jobs on AWS S3 by 10x with Alluxio as a Tiered Storage Solution
Bin Fan HackerNoon profile picture
Bin Fan

Bin Fan

@bin-fan

VP of Open Source and Founding Member @Alluxio

About @bin-fan
LEARN MORE ABOUT @BIN-FAN'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Bin Fan HackerNoon profile picture
Bin Fan@bin-fan
VP of Open Source and Founding Member @Alluxio

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite
Dataengineeringweekly
Dev
Co
Allfamousbirthday
Cheer
Fastly