Apache Parquet

Rating: 4.1 out of 5 (5 reviews)

Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, data model or programming language.
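
To make "columnar" concrete, here is a minimal sketch in plain Python of the difference between a row-oriented and a column-oriented layout. It is only an illustration of the idea, not the Parquet file format itself; the sample records and the helper name to_columns are made up for this example.

```python
# Illustrative only: a toy comparison of row-oriented and column-oriented
# layouts. This is NOT the Parquet file format, just the idea behind it.

# Hypothetical sample records (row-oriented: one dict per record).
rows = [
    {"user_id": 1, "country": "US", "amount": 9.5},
    {"user_id": 2, "country": "DE", "amount": 12.0},
    {"user_id": 3, "country": "US", "amount": 3.25},
]


def to_columns(records):
    """Pivot a list of row dicts into one list of values per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# Row layout: computing a total walks every record to pick out one field.
total_from_rows = sum(record["amount"] for record in rows)

# Column layout: the same total reads a single list and ignores the rest.
total_from_columns = sum(columns["amount"])
```

A query that needs only one field can skip the other columns entirely in the second layout, which is the main reason analytical tools tend to favor column stores.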

Apache Parquet Reviews

Showing 5 Apache Parquet reviews
Apache Parquet review by Jake B.
Validated Reviewer

"A great format for columnar data"

What do you like best?

I love how easy it makes storing columnar data. Once you learn the details, it makes working with Hadoop a breeze. Column-store data has many benefits, and Parquet is a big help.

What do you dislike?

There is definitely a learning curve with the environment, but it is minimal. There honestly is not much I dislike about it.

Recommendations to others considering the product:

I would definitely recommend Apache Parquet if you are considering using columnar-store data!

What problems are you solving with the product? What benefits have you realized?

I had to gather raw data and consolidate it so that I could run statistical analysis and machine learning on it. Apache Parquet made my job a lot easier. This data analysis was a huge step toward completing the project.

Apache Parquet review by Consultant
Validated Reviewer
Verified Current User

"Parquet is the Big Data solution"

What do you like best?

It is a widely adopted file format that works well with all big data applications.

What do you dislike?

I have no real complaints about Parquet. It's just a file format, much like CSV. One complaint, I guess, is that you have to rewrite your existing Parquet files to get the benefits of newer format versions.

What problems are you solving with the product? What benefits have you realized?

Big data analysis, ETL, etc.

Apache Parquet review by User
Validated Reviewer

"Well-designed format for your data needs"

What do you like best?

I am impressed by how well designed the file format is. It is best suited to big data and data analysis.

What do you dislike?

It has a steep learning curve. You have to weigh the benefits against the drawbacks.

What problems are you solving with the product? What benefits have you realized?

Data analysis and machine learning.

Apache Parquet review by User
Validated Reviewer

"Parquet for data storage"

What do you like best?

Works with any table/data format we use.

What do you dislike?

It can be difficult to load from S3 when files get too big.

What problems are you solving with the product? What benefits have you realized?

Saving training data for our production models.

Apache Parquet review by Administrator
Validated Reviewer

"Apache Parquet"

What do you like best?

The way the parquet-format project contains the format specifications, and how properly formatted they are.

What do you dislike?

The complex nature of the database for a simple project.

What problems are you solving with the product? What benefits have you realized?

Building Java resources that actually work.

* We monitor all Apache Parquet reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. Validated reviews require the user to submit a screenshot of the product containing their user ID, in order to verify a user is an actual user of the product.