This reviewer's identity has been verified by our review moderation team. They have asked not to show their name, job title, or picture.
At the heart of the Pandas library is the data frame, which makes using the Pandas framework interoperable from a skills-building standpoint. Not only will learning the methods in Pandas be valuable within Python, but you can quickly transfer your knowledge of the framework to R or even Spark (for big data applications). Further, the framework itself implemented in Python is beneficial for data analysis, providing numerous helper functions on the data frame object, that include aggregation methods, standard statistical calculation methods, and handy join/merge, and subsetting functionality that all data analysts will likely use. On top of that, it is built on top of Numpy for easy transference between those types for more heavy-duty/actual work or even pushing it up to a higher level of abstraction for more data-viz/communications/analysis work. Review collected by and hosted on G2.com.
There's not much to dislike, except perhaps memory and some run-time constraints. By adding a lot of 'extra' structure on top of the NumPy array, the data frame isn't the most efficient data type, but what you get is worth the extra resources needed to run it, though maybe not at extreme scale (several dozen gigs or more than a couple million rows depending on how many columns of data is included in your frame). Review collected by and hosted on G2.com.
Validated through LinkedIn
This reviewer was offered a nominal gift card as thank you for completing this review.
Invitation from G2. This reviewer was offered a nominal gift card as thank you for completing this review.



