
Best thing about Dataflow about its fully managed capability so that we don't need to manage infrastructure and scales easily. It also provides lot of templates which is useful for beginner and intermediate level developers and top of that they can easily update the configuration and pipeline and can run process petabyte of data. Also it supports Yaml SDK which removes Apache Beam dependencies as well. Review collected by and hosted on G2.com.
When we are working with distributed processing, its difficult to get correct configuration especially for new user its very complex to set it up and most of the time it charges a lot if not set properly. And as it supports only Apache Beam, some of the concepts are very difficult to understand. Also they can work on monitoring and logging, sometime its not clear. Review collected by and hosted on G2.com.




