This post entails moving large amount of data to HDFS using S3DistCp on Amazon AWS.
Tag: bigdata
Google Cloud Dataflow, what and how?
Cloud Dataflow is a Google technology that provides a cloud service to process data. It allows developers to build pipelines, monitor their execution, and transform & analyse data, all in the cloud.