Some tricky questions and answers relating to Hadoop architecture, MapReduce and Yarn.
Tag: hadoop
Hadoop command reference
A list of helpful command reference and descriptions for Hadoop.
Moving large amounts of data from HDFS to AWS
This post entails moving large amount of data to HDFS using S3DistCp on Amazon AWS.
Google Cloud Dataflow, what and how?
Cloud Dataflow is a Google technology that provides a cloud service to process data. It allows developers to build pipelines, monitor their execution, and transform & analyse data, all in the cloud.
Using Google App Engine to build scalable mobile application
Why I intend to use Google App Engine for my next mobile startup