hadoop - Download large volumes from S3 to Local Machine? - s3distcp -


currently using distcp slow, taking 4:16 minutes copy 1 hour's worth of logs, while custom function wrote me takes 16 seconds. given amazon provides s3distcp examples involving logs, thought give go , test performance.

i know possible distcp possible use s3distcp on local machine copy large volumes of data (potentially 100gb+) onto hfs cluster on local machine without use of emr?

amazon , subsequent tutorials , articles reference s3distcp abilities step in emr..


Comments

Popular posts from this blog

python - No exponential form of the z-axis in matplotlib-3D-plots -

php - Best Light server (Linux + Web server + Database) for Raspberry Pi -

c# - "Newtonsoft.Json.JsonSerializationException unable to find constructor to use for types" error when deserializing class -