hadoop - Download large volumes from S3 to Local Machine? - s3distcp -
currently using distcp
slow, taking 4:16 minutes copy 1 hour's worth of logs, while custom function wrote me takes 16 seconds. given amazon provides s3distcp
examples involving logs, thought give go , test performance.
i know possible distcp
possible use s3distcp
on local machine copy large volumes of data (potentially 100gb+) onto hfs cluster on local machine without use of emr?
amazon , subsequent tutorials , articles reference s3distcp
abilities step in emr..
Comments
Post a Comment