logging - Collect logs from Mesos Cluster


My team is deploying a new cluster on Amazon EC2 instances. After a bit of research, we decided to go with Apache Mesos as the cluster manager and Spark for computation.

The first question we asked ourselves was what the best way would be to collect logs from all the machines, for each different framework. Until now, we have developed custom bash/python scripts that collect logs from predefined locations, zip them, and send the compressed file to S3. This kind of rotation is triggered by a cron job that runs every hour.
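
For reference, the collect-and-compress step described above can be sketched roughly as follows. This is a minimal illustration and not our actual scripts: the function name `archive_logs`, the `*.log` pattern, and the archive naming are assumptions, and the upload to S3 is left out.

```python
import tarfile
import time
from pathlib import Path


def archive_logs(log_dirs, dest_dir, pattern="*.log"):
    """Bundle every file matching `pattern` in `log_dirs` into one .tar.gz.

    Returns the path of the archive, or None if no log files were found.
    (Hypothetical helper; names and layout are illustrative assumptions.)
    """
    files = [
        p
        for d in log_dirs
        for p in sorted(Path(d).glob(pattern))
        if p.is_file()
    ]
    if not files:
        return None

    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    stamp = time.strftime("%Y%m%dT%H%M%S", time.gmtime())
    archive = dest / f"logs-{stamp}.tar.gz"

    with tarfile.open(archive, "w:gz") as tar:
        for path in files:
            # Keep the parent directory name so files from different
            # frameworks do not collide inside the archive.
            tar.add(path, arcname=f"{path.parent.name}/{path.name}")
    return archive
```

The hourly cron job would call something like this once per run and then ship the returned archive to S3 in a separate step.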

I have been searching for the "best" (or standard) way to do this. I found Apache Flume, which is also a data collector for logs, but I don't understand how I could integrate it into a Mesos cluster to collect logs (including those from Spark).

I found this "similar" question, but the solutions are either not open source or no longer supported.

Is there a better way to rotate logs, or a standard way that I'm missing?

Thank you very much.

There is no perfect answer to this. If you are using Spark and are interested in using Flume, you would have to write a custom Flume -> Spark interface, since one doesn't exist as far as I know. However, what you can do is this:

  1. Use Flume to ingest log data in real time.
  2. Have Flume do pre-processing on the log data with a custom interceptor.
  3. Have Flume write to Kafka after the pre-processing is done.
  4. Have Spark Streaming read off of the Kafka queue to process the logs and run your computations.
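
To make steps 2 and 4 more concrete, here is a plain-Python stand-in for the parse-then-aggregate logic. It is only a sketch of the idea: a real Flume interceptor is written in Java and the computation would use Spark's own APIs. The log line layout (`timestamp host LEVEL message`) and the function names are assumptions made for illustration.

```python
from collections import Counter


def parse_line(line):
    """Split 'timestamp host LEVEL message' into a dict.

    Returns None for malformed lines. The field layout is an assumed
    example format, not something Flume or Mesos prescribes.
    """
    parts = line.strip().split(" ", 3)
    if len(parts) < 4:
        return None
    timestamp, host, level, message = parts
    return {
        "timestamp": timestamp,
        "host": host,
        "level": level.upper(),  # normalize so 'error' and 'ERROR' match
        "message": message,
    }


def count_by_host_and_level(lines):
    """Count records per (host, level), skipping lines that fail to parse."""
    records = (parse_line(line) for line in lines)
    return Counter((r["host"], r["level"]) for r in records if r is not None)
```

Here `parse_line` plays the role of the pre-processing done before the data reaches Kafka, and `count_by_host_and_level` is an example of a computation that could run on each batch read from the queue.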

Spark Streaming is supposedly not yet ready for prime-time production use, but this is one potential solution.

