From LinkedIn’s engineering team:
While tools like Ganglia provide system-level metrics, we wanted to be able
to understand what resources were being used by each user and at what times.
White Elephant parses Hadoop logs to provide visual drill downs and rollups
of task statistics for your Hadoop cluster, including total task time, slots
used, CPU time, and failed job counts.
Isn’t this a form of resource usage auditing? Based on this, next you could build support for resource quotas and then start enforcing them.
Original title and link: White Elephant: Task Statistics for Hadoop ( ©myNoSQL)