Uploaded image for project: 'Infinispan'
  1. Infinispan
  2. ISPN-4318

Infinispan should collect statistics for M/R tasks

    XMLWordPrintable

Details

    • Enhancement
    • Resolution: Obsolete
    • Major
    • None
    • 7.0.0.Alpha4
    • Clustered Executor
    • None

    Description

      Map/Reduce tasks should collect statistics during the task execution that can be returned to the user to help them determine the optimal settings for the task. Here are some thoughts on useful statistics:

      Final status - completed, failed, cancelled, etc.
      Duration - either overall, per node, per phase (map, reduce, combine, collate)
      Number of nodes participating in the task
      Keys in the intermediate cache
      Keys in the result map
      Node specific statistics:
      Status of node - completed, failed, cancelled, etc.
      Number of keys processed
      Max size of collector

      Here are the built in counters that are reported by Hadoop:
      https://www.inkling.com/read/hadoop-definitive-guide-tom-white-3rd/chapter-8/counters

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rhn-support-afield Alan Field
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: