> At the time of the below reported problem there was a swarm
> of OOMEs. Three regionservers went down within minutes of each
> other.
> The load at the time was four reducers that were writing
> serialized Document objects back. I suspect the writes were
> all hitting the same few regions (<= 4). These writes were
> in addition to the crawler write load of ~100-200
> objects/second.

Statistics from one invocation of that job:
Counters: 13
  File Systems
    Local bytes read=6599658380
    Local bytes written=11191638985
  Job Counters
    Launched reduce tasks=5
    Launched map tasks=373
  Map-Reduce Framework
    Reduce input groups=61134
    Combine output records=0
    Map input records=781246
    Reduce output records=61134
    Map output bytes=4514592311
    Map input bytes=0
    Combine input records=0
    Map output records=61134
    Reduce input records=61134

The map output bytes is a good ballpark for the amount of
data that will collectively be committed back to HBase by
the reducers.

  - Andy



      

Reply via email to