Re: Latest Flume test report and problem

Denny Ye Wed, 25 Jul 2012 05:10:39 -0700

hi Alex
   What you got has been set with 4GB heap

-Regards
Denny Ye


2012/7/25 alo.alt <[email protected]>

> Hey Denny,
>
> Please set XMX and XMS as the same (4GB here), because on high traffic
> sinks it could be that the allocation process can cause a gc sweep. Also I
> would recommend to define the gen sizes with:
> -XX:NewSize=64m -XX:MaxNewSize=64m
>
> cheers,
> Alex
>
>
>
>
> On Mittwoch, 25. Juli 2012 11:48:22, Denny Ye wrote:
>
>> hi Alex,
>>     Attachment is preparing for you!
>>     Long term pause may be the critical problem for us. Do you agree me ?
>>     Wish your response, thanks!
>>
>> -Regards
>> Denny Ye
>>
>> 2012/7/25 alo.alt <[email protected] <mailto:[email protected]>>
>>
>>
>>     Hey Denny,
>>
>>     thanks for the report.
>>
>>     Can you please try to rerun with:
>>
>>     JAVA_OPTS="-Xms200m -Xmx200m -Xmn32m -XX:+UseParNewGC
>>     -XX:+UseConcMarkSweepGC -XX:**CMSInitiatingOccupancyFraction**=70
>> -Xss128k
>>     -XX:+UseMembar -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCDateStamps
>>     -Xloggc:/var/log/flume/gc.log"
>>
>>     Plese attach the gc.log after.
>>
>>     cheers,
>>     Alex
>>
>>     Am 25.07.2012 10 <tel:25.07.2012%2010>:35, schrieb Denny Ye:
>>
>>     > hi all,
>>     >
>>     >    I tested Flume in last week with ScribeSource(
>>     > 
>> https://issues.apache.org/**jira/browse/FLUME-1382<https://issues.apache.org/jira/browse/FLUME-1382>)
>> and HDFS Sink.
>>     More
>>     > detailed conditions and deployment cases listed below. Too many
>>     'Full GC'
>>     > impact the throughput and amount of events promoted into old
>>     generation. I
>>     > have applied some tuning methods, no much effect.
>>     >
>>     >    Could someone give me your feedback or tip to reduce the GC
>>     problem?
>>     > Wish your attention.
>>     >
>>     >
>>     > PS: Using Mike's report template at
>>     >
>>     https://cwiki.apache.org/**FLUME/flume-ng-performance-**
>> measurements.html<https://cwiki.apache.org/FLUME/flume-ng-performance-measurements.html>
>>     >
>>     > *
>>     > *
>>     >
>>     > *Flume Performance Test 2012-07-25*
>>     >
>>     > *Overview*
>>     >
>>     > The Flume agent was run on its own physical machine in a single
>>     JVM. A
>>     > separate client machine generated load against the Flume box in
>>     > List<LogEntry> format. Flume stored data onto a 4-node HDFS cluster
>>     > configured on its own separate hardware. No virtual machines
>>     were used in
>>     > this test.
>>     >
>>     > *Hardware specs*
>>     >
>>     > CPU: Inter Xeon L5640 2 x quad-core @ 2.27 GHz (12 physical cores)
>>     >
>>     > Memory: 16 GB
>>     >
>>     > OS: CentOS release 5.3 (Final)
>>     >
>>     > *Flume configuration*
>>     >
>>     > JAVA Version: 1.6.0_20 (Java HotSpot 64-Bit Server VM)
>>     >
>>     > JAVA OPTS: -Xms1024m -Xmx4096m -XX:PermSize=256m -XX:NewRatio=1
>>     > -XX:SurvivorRatio=5 -XX:InitialTenuringThreshold=**15
>>     > -XX:MaxTenuringThreshold=31 -XX:PretenureSizeThreshold=**4096
>>     >
>>     > Num. agents: 1
>>     >
>>     > Num. parallel flows: 5
>>     >
>>     > Source: ScribeSource
>>     >
>>     > Channel: MemoryChannel
>>     >
>>     > Sink: HDFSEventSink
>>     >
>>     > Selector: RandomSelector
>>     >
>>     > *Config-file*
>>     >
>>     > # list sources, channels, sinks for the agent
>>     >
>>     > agent.sources = seqGenSrc
>>     >
>>     > agent.channels = mc1 mc2 mc3 mc4 mc5
>>     >
>>     > agent.sinks = hdfsSin1 hdfsSin2 hdfsSin3 hdfsSin4 hdfsSin5
>>     >
>>     >
>>     >
>>     > # define sources
>>     >
>>     > agent.sources.seqGenSrc.type =
>>     org.apache.flume.source.**scribe.ScribeSource
>>     >
>>     > agent.sources.seqGenSrc.**selector.type = io.flume.RandomSelector
>>     >
>>     >
>>     >
>>     > # define sinks
>>     >
>>     > agent.sinks.hdfsSin1.type = hdfs
>>     >
>>     > agent.sinks.hdfsSin1.hdfs.path = /flume_test/data1/
>>     >
>>     > agent.sinks.hdfsSin1.hdfs.**rollInterval = 300
>>     >
>>     > agent.sinks.hdfsSin1.hdfs.**rollSize = 0
>>     >
>>     > agent.sinks.hdfsSin1.hdfs.**rollCount = 1000000
>>     >
>>     > agent.sinks.hdfsSin1.hdfs.**batchSize = 10000
>>     >
>>     > agent.sinks.hdfsSin1.hdfs.**fileType = DataStream
>>     >
>>     > agent.sinks.hdfsSin1.hdfs.**txnEventMax = 1000
>>     >
>>     > # ... define sink #2 #3 #4 #5 ...
>>     >
>>     >
>>     >
>>     > # define channels
>>     >
>>     > agent.channels.mc1.type = memory
>>     >
>>     > agent.channels.mc1.capacity = 1000000
>>     >
>>     > agent.channels.mc1.**transactionCapacity = 1000
>>     >
>>     > # ... define channel #2 #3 #4 #5 ...
>>     >
>>     >
>>     >
>>     > # specify the channel each sink and source should use
>>     >
>>     > agent.sources.seqGenSrc.**channels = mc1 mc2 mc3 mc4 mc5
>>     >
>>     > agent.sinks.hdfsSin1.channel = mc1
>>     >
>>     > # ... specify sink #2 #3 #4 #5 ...
>>     >
>>     > *Hadoop configuration*
>>     >
>>     > The HDFS sink was connected to a 4-node Hadoop cluster running
>>     CDH3u1. For
>>     > different HDFS sink, HDFS wrote data into different path.
>>     >
>>     > *Visualization of test setup*
>>     >
>>     >
>>     https://lh3.googleusercontent.**com/**dGumq1pu1Wr3Bj8WJmRHOoLWmUlGqx*
>> *C4wW7_**XCNO9R1wuh15LRXaKKxGoccpjBXtgq**cdSVW-vtg<https://lh3.googleusercontent.com/dGumq1pu1Wr3Bj8WJmRHOoLWmUlGqxC4wW7_XCNO9R1wuh15LRXaKKxGoccpjBXtgqcdSVW-vtg>
>>     >
>>     > There are 10 Scribe Clients and each client send 20 million LogEntry
>>     > objects to ScribleSource.
>>     >
>>     > *Data description*
>>     >
>>     > List<LogEntry> entries containing a string category and a
>>     ByteArray body.
>>     > The ByteArray body size is 500 bytes.
>>     >
>>     > *Results*
>>     >
>>     > Throughput:
>>     >
>>     > Average:       Source: 46.4 MB/s, Sink: 45.2 MB/s
>>     >
>>     > Maximum:    Source: 67.1 MB/s, Sink: 88.3 MB/s
>>     >
>>     >
>>     >
>>     > CPU:       Average: 196%, Maximum: 440%
>>     >
>>     >
>>     >
>>     > GC:         Young GC: 1636 times,      Full GC: 384 times
>>     >
>>     >
>>     > No data loss.
>>     >
>>     > *Heap and GC*
>>     >
>>     > By analyzing JVM Heap, we found that there are many LogEntry
>>     objects in
>>     > OldGen. We have tried to carry out some optimizations, but the
>>     results are
>>     > not satisfactory. We will continue to track this limitation.
>>     >
>>     >
>>     >
>>     > FullGC Log examples:
>>     >
>>     > [Full GC [PSYoungGen: 1497984K->0K(1797568K)] [PSOldGen:
>>     > 1720643K->1693741K(2097152K)] 3218627K->1693741K(3894720K)
>>     [PSPermGen:
>>     > 14566K->14566K(262144K)], 5.0027700 secs] [Times: user=5.01
>>     sys=0.00,
>>     > real=5.00 secs]
>>     >
>>     > [Full GC [PSYoungGen: 1497960K->0K(1797568K)] [PSOldGen:
>>     > 1693805K->1752540K(2097152K)] 3191765K->1752540K(3894720K)
>>     [PSPermGen:
>>     > 14571K->14571K(262144K)], 5.0732570 secs] [Times: user=5.07
>>     sys=0.00,
>>     > real=5.07 secs]
>>     >
>>     > [Full GC [PSYoungGen: 1497984K->0K(1797568K)] [PSOldGen:
>>     > 1752540K->1642553K(2097152K)] 3250524K->1642553K(3894720K)
>>     [PSPermGen:
>>     > 14572K->14568K(262144K)], 5.0710730 secs] [Times: user=5.07
>>     sys=0.01,
>>     > real=5.08 secs]
>>     >
>>     >
>>     >
>>     > －Regards
>>     >
>>     > Denny Ye
>>     >
>>
>>
>>

Re: Latest Flume test report and problem

Reply via email to