Re: Low throughput of FileChannel

Hari Shreedharan Thu, 02 Aug 2012 22:50:29 -0700

Thanks for filing the jira. It would be nice if you could update the jira with 
what you did/plan to do differently to improve performance. If you already have 
the patch, please consider submitting it for review. We will try and expedite 
its commit.


Thanks
Hari

--  
Hari Shreedharan


On Thursday, August 2, 2012 at 8:27 PM, Hari Shreedharan wrote:

> Denny,  
>  
> Please file a jira and post your code changes if you would like to contribute 
> it to Apache Flume. One of us will be glad to review and commit it. This way, 
> it will benefit the community in general. This will also allow us to discuss 
> the performance benefits of your code changes.  
>  
> Thanks,
> Hari
>  
> --  
> Hari Shreedharan
>  
>  
> On Thursday, August 2, 2012 at 8:22 PM, Denny Ye wrote:
>  
> > hi Hari,  
> >     Mostly channels in my production environment will be configured with 
> > FileChannel. It may impact our platform performance. Also I'm not sure if 
> > anyone already have got better throughput. If anyone have similar result 
> > with me, I'd like to post my code changes to discuss.
> >  
> > -Regards
> > Denny Ye
> >  
> > 2012/8/3 Hari Shreedharan <[email protected] 
> > (mailto:[email protected])>
> > > Denny,
> > >  
> > > I am not sure if anyone has actually benchmarked the FileChannel. What 
> > > kind of performance are you getting as of now? If you have a patch that 
> > > can improve the performance a lot, please feel free to submit it. We'd 
> > > definitely like to get such a patch committed.
> > >  
> > > Thanks
> > > Hari
> > >  
> > > --
> > > Hari Shreedharan
> > >  
> > >  
> > > On Thursday, August 2, 2012 at 8:02 PM, Denny Ye wrote:
> > >  
> > > > hi all,
> > > >     I posted performance of MemoryChannel last week. That's normal 
> > > > throughput in most environment. Therefore, the performance result of 
> > > > FileChannel is below expectation with same environments and parameters, 
> > > > almost 5MB/s.
> > > >
> > > >     I want to know your throughput result of FileChannel specially. Am 
> > > > I walking with wrong way? It's hard to believe the result.
> > > >
> > > >    Also I have tuning with several code changes, the throughput 
> > > > increasing to 30MB/s. I think there also have lots of points to impact 
> > > > the performance.
> > > >
> > > >     Any guys, would you give me your throughput result or feedback for 
> > > > tuning?
> > > >
> > > > -Regards
> > > > Denny Ye
> > > >
> > > >
> > > > ---------- Forwarded message ----------
> > > > From: Denny Ye <[email protected] (mailto:[email protected]) 
> > > > (mailto:[email protected])>
> > > > Date: 2012/7/25
> > > > Subject: Latest Flume test report and problem
> > > > To: [email protected] (mailto:[email protected]) 
> > > > (mailto:[email protected])
> > > >
> > > >
> > > > hi all,
> > > >    I tested Flume in last week with 
> > > > ScribeSource(https://issues.apache.org/jira/browse/FLUME-1382) and HDFS 
> > > > Sink. More detailed conditions and deployment cases listed below. Too 
> > > > many 'Full GC' impact the throughput and amount of events promoted into 
> > > > old generation. I have applied some tuning methods, no much effect.
> > > >    Could someone give me your feedback or tip to reduce the GC problem? 
> > > > Wish your attention.
> > > >
> > > > PS: Using Mike's report template at 
> > > > https://cwiki.apache.org/FLUME/flume-ng-performance-measurements.html
> > > >
> > > > Flume Performance Test 2012-07-25
> > > > Overview
> > > > The Flume agent was run on its own physical machine in a single JVM. A 
> > > > separate client machine generated load against the Flume box in 
> > > > List<LogEntry> format. Flume stored data onto a 4-node HDFS cluster 
> > > > configured on its own separate hardware. No virtual machines were used 
> > > > in this test.
> > > > Hardware specs
> > > > CPU: Inter Xeon L5640 2 x quad-core @ 2.27 GHz (12 physical cores)
> > > > Memory: 16 GB
> > > > OS: CentOS release 5.3 (Final)
> > > > Flume configuration
> > > > JAVA Version: 1.6.0_20 (Java HotSpot 64-Bit Server VM)
> > > > JAVA OPTS: -Xms1024m -Xmx4096m -XX:PermSize=256m -XX:NewRatio=1 
> > > > -XX:SurvivorRatio=5 -XX:InitialTenuringThreshold=15 
> > > > -XX:MaxTenuringThreshold=31 -XX:PretenureSizeThreshold=4096
> > > > Num. agents: 1
> > > > Num. parallel flows: 5
> > > > Source: ScribeSource
> > > > Channel: MemoryChannel
> > > > Sink: HDFSEventSink
> > > > Selector: RandomSelector
> > > > Config-file
> > > > # list sources, channels, sinks for the agent
> > > > agent.sources = seqGenSrc
> > > > agent.channels = mc1 mc2 mc3 mc4 mc5
> > > > agent.sinks = hdfsSin1 hdfsSin2 hdfsSin3 hdfsSin4 hdfsSin5
> > > >
> > > > # define sources
> > > > agent.sources.seqGenSrc.type = 
> > > > org.apache.flume.source.scribe.ScribeSource
> > > > agent.sources.seqGenSrc.selector.type = io.flume.RandomSelector
> > > >
> > > > # define sinks
> > > > agent.sinks.hdfsSin1.type = hdfs
> > > > agent.sinks.hdfsSin1.hdfs.path = /flume_test/data1/
> > > > agent.sinks.hdfsSin1.hdfs.rollInterval = 300
> > > > agent.sinks.hdfsSin1.hdfs.rollSize = 0
> > > > agent.sinks.hdfsSin1.hdfs.rollCount = 1000000
> > > > agent.sinks.hdfsSin1.hdfs.batchSize = 10000
> > > > agent.sinks.hdfsSin1.hdfs.fileType = DataStream
> > > > agent.sinks.hdfsSin1.hdfs.txnEventMax = 1000
> > > > # ... define sink #2 #3 #4 #5 ...
> > > >
> > > > # define channels
> > > > agent.channels.mc1.type = memory
> > > > agent.channels.mc1.capacity = 1000000
> > > > agent.channels.mc1.transactionCapacity = 1000
> > > > # ... define channel #2 #3 #4 #5 ...
> > > >
> > > > # specify the channel each sink and source should use
> > > > agent.sources.seqGenSrc.channels = mc1 mc2 mc3 mc4 mc5
> > > > agent.sinks.hdfsSin1.channel = mc1
> > > > # ... specify sink #2 #3 #4 #5 ...
> > > > Hadoop configuration
> > > > The HDFS sink was connected to a 4-node Hadoop cluster running CDH3u1. 
> > > > For different HDFS sink, HDFS wrote data into different path.
> > > > Visualization of test setup
> > > > https://lh3.googleusercontent.com/dGumq1pu1Wr3Bj8WJmRHOoLWmUlGqxC4wW7_XCNO9R1wuh15LRXaKKxGoccpjBXtgqcdSVW-vtg
> > > > There are 10 Scribe Clients and each client send 20 million LogEntry 
> > > > objects to ScribleSource.
> > > > Data description
> > > > List<LogEntry> entries containing a string category and a ByteArray 
> > > > body. The ByteArray body size is 500 bytes.
> > > > Results
> > > > Throughput:
> > > > Average:       Source: 46.4 MB/s, Sink: 45.2 MB/s
> > > > Maximum:    Source: 67.1 MB/s, Sink: 88.3 MB/s
> > > >
> > > > CPU:       Average: 196%, Maximum: 440%
> > > >
> > > > GC:         Young GC: 1636 times,      Full GC: 384 times
> > > >
> > > > No data loss.
> > > > Heap and GC
> > > > By analyzing JVM Heap, we found that there are many LogEntry objects in 
> > > > OldGen. We have tried to carry out some optimizations, but the results 
> > > > are not satisfactory. We will continue to track this limitation.
> > > >
> > > > FullGC Log examples:
> > > > [Full GC [PSYoungGen: 1497984K->0K(1797568K)] [PSOldGen: 
> > > > 1720643K->1693741K(2097152K)] 3218627K->1693741K(3894720K) [PSPermGen: 
> > > > 14566K->14566K(262144K)], 5.0027700 secs] [Times: user=5.01 sys=0.00, 
> > > > real=5.00 secs]
> > > > [Full GC [PSYoungGen: 1497960K->0K(1797568K)] [PSOldGen: 
> > > > 1693805K->1752540K(2097152K)] 3191765K->1752540K(3894720K) [PSPermGen: 
> > > > 14571K->14571K(262144K)], 5.0732570 secs] [Times: user=5.07 sys=0.00, 
> > > > real=5.07 secs]
> > > > [Full GC [PSYoungGen: 1497984K->0K(1797568K)] [PSOldGen: 
> > > > 1752540K->1642553K(2097152K)] 3250524K->1642553K(3894720K) [PSPermGen: 
> > > > 14572K->14568K(262144K)], 5.0710730 secs] [Times: user=5.07 sys=0.01, 
> > > > real=5.08 secs]
> > > >
> > > >
> > > > －Regards
> > > > Denny Ye
> > > >
> > > >
> > >  
> >  
>

Re: Low throughput of FileChannel

Reply via email to