RE: Speed of plb_temac 3.00 on ML403

Ming Liu Sun, 11 Feb 2007 05:37:27 -0800

Dear Rick,

First thank you SO SO SO MUCH for your kindly telling. It's really usefulfor me to solve problems.

From the test summary listed, I can see that we have similar systems except

that you are using MontaVista while I am using the general open-sourcekernel. Also I enabled all the features which could improve theperformance, including DRE, CSUM offload and SGDMA,etc.

- Are Checksum offload, SGDMA, and DRE enabled in the plb_temac?

Yes. All features are enabled.

- Are you using the TCP_SENDFILE option of netperf? Your UDP numbers are

similar already to what we saw in Linux 2.6, and your TCP numbers aresimilar to what we saw *without* the sendfile option.

Perhaps no. I just understand that sendfile option is so important toimprove performance. At first I thought it will achieve a same performancewith TCP_STREAM, until I saw the article to explain how to use sendfile()to optimize data transfer. So I will try this soon.After some reading of the articles on performance improving, here comeother problems. So I will appreciate you so much if you can clarify themfor me.

1. Results are from PLB_TEMAC, not GSRD. You would likely see similar

throughput rates with GSRD and Linux.

Problem 1: From the website for GSRD, I know that it uses a differentstructure than PLB, where a Multi port mem controller and DMA are added torelease the CPU from move data between Memory and TEMAC. So can GSRDachieve a higher performance than PLB_TEMAC, or similar performance likewhat you said above? If their performance is similar, what's the advantagefor GSRD? Could you please explain some differences between these twostructures?

2. Assuming you have everything tuned for SGDMA based on previous emails,

I would suspect the bottleneck is the 300MHz CPU *when* running Linux. InLinux 2.6 we've not spent any time trying to tune the TCP/Ethernetparameters on the target board or the host, so there could be someoptimizations that can be done at that level. In the exact same system wecan achieve over 800Mbps using the Treck TCP/IP stack, and with VxWorks itwas over 600Mbps.Problem 2. I read XAPP546 of High performance TCP/IP on xilinx FPGA devicesusing the Treck embedded TCP/IP stack. I notice that the features of TreckTCP/IP stack include: Zero-copy send and receive, Jumbo-frame support, CSUMoffload, etc. which could achieve a much higher performance than not usingit. However in the Xilinx TEMAC core V3.00, these features are allsupported: Zero-copy is supported by sendfile() when using Netperf;Jumbo-frame is also supported; CSUM offload and DRE are also supported bythe hardware. So does this mean I can achieve a similarly high performancewith PLB_TEMAC V3.00 and without Treck TCP/IP stack? I mean if all thefeatures of Treck stack have been included in the PLB_TEMAC cores, what'sthe use for Treck stack?Maybe my questions are a little stupid. But I am really confused on them.So thank you so much if you can explain them to me. Thanks a lot.


BR
Ming

_________________________________________________________________

与联机的朋友进行交流，请使用 MSN Messenger: http://messenger.msn.com/cn

_______________________________________________
Linuxppc-embedded mailing list
Linuxppc-embedded@ozlabs.org
https://ozlabs.org/mailman/listinfo/linuxppc-embedded

RE: Speed of plb_temac 3.00 on ML403

Reply via email to