Hi,

On Wed, May 14, 2008 at 12:17:04PM -0400, Audet, Jean-Michel wrote:
> Yes, it's an application that we made to sync up data between
> to application.  I do use the heartbeat API to sync up data.  I
> have approximately 1 MB of data to sync up and I split the data
> into +- 30 packets of 32K.

Why don't you go with smaller packets? Larger packets might be
better if you have jumbo frames enabled, but that doesn't exist
for 100Mbps networks.

> See below the statistics, I don't see any errors or packet drops.
> 
> Here is one of your statements:
> So, it may be that your application is pushing the communication layer's 
> limits. Or the network limits. Hard to say without looking at the network.
> 
> What do you mean about "communication layer's limits"?  Network limits?
> Is sending 1Mbytes of data is big?  This would be very surprising.

It's not, of course. I was just thinking about sending such big
packets back-to-back given that the MTU is much smaller.

> Architecture is very simple.  2 nodes connected to a switch (ZNYX ZX5000). 
> 
> 
> eth3      Link encap:Ethernet  HWaddr 00:A0:A5:5E:08:F8  
>           inet addr:192.168.0.149  Bcast:192.168.0.255  Mask:255.255.255.0
>           inet6 addr: fe80::2a0:a5ff:fe5e:8f8/64 Scope:Link
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           RX packets:184419 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:614979 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:1000 
>           RX bytes:43421083 (41.4 MiB)  TX bytes:767939664 (732.3 MiB)
>           Base address:0x5c00 Memory:f61e0000-f6200000 
> 
> eth3:0    Link encap:Ethernet  HWaddr 00:A0:A5:5E:08:F8  
>           inet addr:192.168.0.100  Bcast:192.168.0.255  Mask:255.255.255.0
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           Base address:0x5c00 Memory:f61e0000-f6200000 
> 
> lo        Link encap:Local Loopback  
>           inet addr:127.0.0.1  Mask:255.0.0.0
>           inet6 addr: ::1/128 Scope:Host
>           UP LOOPBACK RUNNING  MTU:16436  Metric:1
>           RX packets:26 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:26 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:0 
>           RX bytes:1820 (1.7 KiB)  TX bytes:1820 (1.7 KiB)
> 
> 
> Anymore details will be greatly appreciated!

The only thing I can think of is to enable debugging and open a
bugzilla. Please don't forget to include all relevant
information.

Thanks,

Dejan

> Jean-Michel Audet
> 
> Concepteur Logiciel
> Software Designer
> Kontron Canada 
> Tel: (450) 979-5320 
> 1 800 354-4223 
> Fax: (450) 437-8053 
> <mailto:[EMAIL PROTECTED]> 
> Sales:<http://www.kontron.com/> 
> Support:<ftp://ftp.kontron.ca/Support/> 
> 
> L'information contenue dans le pr?sent document est la propri?t? de Kontron 
> Canada Inc. et est divulgu?e en toute confidentialit?. Cette information ne 
> doit pas ?tre r?vis?e, utilis?e, divulgu?e ou distribu?e sans le consentement 
> ?crit explicite de Kontron Canada Inc. Si vous n'?tes pas le destinataire 
> pr?vu et avez re?u cette communication par erreur, veuillez contacter 
> l'originateur et supprimer toute copie.
> 
> The information contained in this document is confidential and property of 
> Kontron Canada Inc. Any unauthorized review, use, disclosure or distribution 
> is prohibited without express written consent of Kontron Canada Inc. If you 
> are not the intended recipient, please contact the sender and destroy all 
> copies of the original message and enclosed attachments.
> 
> 
> -----Message d'origine-----
> De?: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] De la part de Dejan 
> Muhamedagic
> Envoy??: Wednesday, May 14, 2008 11:16 AM
> ??: General Linux-HA mailing list
> Objet?: Re: [Linux-HA] Lots of Retransmitting pkt
> 
> Hi,
> 
> On Wed, May 14, 2008 at 08:58:13AM -0400, Audet, Jean-Michel wrote:
> > Hi all, 
> > 
> >  
> > 
> > Hi, I have an implementation using heartbeat 2.1.2.  It is based on
> > Master/Slave architecture.  I have only two nodes on my implementation.
> > 
> > 
> >  
> > 
> > I have a lot of "Retransmitting pkt".  At the end, all the packets are
> > received, but I think it is very slow.  My packets are all around
> > 32Kbytes in size but on a 100MBits network not a lot of traffic, it
> > should not be any problems.  I am using the function sendnodemsg to
> > directly send message to my slave node.
> 
> You have a home-made application using the heartbeat API?
> 
> > Here is a trace of one of the message.  When sending for example 40
> > packets, 10 to 20 will be a least retransmit once.  
> > 
> > May 13 14:49:43 node1 heartbeat: [29379]: info: Retransmitting pkt 92
> > 
> > May 13 14:49:43 node1 heartbeat: [29379]: info: msg size =32581,
> > type=msg_type_xyz
> > 
> >  
> > 
> > Any idea on what can be the problem?  Need more informations?
> 
> Heartbeat uses UDP (unicast, broadcast, or multicast). Is you
> network healthy? Did you check the interfaces statistics? Did you
> try watching the traffic using the tcpdump (or similar) tool? Did
> you try unicast if you're using broadcast?
> 
> I believe that often Heartbeat packets are close to or exceeding
> 32k with more complex configurations or more than 3-4 nodes, but
> very probably not very often, i.e. packets are big only when full
> CIB is being sent. I don't think that there were many complaints
> of similar nature. So, it may be that your application is pushing
> the communication layer's limits. Or the network limits. Hard to
> say without looking at the network.
> 
> Thanks,
> 
> Dejan
> 
> 
> > Jean-Michel Audet
> > Software, Kontron Canada
> > 
> >  
> > 
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to