I figured out what was wrong, and the problem was udp-sender.
I found a little bug in the source code of udp-sender, which caused udp-sender not being able to track the time that passed since the first connection.
I attached a diff patch to this e-mail which fixes that issue. If you apply this patch to udpcast and recompile udp-sender, it should wait correctly according to the parameters you pass. It works great now over here.
* Attached: udpcast-waitfix.patch
On a side note, I came across a few other things on my 'quest for flamethrower multicasting':
- the flamethrower_client function in /etc/init.d/functions of the ramdisk does not remove /tmp/multicast.tar after untarring it. In some situations, in particular with small .tar's (i.e.: the flamethrower autoinstall scripts dir) the udp-receiver doesn't recreate the file correctly and the .tar becomes corrupt.
* Adding 'rm -f /tmp/multicast.tar' to that function, after untarring, fixes this.
I've applied your fix for 3.3.1, and for 3.2.3.
- there is a other little bug in the flamethrower_client function in /etc/init.d/functions. The COUNT variable is never initialized and set to 0. This causes that udpcasts are not retried correctly, in stead of the default 7 times or specified number of retries.
* Adding 'COUNT=0' to the top of that function fixes it.
COUNT=0 seems to be already getting set, but I may be looking in the wrong place. Can you send in a patch?
- flamethrowerd has a small bug; when a PORTBASE is set for one of the modules in flamethrower.conf, it keeps on trying to start udp-sender for that module, and overwriting the .pid file in the state dir. This causes that some udp-sender processes are not killed when using the init.d script to restart/stop flamethrower.
* Workaround: don't specify a PORTBASE :)
Thanks, Ramon!
-Brian
Perhaps one of the code maintainers could have a look at these issues and/or apply these fixes?
Over here, we now have multicasting images working, after fixing the issues above.
I hope this information is of any use for anyone.
Kind regards,
Ramon.
-----Original Message----- From: [EMAIL PROTECTED] [mailto:sisuite-users- [EMAIL PROTECTED] On Behalf Of Samuele Tognini Sent: donderdag 27 mei 2004 20:24 To: [EMAIL PROTECTED] Subject: R: [Sisuite-users] _multi_ casting -> udp-sender does not wait?
I had the same problem, and i did not resolve it in a good way. Anyway i got multicast to work on redhat 9 (after some little manual changes in source code and master script ). To get all pc to start all the same moment, i dind't start flamethrowerd service till all clients were waiting for udpcast server ( in fact if the udp client does not connect the udp server then it remains in wait).
Regards, Samuele Tognini
-----Messaggio originale----- Da: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Per conto di Ramon Bastiaans Inviato: gioved? 27 maggio 2004 9.56 A: [EMAIL PROTECTED] Oggetto: [Sisuite-users] _multi_ casting -> udp-sender does not wait?
Hi,
I have been working on getting multicast systemimager image broadcasts to work, but I am having a hard time.
After testing with flamethrowerd, I tried manually casting with the udp sender and receiver, to see if that would work.
Is it just me (am I doing something wrong?) or does udp-sender completely ignore the wait and client parameters?
Udp-sender doesn't wait at all for a minimum number of clients, after the first connection, it immediately starts the multicast to the first client.
On the server:
# udp-sender --pipe 'tar -B -S -cpf - -C /home/test/sara/ramon/testdir . ' --portbase 12345 --min-clients 20 --max-wait 1800 --min-wait 600 --fec 8x8/128 --interface eth0 --max-bitrate 20M --full-duplex --nopointopoint --nokbd stripes=8 redund=8 stripesize=128 Udp-sender 2004-02-22 Using mcast address 234.168.44.1 Compressed UDP sender for (stdin) at 10.168.44.1 on eth0 Broadcasting control to 10.168.44.255 New connection from 10.168.44.5 (#0) 00000019 Starting transfer: 00000019 bytes= 11 929 600 re-xmits=000000 ( 0.0%) slice=1024 73 709 551 615 - 0 Transfer complete. Disconnecting #0 (10.168.44.5)
On the client:
# udp-receiver --interface eth0 --portbase 12345 --nokbd --nosync --file ./bla.tar Udp-receiver 2004-02-22 UDP receiver for ./bla.tar at 10.168.44.5 on eth0 received message, cap=00000019 Connected as #0 to 10.168.44.1 Listening to multicast on 234.168.44.1 bytes= 11 929 600 ( 15.91 Mbps) 11 929 600 Transfer complete. [EMAIL PROTECTED]:~/1.udptest$
Now there is no timestamps, but the udp-sender starts the multicast in about 2 seconds after the connect from the client. But the sender does not have 20 clients connected, and it didn't wait 10 minutes at all (I kind of overdid the values here, but with lower values it doesn't work either).
Am I missing something? I would be happy to hear if anyone got the _multi_ cast to work or has any tips.
Kind regards,
Ramon Bastiaans.
------------------------------------------------------- This SF.Net email is sponsored by: Oracle 10g Get certified on the hottest thing ever to hit the market... Oracle 10g. Take an Oracle 10g class now, and we'll give you the exam FREE. http://ads.osdn.com/?ad_id149&alloc_id?66&op=ick _______________________________________________ Sisuite-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/sisuite-users
------------------------------------------------------- This SF.Net email is sponsored by: Oracle 10g Get certified on the hottest thing ever to hit the market... Oracle 10g. Take an Oracle 10g class now, and we'll give you the exam FREE. http://ads.osdn.com/?ad_id149&alloc_id?66&op=ick _______________________________________________ Sisuite-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/sisuite-users
--
------------------------------------------------------
Brian Elliott Finley Argonne, MCS Division Mobile: 630.631.6621 Office: 630.252.4742
gpg --keyserver wwwkeys.pgp.net --recv-keys 10F8EE52
------------------------------------------------------
-------------------------------------------------------
This SF.Net email is sponsored by: Oracle 10g
Get certified on the hottest thing ever to hit the market... Oracle 10g. Take an Oracle 10g class now, and we'll give you the exam FREE.
http://ads.osdn.com/?ad_id=3149&alloc_id=8166&op=click
_______________________________________________
Sisuite-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/sisuite-users
