Re: [casper] Question of chose Correlator Architecture

2014-12-02 Thread David MacMahon
Hi, Peter,

On Dec 2, 2014, at 9:34 AM, Peter Niu wrote:

> Thanks for your reply.Our switch ports are enough,but the Nics on roach is 
> not enough(we only have 4 ports each roach),so  if we send packets to x box 
> though switch,we only need 4 10Gbe ports on each roach.that is why we want to 
> use the pre-build paper model.

Yes, to do "F -> SWITCH -> X" you only need four 10 GbE ports on the ROACH2.  
In theory this could be done with 1 SFP+ card in the ROACH2, but if you want to 
use the pre-built model available on the internet to do this, you will still 
need 2 SFP+ cards in each ROACH2 due to how the interfaces are allocated.  If 
you modify the PAPER model, you can reassign the four "eth_N_sw" cores to be on 
1 SFP+ card.  The "eth_N_gpu" cores would be unused and could be deleted to 
save resources.

> if we want to use the pre-build model,the initial ruby scripts may should be 
> modified.the eth_n_gpu codes should be delete,the arp codes of eth_n_sw 
> should be changed.except the ethernet part of ruby codes should be 
> modified,need the other 
> parts codes like the PFB ,EQ parts be changed?

Yes, the network config stuff will have to change somewhat, but the rest of the 
configuration will remain the same.  Once you have settled on a network 
configuration, it should be fairly straightforward to change the 
paper_feng_init.rb script accordingly.

> could the pre-build model and scripts find on the websites?

All my pre-built PAPER F engine models and scripts are on the internet and I 
think you already have them.

Dave




Re: [casper] ROACH serial connection issues

2014-12-02 Thread Norbert Bonnici
Hi David,

The problem is that I don't see the counter anymore and any command I
give the roach such as run bit, run sailboat etc are ignored or at
least no feedback is receive after the "Hit any key" message now.

U boot is running using bootstrap option C. Should it be H? The dip
switches are currently all in the off position. I have also tried
different combinations of the dip switches such as bootstrap option B
(slow mode)

I will try to use a different USB to serial adapter maybe mine is
faulty or something weird is happening.

Kind regards,
Norbert

On 2 December 2014 at 18:07, David MacMahon  wrote:
> Hi, Norbert,
>
> If you hit any key to stop autoboot when it says "Hit any key to stop 
> autoboot", does it in fact stop the autoboot?  If so, you could use u-boot's 
> "printenv" command to see what commands get run as part of autoboot and then 
> try to run them "by hand" to try to figure out where things go bad.
>
> HTH,
> Dave
>
> On Dec 2, 2014, at 12:16 AM, Norbert Bonnici wrote:
>
>> Hi Marc,
>>
>> The USB dongle's baud rate should have been set properly. When set to
>> different baud rates no readable data is received through the serial
>> port. Added line wrapping but it didn't change anything.
>>
>> In addition, recently the communications are being disabled when the
>> GND wire is connected to the USB dongle. Data is only being received
>> when only the tx and rx wires are connected.
>>
>> Regards,
>> Norbert
>>
>> On 2 December 2014 at 08:39, Marc Welz  wrote:
>>>
>>>
>>> On Mon, Dec 1, 2014 at 2:47 PM, Norbert Bonnici
>>>  wrote:

 Dear Marc,

 I've have tried all the possible CR+LF combinations.
>>>
>>>

 Any ideas?
>>>
>>>
>>> Then I am not sure - I know that some USB dongles attempt to autodetect the
>>> serial
>>> speed - maybe something is going wrong there ? Also, maybe enable line
>>> wrapping (Control-A W) might help.
>>>
>>> BTW:  CC'ing the mailing list is good form - it helps others who might have
>>> the same problem, and you might also get suggestions from other people
>>>
>>> regards
>>>
>>> marc
>>>
>>>
>>
>
>



Re: [casper] Question of chose Correlator Architecture

2014-12-02 Thread Peter Niu
Hi,Dave, Thanks for your reply.Our switch ports are enough,but the Nics on 
roach is not enough(we only have 4 ports each roach),so if we send packets to x 
box though switch,we only need 4 10Gbe ports on each roach.that is why we want 
to use the pre-build paper model. if we want to use the pre-build model,the 
initial ruby scripts may should be modified.the eth_n_gpu codes should be 
delete,the arp codes of eth_n_sw should be changed.except the ethernet part of 
ruby codes should be modified,need the other parts codes like the PFB ,EQ parts 
be changed?could the pre-build model and scripts find on the websites? Best 
Regards! Peter -- 发自 Android 网易邮箱 On 2014-12-03 01:01 , David MacMahon Wrote: 
Hi, Peter, If you have enough ports on the switch then you certainly can 
configure things to send the packets directly from the ROACHs to the various X 
boxes.  The pre-built PAPER model will support this if properly configured.  
The one thing to keep in mind is the IP-to-MAC address table in the 10 GbE 
cores.  These are setup by paper_feng_init.rb. You could use the "factory" MAC 
addresses of the X-box 10 GbE interfaces or you could configure the X boxes to 
set their 10 GbE MAC addresses to predefined values.  If using the latter 
approach, we often use 02:02:ww:xx:yy:zz where ww:xx:yy:zz corresponds to the 
IP address. Hope this helps, Dave On Dec 1, 2014, at 5:21 AM, Peter Niu wrote: 
> Hi,Dave, > Thanks for your Document about EQ,and suggestion about the sample 
rate.Now I have a question about the correlator architecture. > I have saw your 
PPT : > Correlator Architectures > Present and Future > CASPER Workshop 2011 > 
> The structure mentioned in the PPT is the structure PAPER used now .Using a 
set of precise IP assignment to avoid Loop Back is ok. However,If we use the 
Packetized F/X Concept:Uses two ports on switch per F/X pair.It may not meet 
the Loop Back problem.On the switch, The IP address will tell the packet which 
Xeng to go .The structure which The PAPER model using now is the eth_?_gpu port 
on ROACH connect HPC port directly.Is this only for saving ports on 
switch?Well,Our switch have 64 ports,If we use the  two ports on switch per F/X 
pair Concept,the ports may be sufficient . > This is the question asked by my 
teacher Wu fengquan. As PAPER provide a lot of ruby control scripts online to 
use,I'd rather use this model exits now.What should I say to him?Is there some 
more advantages to use this structure instead the two ports on switch per F/X 
pair? > Thanks for your help! > Best wishes! > Peter > > > >

Re: [casper] NFS setup: TFTP permissions problem

2014-12-02 Thread David MacMahon
Hi, Michael,

In addition to the other suggestions, you should check whether you are running 
dnsmasq in "tftp-secure" more.  That might impose ownership and/or permission 
restrictions.  See "man dnsmasq" for more details.

Dave

On Dec 2, 2014, at 6:07 AM, Michael D'Cruze wrote:

> Hi everyone
> 
> I'm following the NFS setup guide, and have come across a problem with the 
> /srv/roach_boot/boot directory permissions. I restart the dnsmasq service and 
> receive the following error:
> 
> Starting dnsmasq: 
> dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission denied
>[FAILED]
> 
> The output of ls -l from /srv/roach_boot is
> 
> [root@roach-workstation roach_boot]# ls -l
> total 8
> drwxrwxrwx.  2 root root 4096 Dec  1 16:31 boot
> drwxrwxrwx. 23 root root 4096 Feb  2  2009 etch
> 
> and from within /boot is
> 
> [root@roach-workstation boot]# ls -l
> total 1360
> -rwxrwxrwx. 1 michael michael 1390149 Dec  1 15:35 uImage-20110812-mmcomitfix
> 
> The output of ls --context from within /boot is
> 
> [root@roach-workstation boot]# ls --context
> -rwxrwxrwx. michael michael unconfined_u:object_r:tftpdir_t:s0 
> uImage-20110812-mmcomitfix
> 
> All of these permissions and contexts look correct according to the 
> guideso I'm at a bit of a loss. Has anyone seen this problem before, 
> given all of the above conditions?
> 
> Does the /boot directory have to have the same context as the uImage file 
> within it?
> 
> Suggestions or guidance greatly appreciated.
> 
> Michael




Re: [casper] ROACH serial connection issues

2014-12-02 Thread David MacMahon
Hi, Norbert,

If you hit any key to stop autoboot when it says "Hit any key to stop 
autoboot", does it in fact stop the autoboot?  If so, you could use u-boot's 
"printenv" command to see what commands get run as part of autoboot and then 
try to run them "by hand" to try to figure out where things go bad.

HTH,
Dave

On Dec 2, 2014, at 12:16 AM, Norbert Bonnici wrote:

> Hi Marc,
> 
> The USB dongle's baud rate should have been set properly. When set to
> different baud rates no readable data is received through the serial
> port. Added line wrapping but it didn't change anything.
> 
> In addition, recently the communications are being disabled when the
> GND wire is connected to the USB dongle. Data is only being received
> when only the tx and rx wires are connected.
> 
> Regards,
> Norbert
> 
> On 2 December 2014 at 08:39, Marc Welz  wrote:
>> 
>> 
>> On Mon, Dec 1, 2014 at 2:47 PM, Norbert Bonnici
>>  wrote:
>>> 
>>> Dear Marc,
>>> 
>>> I've have tried all the possible CR+LF combinations.
>> 
>> 
>>> 
>>> Any ideas?
>> 
>> 
>> Then I am not sure - I know that some USB dongles attempt to autodetect the
>> serial
>> speed - maybe something is going wrong there ? Also, maybe enable line
>> wrapping (Control-A W) might help.
>> 
>> BTW:  CC'ing the mailing list is good form - it helps others who might have
>> the same problem, and you might also get suggestions from other people
>> 
>> regards
>> 
>> marc
>> 
>> 
> 




Re: [casper] Question of chose Correlator Architecture

2014-12-02 Thread David MacMahon
Hi, Peter,

If you have enough ports on the switch then you certainly can configure things 
to send the packets directly from the ROACHs to the various X boxes.  The 
pre-built PAPER model will support this if properly configured.  The one thing 
to keep in mind is the IP-to-MAC address table in the 10 GbE cores.  These are 
setup by paper_feng_init.rb.

You could use the "factory" MAC addresses of the X-box 10 GbE interfaces or you 
could configure the X boxes to set their 10 GbE MAC addresses to predefined 
values.  If using the latter approach, we often use 02:02:ww:xx:yy:zz where 
ww:xx:yy:zz corresponds to the IP address.

Hope this helps,
Dave

On Dec 1, 2014, at 5:21 AM, Peter Niu wrote:

> Hi,Dave,
> Thanks for your Document about EQ,and suggestion about the sample rate.Now I 
> have a question about the correlator architecture.
> I have saw your PPT : 
> Correlator Architectures
> Present and Future
> CASPER Workshop 2011
> 
> The structure mentioned in the PPT is the structure PAPER used now .Using a 
> set of precise IP assignment to avoid Loop Back is ok. However,If we use the 
> Packetized F/X Concept:Uses two ports on switch per F/X pair.It may not meet 
> the Loop Back problem.On the switch, The IP address will tell the packet 
> which Xeng to go .The structure which The PAPER model using now is the 
> eth_?_gpu port on ROACH connect HPC port directly.Is this only for saving 
> ports on switch?Well,Our switch have 64 ports,If we use the  two ports on 
> switch per F/X pair Concept,the ports may be sufficient .
> This is the question asked by my teacher Wu fengquan. As PAPER provide a lot 
> of ruby control scripts online to use,I'd rather use this model exits 
> now.What should I say to him?Is there some more advantages to use this 
> structure instead the two ports on switch per F/X pair?
> Thanks for your help!
> Best wishes!
> Peter
> 
> 
> 
> 




Re: [casper] NFS setup: TFTP permissions problem

2014-12-02 Thread Marc Welz
On Tue, Dec 2, 2014 at 2:07 PM, Michael D'Cruze <
michael.dcr...@postgrad.manchester.ac.uk> wrote:

>
> Starting dnsmasq:
> dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission
> denied
>[FAILED]
>

In addition to whatever SElinux has gotten in the way, there is also the
fact that recent kernels have changed the behaviour of symbolic links of
different uids - I think somewhere in /proc there is an option to deal with
that - unclear if you will encounter that, but something to remember

regards

marc


Re: [casper] NFS setup: TFTP permissions problem

2014-12-02 Thread Michael D'Cruze
Hi Jack,


I'm running RHEL 6.6.


THanks

Michael


From: Jack Hickish 
Sent: Tuesday, December 2, 2014 2:15 PM
To: Michael D'Cruze; casper@lists.berkeley.edu
Subject: Re: [casper] NFS setup: TFTP permissions problem

Hi Michael,

Do you have SELinux running? I've just checked and I get a similar permissions 
error if I reactivate SELinux on my Centos 6 server.

On Tue Dec 02 2014 at 14:07:45 Michael D'Cruze 
mailto:michael.dcr...@postgrad.manchester.ac.uk>>
 wrote:

Hi everyone


I'm following the NFS setup guide, and have come across a problem with the 
/srv/roach_boot/boot directory permissions. I restart the dnsmasq service and 
receive the following error:


Starting dnsmasq:
dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission denied
   [FAILED]


The output of ls -l from /srv/roach_boot is


[root@roach-workstation roach_boot]# ls -l
total 8
drwxrwxrwx.  2 root root 4096 Dec  1 16:31 boot
drwxrwxrwx. 23 root root 4096 Feb  2  2009 etch


and from within /boot is


[root@roach-workstation boot]# ls -l
total 1360
-rwxrwxrwx. 1 michael michael 1390149 Dec  1 15:35 uImage-20110812-mmcomitfix


The output of ls --context from within /boot is


[root@roach-workstation boot]# ls --context
-rwxrwxrwx. michael michael unconfined_u:object_r:tftpdir_t:s0 
uImage-20110812-mmcomitfix


All of these permissions and contexts look correct according to the guideso 
I'm at a bit of a loss. Has anyone seen this problem before, given all of the 
above conditions?


Does the /boot directory have to have the same context as the uImage file 
within it?


Suggestions or guidance greatly appreciated.


Michael


Re: [casper] NFS setup: TFTP permissions problem

2014-12-02 Thread Jack Hickish
Hi Michael,

Do you have SELinux running? I've just checked and I get a similar
permissions error if I reactivate SELinux on my Centos 6 server.

On Tue Dec 02 2014 at 14:07:45 Michael D'Cruze <
michael.dcr...@postgrad.manchester.ac.uk> wrote:

>  Hi everyone
>
>
>  I'm following the NFS setup guide, and have come across a problem with
> the /srv/roach_boot/boot directory permissions. I restart the dnsmasq
> service and receive the following error:
>
>
>  Starting dnsmasq:
> dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission
> denied
>[FAILED]
>
>
>  The output of ls -l from /srv/roach_boot is
>
>
>  [root@roach-workstation roach_boot]# ls -l
> total 8
> drwxrwxrwx.  2 root root 4096 Dec  1 16:31 boot
> drwxrwxrwx. 23 root root 4096 Feb  2  2009 etch
>
>
>  and from within /boot is
>
>
>  [root@roach-workstation boot]# ls -l
> total 1360
> -rwxrwxrwx. 1 michael michael 1390149 Dec  1 15:35
> uImage-20110812-mmcomitfix
>
>
>  The output of ls --context from within /boot is
>
>
>  [root@roach-workstation boot]# ls --context
> -rwxrwxrwx. michael michael unconfined_u:object_r:tftpdir_t:s0
> uImage-20110812-mmcomitfix
>
>
>  All of these permissions and contexts look correct according to the
> guideso I'm at a bit of a loss. Has anyone seen this problem before,
> given all of the above conditions?
>
>
>  Does the /boot directory have to have the same context as the uImage
> file within it?
>
>
>  Suggestions or guidance greatly appreciated.
>
>
>  Michael
>


[casper] NFS setup: TFTP permissions problem

2014-12-02 Thread Michael D'Cruze
Hi everyone


I'm following the NFS setup guide, and have come across a problem with the 
/srv/roach_boot/boot directory permissions. I restart the dnsmasq service and 
receive the following error:


Starting dnsmasq:
dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission denied
   [FAILED]


The output of ls -l from /srv/roach_boot is


[root@roach-workstation roach_boot]# ls -l
total 8
drwxrwxrwx.  2 root root 4096 Dec  1 16:31 boot
drwxrwxrwx. 23 root root 4096 Feb  2  2009 etch


and from within /boot is


[root@roach-workstation boot]# ls -l
total 1360
-rwxrwxrwx. 1 michael michael 1390149 Dec  1 15:35 uImage-20110812-mmcomitfix


The output of ls --context from within /boot is


[root@roach-workstation boot]# ls --context
-rwxrwxrwx. michael michael unconfined_u:object_r:tftpdir_t:s0 
uImage-20110812-mmcomitfix


All of these permissions and contexts look correct according to the guideso 
I'm at a bit of a loss. Has anyone seen this problem before, given all of the 
above conditions?


Does the /boot directory have to have the same context as the uImage file 
within it?


Suggestions or guidance greatly appreciated.


Michael


Re: [casper] ROACH serial connection issues

2014-12-02 Thread Norbert Bonnici
Hi Marc,

The USB dongle's baud rate should have been set properly. When set to
different baud rates no readable data is received through the serial
port. Added line wrapping but it didn't change anything.

In addition, recently the communications are being disabled when the
GND wire is connected to the USB dongle. Data is only being received
when only the tx and rx wires are connected.

Regards,
Norbert

On 2 December 2014 at 08:39, Marc Welz  wrote:
>
>
> On Mon, Dec 1, 2014 at 2:47 PM, Norbert Bonnici
>  wrote:
>>
>> Dear Marc,
>>
>> I've have tried all the possible CR+LF combinations.
>
>
>>
>> Any ideas?
>
>
> Then I am not sure - I know that some USB dongles attempt to autodetect the
> serial
> speed - maybe something is going wrong there ? Also, maybe enable line
> wrapping (Control-A W) might help.
>
> BTW:  CC'ing the mailing list is good form - it helps others who might have
> the same problem, and you might also get suggestions from other people
>
> regards
>
> marc
>
>