Yevgeny -

Could Mellanox update the FAQ item about this?

Large-memory nodes are becoming more common.


On Nov 3, 2012, at 6:33 PM, Yevgeny Kliteynik wrote:

> Hi Paul,
> 
> On 10/31/2012 10:22 PM, Paul Kapinos wrote:
>> Hello Yevgeny, hello all,
>> 
>> Yevgeny, first of all thanks for explaining what the MTT parameters do and 
>> why there are two of them! I mean this post:
>> http://www.open-mpi.org/community/lists/devel/2012/08/11417.php
>> 
>> Well, the official recommendation is "twice the RAM amount".
>> 
>> And here we are: we have 2 nodes with 2 TB (that with a 'tera') RAM and a 
>> couple of nodes with 1TB, each with 4x Mellanox IB adapters. Thus we should 
>> have raised the MTT parameters in order to make up to 4 TB memory 
>> registrable.
> 
> You don't really *have* to be able to register twice the available RAM.
> It's just heuristics. It depends on the application that you're running
> and fragmentation that it creates in the MTT.
> 
> However:
> 
>> I've tried to raise the MTT parameters in multiple combinations, but the 
>> maximum amount of registrable memory I was able to get was one TB (23 / 5). 
>> All tries to get more (24/5, 23/6 for 2 TB) lead to not responding 
>> InfiniBand HCAs.
>> 
>> Is there any another limits in the kernel have to be adjusted in order to be 
>> able to register that a bunch of memory?
> 
> Unfortunately, current driver has a limitation in this area so 1TB 
> (23/5 values) is probably the top what the driver can do.
> IIRC, log_num_mtt can reach 26, so perhaps you can try 26/2 (same 1TB),
> and then, if it works, try 26/3 (fingers crossed), which will bring you
> to 2 TB, but I'm not sure it will work.
> 
> This has already been fixed, and the fix was accepted to the upstream
> Linux kernel, so it will be included in the next OFED/MLNX_OFED versions.
> 
> -- YK
> 
> 
>> Best,
>> 
>> Paul Kapinos
>> 
>> 
>> 
>> 
>> 
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/


Reply via email to