Thanks. Now that I know what to look for, I should be able to figure it
out.
BTW, I switched the script that ultimately runs the mpiexec from tcsh to
bash and the problem went away. Not complaining but do you have any idea
why that might be?

Gene  
--      Eugene M Fluder, Jr, PhD
        Computational Scientist
        Scientific Computing

        The Mt. Sinai School of Medicine
        One Gustave L. Levy Place, Box 1498
        New York, NY 10029-6574

        T: 212 659 8608
        F: 646 537 8660
        E: eugene.flu...@mssm.edu










On 7/6/12 10:57 AM, "Holger Mickler" <holger.mick...@tu-dresden.de> wrote:

>Oh, I just realized that you are probably using the Open MPI version of
>VT which
>builds as part of the Open MPI build. I'm not 100% sure if the
>modification of
>config.h works as laid out, but it should... you need to look out for VT's
>config.h then, not Open MPI's.
>
>Holger
>
>
>On 07/06/2012 04:54 PM, Holger Mickler wrote:
>> Hi Gene,
>> 
>> this error is often caused by insufficiently synchronized TSCs (time
>>stamp
>> counter) of different processors/cores.
>> When VT uses the TSC for timing the events (it does that by default),
>>and the
>> processes switch to another core during execution, it may well happen
>>that the
>> next recorded time stamp is earlier in time than the last one.
>> 
>> One possibility to avoid this situation is pinning the processes to
>>cores - Open
>> MPI has functionality for realizing this, see
>> http://www.open-mpi.org/faq/?category=tuning#using-paffinity
>> 
>> If this is not feasible, you may use another clock source with VT which
>>provides
>> global time. To do this, you need to compile another version of VT. Run
>> configure as usual, then edit config.h: replace the value of
>> #define TIMER [...]
>> with e.g.
>> #define TIMER TIMER_CLOCK_GETTIME
>> or
>> #define TIMER TIMER_GETTIMEOFDAY
>> depending on what is available on your system. Be aware that the
>>resolution of
>> those clocks is not as high as the TSC's.
>> 
>> Have a look inside config.h at the place of the mentioned variables -
>>there is
>> some documentation there.
>> Afterwards, compile and install VT. Using this version, you should not
>>encounter
>> the errors anymore.
>> 
>> Regards,
>>   Holger
>> 
>> 
>> 
>> 
>> On 07/06/2012 04:04 PM, Fluder, Eugene wrote:
>>> I got the following error running a VT enabled run of AMBER. This was
>>>reported
>>> in December of 2009  under almost identical conditions but the thread
>>>does not
>>> contain a resolution. I reran the test with VT_UNIFY=no and it
>>>completed
>>> normally. The same error occurred when I ran vtunify separately. Any
>>>help?
>>>
>>> Was this ever resolved?
>>>
>>> Gene
>>>
>>> [fludee01@node7-10 trace_noiox]$ vtunify 8 a
>>> OTF ERROR in function OTF_WBuffer_setTimeAndProcess, file:
>>>OTF_WBuffer.c, line: 308:
>>>  time not increasing. (t= 99459634, p= 6)
>>> vtunify: Error: Could not read events of OTF stream [namestub
>>>./a__ufy.tmp id 6]
>>> OTF ERROR in function OTF_WBuffer_setTimeAndProcess, file:
>>>OTF_WBuffer.c, line: 308:
>>>  time not increasing. (t= 105413860, p= 5)
>>> vtunify: Error: Could not read events of OTF stream [namestub
>>>./a__ufy.tmp id 5]
>>> OTF ERROR in function OTF_WBuffer_setTimeAndProcess, file:
>>>OTF_WBuffer.c, line: 308:
>>>  time not increasing. (t= 103189146, p= 7)
>>> vtunify: Error: Could not read events of OTF stream [namestub
>>>./a__ufy.tmp id 7]
>>> OTF ERROR in function OTF_WBuffer_setTimeAndProcess, file:
>>>OTF_WBuffer.c, line: 308:
>>>  time not increasing. (t= 100509810, p= 8)
>>> vtunify: Error: Could not read events of OTF stream [namestub
>>>./a__ufy.tmp id 8]
>>> vtunify: An error occurred during unifying events - Terminating ...
>>>
>>> -- /Eugene M Fluder, Jr, PhD/
>>> /Computational Scientist/
>>> /Scientific Computing/
>>> /
>>> /
>>> /The Mt. Sinai School of Medicine/
>>> /One Gustave L. Levy Place, Box 1498/
>>>
>>> /New York, NY 10029-6574/
>>>
>>> /
>>> /
>>>
>>> /T: 212 659 8608/
>>>
>>> /F: 646 537 8660/
>>>
>>> /E: eugene.flu...@mssm.edu/
>>>
>>> /
>>> /
>>>
>>> //
>>>
>>>
>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> devel mailing list
>>> de...@open-mpi.org
>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
>-- 
>Dipl.-Inf. Holger Mickler
>
>Technische Universität Dresden
>Center for Information Services
>and High Performance Computing (ZIH)
>01062 Dresden
>Germany
>
>Office:  Willers-Bau (WIL) A36
>Tel.:    +49 (351) 463-37903
>Fax:     +49 (351) 463-37773
>E-Mail:  holger.mick...@tu-dresden.de
>
>
>_______________________________________________
>devel mailing list
>de...@open-mpi.org
>http://www.open-mpi.org/mailman/listinfo.cgi/devel


Reply via email to