Am 12.07.2014 19:16, schrieb Toralf Förster:
> On 05/02/2014 04:14 PM, Richard Weinberger wrote:
>> Am 02.05.2014 16:07, schrieb Toralf Förster:
>>> On 05/02/2014 09:46 AM, Richard Weinberger wrote:
>>>> Am 01.05.2014 23:34, schrieb Toralf Förster:
>>>>> On 05/01/2014 10:57 PM, Richard Weinberger wrote:
>>>>>> Toralf,
>>>>>>
>>>>>> Yeah, this is because trinity destroys the UML stub code.
>>>>>> Please test the attached patch, it should fix the root cause of the 
>>>>>> problem.
>>>>>>
>>>>>> Thanks,
>>>>>> //richard
>>>>>>
>>>>>
>>>>> If I do just apply fix2.patch onto latest git tree  
>>>>> v3.15-rc3-113-gba6728f then I do get after a while :
>>>>>
>>>>>  * Starting sshd ...                                                      
>>>>>                                             [ ok ]
>>>>>  * Starting local
>>>>> net.core.warnings = 0                                                     
>>>>>                                             [ ok ]
>>>>> Kernel panic - not syncing: do_syscall_stub : PTRACE_SETREGS failed, 
>>>>> errno = 3
>>>>>
>>>>> CPU: 0 PID: 1728 Comm: trinity-c0 Not tainted 
>>>>> 3.15.0-rc3-00113-gba6728f-dirty #5
>>>>> Stack:
>>>>> BUG: soft lockup - CPU#0 stuck for 22s! [trinity-c0:1728]
>>>>>
>>>>> EIP: c500:[<47c6cf00>] CPU: 0 Not tainted EFLAGS: 476af700
>>>>>     Not tainted
>>>>> EAX: 47cfc500 EBX: 0a024d00 ECX: 086c75fc EDX: 080fff88
>>>>> ESI: 0839f4bc EDI: 47cfc500 EBP: 0839f4bc DS: c500 ES: cd62
>>>>> EXT4-fs (ubda): error count: 1
>>>>> EXT4-fs (ubda): initial error at 1398962134: ext4_mb_generate_buddy:756
>>>>> EXT4-fs (ubda): last error at 1398962134: ext4_mb_generate_buddy:756
>>>>>
>>>>>
>>>>> which is a big improvement because before it crashes immediately after 
>>>>> few seconds.
>>>>>
>>>>> After applying both fixes the test case runs w/o a crash till now.
>>>>
>>>> Can you please also try fix3 (without fix1/2)?
>>>> I think I've found the other hidden issue.
>>>> So far trinity did not crash my kernel...
>>>>
>>>> Thanks,
>>>> //richard
>>>>
>>>
>>> fix3 made it - till now it runs fine.
>>> Of course the syslog of the UML guest is flooded with messages like :
>>>
>>> May  2 15:45:59 trinity kernel: BUG: Bad rss-counter state mm:47d4d8c0 
>>> idx:0 val:2
>>> May  2 15:46:00 trinity kernel: fix_range_common: failed, killing current 
>>> process: 2983
>>> May  2 15:46:00 trinity kernel: fix_range_common: failed, killing current 
>>> process: 2984
>>> May  2 15:46:30 trinity kernel: fix_range_common: failed, killing current 
>>> process: 2986
>>> May  2 15:46:30 trinity kernel: fix_range_common: failed, killing current 
>>> process: 2989
>>> May  2 15:46:30 trinity kernel: fix_range_common: failed, killing current 
>>> process: 2991
>>> May  2 15:46:32 trinity kernel: Stub registers -
>>> May  2 15:46:32 trinity kernel:         0 - 100000
>>> May  2 15:46:32 trinity kernel:         1 - 1000
>>> May  2 15:46:32 trinity kernel:         2 - 7
>>> May  2 15:46:32 trinity kernel:         3 - 11
>>> May  2 15:46:32 trinity kernel:         4 - 3
>>> May  2 15:46:32 trinity kernel:         5 - 3cbae
>>> May  2 15:46:32 trinity kernel:         6 - 100000
>>> May  2 15:46:32 trinity kernel:         7 - 7b
>>> May  2 15:46:32 trinity kernel:         8 - 7b
>>> May  2 15:46:32 trinity kernel:         9 - 0
>>> May  2 15:46:32 trinity kernel:         10 - 33
>>> May  2 15:46:32 trinity kernel:         11 - ffffffff
>>> May  2 15:46:32 trinity kernel:         12 - 100fff
>>> May  2 15:46:32 trinity kernel:         13 - 73
>>> May  2 15:46:32 trinity kernel:         14 - 10206
>>> May  2 15:46:32 trinity kernel:         15 - 101028
>>> May  2 15:46:32 trinity kernel:         16 - 7b
>>> May  2 15:46:32 trinity kernel: wait_stub_done : failed to wait for 
>>> SIGTRAP, pid = 483, n = 483, errno = 0, status = 0xb7f
>>> May  2 15:46:32 trinity kernel: BUG: Bad rss-counter state mm:47d4d8c0 
>>> idx:0 val:1
>>> May  2 15:46:32 trinity kernel: fix_range_common: failed, killing current 
>>> process: 3000
>>> May  2 15:46:33 trinity kernel: fix_range_common: failed, killing current 
>>> process: 3002
>>> May  2 15:46:33 trinity kernel: fix_range_common: failed, killing current 
>>> process: 3004
>>> May  2 15:46:33 trinity kernel: fix_range_common: failed, killing current 
>>> process: 3006
>>> May  2 15:46:34 trinity kernel: fix_range_common: failed, killing current 
>>> process: 3009
>>> May  2 15:46:34 trinity kernel: fix_range_common: failed, killing current 
>>> process: 3010
>>> May  2 15:46:34 trinity kernel: fix_range_common: failed, killing current 
>>> process: 3012
>>> May  2 15:46:35 trinity kernel: BUG: Bad rss-counter state mm:47d4d8c0 
>>> idx:0 val:2
>>> May  2 15:46:35 trinity kernel: fix_range_common: failed, killing current 
>>> process: 3015
>>>
>>>
>>>
>>> which is expected (right ?) b/c I hammered the UML with the syscall 
>>> "mremap" by 2 trinity childs for a while.
>>
>> Yeah. Maybe I find a way to prevent "BUG: Bad rss-counter state mm:47d4d8c0 
>> idx:0 val:1" too.
>>
>> Thanks for testing!
>> //richard
>>
> /me wonders, if fix3 will make it in mainline ?

It will. :)

Thanks for the kind reminder...
//richard

------------------------------------------------------------------------------
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

Reply via email to