[
https://issues.apache.org/jira/browse/MESOS-9109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16695132#comment-16695132
]
Chun-Hung Hsiao commented on MESOS-9109:
----------------------------------------
Unassigned myself for now since I'm going oncall. [~bbannier] It would be great
if you can pick this up. Otherwise I'll pick it up after my oncall rotation.
My proposed approach is in the following link:
https://lists.apache.org/thread.html/49961f30445eac7d61ef6cf1a384646760dad3754d3c332d9a9ae18a@%3Cuser.mesos.apache.org%3E
> Windows agent uses reserved character :(colon) for file name and crashes when
> attempting to remove link
> -------------------------------------------------------------------------------------------------------
>
> Key: MESOS-9109
> URL: https://issues.apache.org/jira/browse/MESOS-9109
> Project: Mesos
> Issue Type: Bug
> Components: agent
> Affects Versions: 1.7.0
> Reporter: Constantin Eduard Staniloiu
> Assignee: Chun-Hung Hsiao
> Priority: Blocker
> Labels: windows
>
> I have a hybrid cluster running Mesos Agents on Windows, and I am using
> Chronos to launch jobs on Windows Agents.
> Chronos is using the character : (colon) internally when spawning jobs. The
> Windows Mesos Agent spawns those jobs and creates the paths on disk, but when
> the job terminates and it attempts to remove the link it crashes with the
> following error message
>
> {code:java}
> I0719 09:20:00.621385 14788 gc.cpp:129] Unscheduling
> 'D:\ws\mes-wd\meta\slaves\5563b512-518e-44c6-bdc1-3c927d0622da-S1\frameworks\77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000\executors\ct:1532006400000:0
> :sample-child-job-lv2:' from gc
> I0719 09:20:00.622387 24124 slave.cpp:2406] Authorizing task
> 'ct:1532006400000:0:sample-child-job2:' for framework
> 77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000
> I0719 09:20:00.630340 24124 slave.cpp:2406] Authorizing task
> 'ct:1532006400000:0:sample-child-job-lv2:' for framework
> 77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000
> I0719 09:20:00.644341 24124 slave.cpp:2849] Launching task
> 'ct:1532006400000:0:sample-child-job2:' for framework
> 77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000
> I0719 09:20:00.649345 24124 paths.cpp:748] Creating sandbox
> 'D:\ws\mes-wd\slaves\5563b512-518e-44c6-bdc1-3c927d0622da-S1\frameworks\77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000\executors\ct:1532006400000
> :0:sample-child-job2:\runs\cecbf7ab-ace3-4f45-a208-9c104f69624c'
> F0719 09:20:00.653342 24124 paths.cpp:763] CHECK_SOME(os::rm(latest)): The
> filename, directory name, or volume label syntax is incorrect.
> Failed to remove latest symlink
> 'D:\ws\mes-wd\slaves\5563b512-518e-44c6-bdc1-3c927d0622da-S1\frameworks\77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000\executors\ct:1532006400000:0:sample-child-job2:\runs\
> latest'
> *** Check failure stack trace: ***
> {code}
>
> The problem seems to be the job name:
> {code:java}
> 'ct:1532006400000:0:sample-child-job2:'
> {code}
> Chronos is using internally : (colon) which is a reserved character on
> Windows
> [https://docs.microsoft.com/en-us/windows/desktop/FileIO/naming-a-file|http://example.com/]
>
> I believe it's the responsibility of the agent to check and sanitize the task
> names against restricted characters.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)