[jira] [Comment Edited] (HADOOP-14600) LocatedFileStatus constructor forces RawLocalFS to exec a process to get the permissions
[ https://issues.apache.org/jira/browse/HADOOP-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16273970#comment-16273970 ] Ping Liu edited comment on HADOOP-14600 at 12/1/17 5:47 AM: Just verified. There is no error! I missed {{-Pnative}} in Maven build that is required profile to generate JNI native code. Now after built with {{-Pnative}}, things look good. I tried the patch on IntelliJ in both Windows and Linux and made sure seeing the code flow into the test cases. Also tested command line console. I am attaching the command line test results from both Windows and Linux (see attachments: {{command_line_test_result__linux.txt}}, {{command_line_test_result__windows.txt}}). cc: [~chris.douglas], [~steve_l] was (Author: myapachejira): Just verified. There is no error! I missed {{-Pnative}} in Maven build that is required profile to generate JNI native code. Now things look good. I tried the patch on IntelliJ in both Windows and Linux and made sure seeing the code flow into the test cases. Also tested command line console. I am attaching the command line test results from both Windows and Linux (see attachments: {{command_line_test_result__linux.txt}}, {{command_line_test_result__windows.txt}}). cc: [~chris.douglas], [~steve_l] > LocatedFileStatus constructor forces RawLocalFS to exec a process to get the > permissions > > > Key: HADOOP-14600 > URL: https://issues.apache.org/jira/browse/HADOOP-14600 > Project: Hadoop Common > Issue Type: Bug > Components: fs >Affects Versions: 2.7.3 > Environment: file:// in a dir with many files >Reporter: Steve Loughran >Assignee: Ping Liu > Attachments: HADOOP-14600.001.patch, HADOOP-14600.002.patch, > HADOOP-14600.003.patch, HADOOP-14600.004.patch, HADOOP-14600.005.patch, > HADOOP-14600.006.patch, HADOOP-14600.007.patch, HADOOP-14600.008.patch, > HADOOP-14600.009.patch, TestRawLocalFileSystemContract.java, > command_line_test_result__linux.txt, command_line_test_result__windows.txt > > > Reported in SPARK-21137. a {{FileSystem.listStatus}} call really craws > against the local FS, because {{FileStatus.getPemissions}} call forces > {{DeprecatedRawLocalFileStatus}} tp spawn a process to read the real UGI > values. > That is: for every other FS, what's a field lookup or even a no-op, on the > local FS it's a process exec/spawn, with all the costs. This gets expensive > if you have many files. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Comment Edited] (HADOOP-14600) LocatedFileStatus constructor forces RawLocalFS to exec a process to get the permissions
[ https://issues.apache.org/jira/browse/HADOOP-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16263821#comment-16263821 ] Ping Liu edited comment on HADOOP-14600 at 11/23/17 5:01 AM: - [~chris.douglas] Finally, this round is green. That's great! Do you still need me verify it? If so, I will try to work on it during this weekend. was (Author: myapachejira): [~chris.douglas] Finally, this round is green. That's great! Do you still need me verify it? If so, I need learn how to use "git apply " :) > LocatedFileStatus constructor forces RawLocalFS to exec a process to get the > permissions > > > Key: HADOOP-14600 > URL: https://issues.apache.org/jira/browse/HADOOP-14600 > Project: Hadoop Common > Issue Type: Bug > Components: fs >Affects Versions: 2.7.3 > Environment: file:// in a dir with many files >Reporter: Steve Loughran >Assignee: Ping Liu > Attachments: HADOOP-14600.001.patch, HADOOP-14600.002.patch, > HADOOP-14600.003.patch, HADOOP-14600.004.patch, HADOOP-14600.005.patch, > HADOOP-14600.006.patch, HADOOP-14600.007.patch, HADOOP-14600.008.patch, > HADOOP-14600.009.patch, TestRawLocalFileSystemContract.java > > > Reported in SPARK-21137. a {{FileSystem.listStatus}} call really craws > against the local FS, because {{FileStatus.getPemissions}} call forces > {{DeprecatedRawLocalFileStatus}} tp spawn a process to read the real UGI > values. > That is: for every other FS, what's a field lookup or even a no-op, on the > local FS it's a process exec/spawn, with all the costs. This gets expensive > if you have many files. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Comment Edited] (HADOOP-14600) LocatedFileStatus constructor forces RawLocalFS to exec a process to get the permissions
[ https://issues.apache.org/jira/browse/HADOOP-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236837#comment-16236837 ] Chris Douglas edited comment on HADOOP-14600 at 11/2/17 11:51 PM: -- Just skimmed the patch, but this line jumped out: {noformat} +cleanup: + if (ret == NULL) { +if (path) + (*env)->ReleaseStringChars(env, j_path, (const jchar*) path); {noformat} Shouldn't {{path}} be released if not null, even if {{ret != NULL}} ? Checkstyle output is gone, but it looks like {{Helper.java}} is not indented correctly. Other than that, this looks good. was (Author: chris.douglas): Just skimmed the patch, but this line jumped out: {{noformat}} +cleanup: + if (ret == NULL) { +if (path) + (*env)->ReleaseStringChars(env, j_path, (const jchar*) path); {{noformat}} Shouldn't {{path}} be released if not null, even if {{ret != NULL}} ? Checkstyle output is gone, but it looks like {{Helper.java}} is not indented correctly. Other than that, this looks good. > LocatedFileStatus constructor forces RawLocalFS to exec a process to get the > permissions > > > Key: HADOOP-14600 > URL: https://issues.apache.org/jira/browse/HADOOP-14600 > Project: Hadoop Common > Issue Type: Bug > Components: fs >Affects Versions: 2.7.3 > Environment: file:// in a dir with many files >Reporter: Steve Loughran >Assignee: Ping Liu >Priority: Major > Attachments: HADOOP-14600.001.patch, HADOOP-14600.002.patch, > HADOOP-14600.003.patch, HADOOP-14600.004.patch, HADOOP-14600.005.patch, > TestRawLocalFileSystemContract.java > > > Reported in SPARK-21137. a {{FileSystem.listStatus}} call really craws > against the local FS, because {{FileStatus.getPemissions}} call forces > {{DeprecatedRawLocalFileStatus}} tp spawn a process to read the real UGI > values. > That is: for every other FS, what's a field lookup or even a no-op, on the > local FS it's a process exec/spawn, with all the costs. This gets expensive > if you have many files. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Comment Edited] (HADOOP-14600) LocatedFileStatus constructor forces RawLocalFS to exec a process to get the permissions
[ https://issues.apache.org/jira/browse/HADOOP-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16150811#comment-16150811 ] John Zhuge edited comment on HADOOP-14600 at 9/1/17 4:45 PM: - Please follow the guidelines in https://wiki.apache.org/hadoop/HowToContribute#Naming_your_patch. The patch file name should be: *HADOOP-14600.NNN.patch*, e.g., *HADOOP-14600.001.patch*. was (Author: jzhuge): Please follow the guidelines in https://wiki.apache.org/hadoop/HowToContribute#Naming_your_patch. The patch file name should be: *HADOOP-14600.001.patch*. > LocatedFileStatus constructor forces RawLocalFS to exec a process to get the > permissions > > > Key: HADOOP-14600 > URL: https://issues.apache.org/jira/browse/HADOOP-14600 > Project: Hadoop Common > Issue Type: Bug > Components: fs >Affects Versions: 2.7.3 > Environment: file:// in a dir with many files >Reporter: Steve Loughran >Assignee: Ping Liu > > Reported in SPARK-21137. a {{FileSystem.listStatus}} call really craws > against the local FS, because {{FileStatus.getPemissions}} call forces > {{DeprecatedRawLocalFileStatus}} tp spawn a process to read the real UGI > values. > That is: for every other FS, what's a field lookup or even a no-op, on the > local FS it's a process exec/spawn, with all the costs. This gets expensive > if you have many files. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org