cxzl25 commented on PR #2077:
URL: 
https://github.com/apache/incubator-celeborn/pull/2077#issuecomment-1812315913

   > Bump Hadoop client version from 3.2.4 to 3.3.6 will make Celeborn can not 
access HDFS 2.x cluster. I think it's ok to bump hadoop version but you'll need 
to test the cluster compatibility.
   
   Test file operations such as mkdir, ls, put, get, etc.
   
   
   | Hadoop Server | Hadoop client 2.x | Hadoop client 3.2.4 | Hadoop client 
3.3.6 |
   | ------------- | ----------------- | ------------------- | 
------------------- |
   | 2.2.0         | YES               | Write failed        | Write failed     
   |
   | 2.3.0         | YES               | YES                 | YES              
   |
   | 2.6.0         | YES               | YES                 | YES              
   |
   | 2.10.0        | YES               | YES                 | YES              
   |
   
   
   
   ### Write failed
   ```java
   2023-11-15 17:41:58,502 WARN hdfs.DataStreamer: DataStreamer Exception
   java.lang.NullPointerException
        at 
org.apache.hadoop.hdfs.DataStreamer.createBlockOutputStream(DataStreamer.java:1805)
        at 
org.apache.hadoop.hdfs.DataStreamer.nextBlockOutputStream(DataStreamer.java:1728)
        at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:713)
   put: java.lang.NullPointerException
   ```
   
   `nodeStorageIDs`==null  NPE
   org.apache.hadoop.hdfs.DataStreamer#createBlockOutputStream
   ```java
           new Sender(out).writeBlock(blockCopy, nodeStorageTypes[0], 
accessToken,
               dfsClient.clientName, nodes, nodeStorageTypes, null, bcs,
               nodes.length, block.getNumBytes(), bytesSent, newGS,
               checksum4WriteBlock, cachingStrategy.get(), isLazyPersistFile,
               (targetPinnings != null && targetPinnings[0]), targetPinnings,
               nodeStorageIDs[0], nodeStorageIDs);
   ```
   
   
   Storage ID was introduced in 2.3.0, causing an error when writing to HDFS 
2.2.0 if it is greater than or equal to 3.0 (HDFS-9807).
   
   
   
   HDFS-2832: Enable support for heterogeneous storages in HDFS
   https://issues.apache.org/jira/browse/HDFS-2832
   Fix Version/s: 2.3.0
   
   HDFS-9807. Add an optional StorageID to writes.
   https://issues.apache.org/jira/browse/HDFS-9807
   Fix Version/s:3.0.0-alpha4
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to