[GitHub] piiswrong commented on a change in pull request #8662: 2bit gradient compression

GitBox Thu, 16 Nov 2017 10:29:13 -0800

piiswrong commented on a change in pull request #8662: 2bit gradient compression
URL: https://github.com/apache/incubator-mxnet/pull/8662#discussion_r151498642


 ##########
 File path: python/mxnet/kvstore.py
 ##########
 @@ -349,6 +349,77 @@ def row_sparse_pull(self, key, out=None, priority=0, 
row_ids=None):
             check_call(_LIB.MXKVStorePullRowSparse(
                 self.handle, mx_uint(len(ckeys)), ckeys, cvals, crow_ids, 
ctypes.c_int(priority)))
 
+    def set_gradient_compression(self, compression_params=(('compression', 
'2bit'),)):
+        """ Specifies type of low-bit quantization for gradient compression if 
any, \
+         and additional arguments depending on the type of compression being 
used.
+
+        2bit Gradient Compression takes a positive float `threshold`.
+        The technique works by thresholding values such that positive values 
in the
+        gradient above threshold will be set to threshold. Negative values 
whose absolute
+        values are higher than threshold, will be set to the negative of 
threshold.
+        Values whose absolute values are less than threshold will be set to 0.
+        By doing so, each value in the gradient is in one of three states. 
2bits are
+        used to represent these states, and every 16 float values in the 
original
+        gradient can be represented using one float. This compressed 
representation
+        can reduce communication costs. The difference between these 
thresholded values and
+        original values is stored at the sender's end as residual and added to 
the
+        gradient in the next iteration.
+
+        When kvstore is 'local', gradient compression is used to reduce 
communication
+        between multiple devices (gpus). Gradient is quantized on each GPU 
which
+        computed the gradients, then sent to the GPU which merges the 
gradients. This
+        receiving GPU dequantizes the gradients and merges them. Note that this
+        increases memory usage on each GPU because of the residual array 
stored.
+
+        When kvstore is 'dist', gradient compression is used to reduce 
communication
+        from worker to sender. Gradient is quantized on each worker which
+        computed the gradients, then sent to the server which dequantizes
+        this data and merges the gradients from each worker. Note that this
+        increases CPU memory usage on each worker because of the residual 
array stored.
+        Only worker to server communication is compressed in this setting.
+        If each machine has multiple GPUs, currently this GPU to GPU 
communication is
+        not compressed. Server to worker communication (in the case of pull) 
is also not
+        compressed.
+
+        To use 2bit compression, we need to specify `compression` as `2bit`.
+        Only specifying `compression` would use default value for the 
threshold.
+        To completely specify the arguments for 2bit compression, we would 
need to pass
+        a dictionary which includes `threshold` like:
+        {'compression': '2bit', 'threshold': 0.5}
+
+        Parameters
+        ----------
+        compression_params : dict
+            A dictionary specifying the type and parameters for gradient 
compression.
+            The key `compression` in this dictionary is a
+            required string argument and specifies the type of gradient 
compression.
+            Other keys in this dictionary are optional and specific to the type
+            of gradient compression. Defaults to (('compression', '2bit'),).
+            The default value is not a dict,
+            just to avoid pylint warning on dangerous default values.
+        """
+        if compression_params:
 
 Review comment:
   superfluous if?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

[GitHub] piiswrong commented on a change in pull request #8662: 2bit gradient compression

Reply via email to