Christian Müller created CAMEL-17861:
----------------------------------------

             Summary: Streaming  in Azure (Blob-Storage)  component not working 
                 Key: CAMEL-17861
                 URL: https://issues.apache.org/jira/browse/CAMEL-17861
             Project: Camel
          Issue Type: Bug
          Components: camel-azure
    Affects Versions: 3.14.1
            Reporter: Christian Müller


As described in the email conversation below we are having memory problems with 
the current implementation of the azure (blob storage component). 
Concretely the component does not stream properly!

_But looking at this stacktrace and the corresponding sourcecode it’s obvious 
that the whole stream is read to memory to check the total payload size (seems 
necessary for the azure client)_
As we transfer mass data with the azure component we consider this a major bug 
as we cannot use the azure-component as long as it does not stream properly. 

Thx and Regards Christian

Email History (camel user mailing list): 
*Response from Claus Ibsen:* 
What are the sources of those streams?
I wonder if we could enrich from the message some sort of total size header
into the camel blob producer, so it can tell the blob client the expected
length, so it does not read the stream itself to find out.
Also if you have the opportunity you are welcome to test with latest Camel
3.9.0 release, if its still a problem.
 
And you are welcome to create a JIRA as it would be great to have streaming
work well with azure, especially for blob as its supposed to be also big
blobs of data ;)

*initial question from Lukas Angerer:* 
We are transferring lots of data to the azure-storage with the 
azure-storage-blob component (version 3.7.0)
The Route itself is only working with streams to keep the memory overhead low, 
streamcaching is enabled.

But looking at this stacktrace and the corresponding sourcecode it’s obvious 
that the whole stream is read to memory to check the total payload size (seems 
necessary for the azure client)

 

Caused by: java.lang.OutOfMemoryError: Java heap space

            at 
org.apache.commons.io.output.AbstractByteArrayOutputStream.toByteArrayImpl(AbstractByteArrayOutputStream.java:366)

            at 
org.apache.commons.io.output.ByteArrayOutputStream.toByteArray(ByteArrayOutputStream.java:163)

            at org.apache.commons.io.IOUtils.toByteArray(IOUtils.java:2241)

            at 
org.apache.camel.component.azure.storage.blob.BlobUtils.getInputStreamLength(BlobUtils.java:37)

            at 
org.apache.camel.component.azure.storage.blob.BlobStreamAndLength.createBlobStreamAndLengthFromExchangeBody(BlobStreamAndLength.java:50)

            at 
org.apache.camel.component.azure.storage.blob.operations.BlobOperations.uploadBlockBlob(BlobOperations.java:181)

            at 
org.apache.camel.component.azure.storage.blob.BlobProducer.process(BlobProducer.java:86)

            at 
org.apache.camel.support.AsyncProcessorConverterHelper$ProcessorToAsyncProcessorBridge.process(AsyncProcessorConverterHelper.java:66)

            at 
org.apache.camel.processor.SendDynamicProcessor.lambda$process$0(SendDynamicProcessor.java:195)

I was wondering if there is a better way to do this. Maybe a shortcut for the 
cached stream that just checks the size of the cache?

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to