Hello,

I have resolved the issue quoted below.
It was caused by the naming of the FlowFiles: instead of
clickinfo-10:41:09.json, if I name the file
clickinfo_10.41.09.json, then it works!
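
For anyone hitting the same exception, here is a minimal sketch in plain Java (JDK only, no NiFi or Hadoop classes) of why the colon in the filename seems to matter. It assumes, as a simplification, that the path parser treats everything before the first ':' as a URI scheme when no '/' comes before it, which matches the java.net.URI.checkPath frame in the stack trace below. The class name FlowFileNameCheck and the helpers failsAsPath and sanitize are made up for illustration and are not part of NiFi or Hadoop.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class FlowFileNameCheck {

    // Simplified assumption of the path parsing step: text before the first ':'
    // becomes the URI scheme if no '/' appears before that ':'.
    static boolean failsAsPath(String name) {
        int colon = name.indexOf(':');
        int slash = name.indexOf('/');
        String scheme = null;
        String path = name;
        if (colon != -1 && (slash == -1 || colon < slash)) {
            scheme = name.substring(0, colon);
            path = name.substring(colon + 1);
        }
        try {
            // java.net.URI rejects a URI that has a scheme but whose path
            // does not start with '/' ("Relative path in absolute URI").
            new URI(scheme, null, path, null, null);
            return false;
        } catch (URISyntaxException e) {
            return true;
        }
    }

    // Hypothetical helper: replace colons so no part of the name is read as a scheme.
    static String sanitize(String name) {
        return name.replace(':', '.');
    }

    public static void main(String[] args) {
        String original = "clickinfo-10:41:09.json";
        String renamed = sanitize(original);
        System.out.println(original + " fails: " + failsAsPath(original));
        System.out.println(renamed + " fails: " + failsAsPath(renamed));
    }
}
```

In a flow, the same renaming could presumably be done on the filename attribute before the FlowFile reaches ConvertAvroToORC; the sketch only shows the rule, not where to apply it.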

However, I need help with the "*MergeContent*" processor (I have used it
successfully with Avro and JSON, though).
When I merge n ORC FlowFiles into 1 ORC FlowFile and write it to the
HDFS location of a Hive external table, a Hive query returns only
the 1st ORC record; the other 2 are missing!
If I inspect the merged ORC file, though, I can see all 3 records.
I am using "Bin-Packing Algorithm", "Binary Concatenation", and "Keep
Only Common Attributes" as the main properties.
Any help would really be appreciated.


Thank you!
______________________

*Kind Regards,*
*Anshuman Ghosh*
*Contact - +49 179 9090964*


On Mon, Mar 27, 2017 at 1:36 PM, Anshuman Ghosh <
[email protected]> wrote:

> Hello everyone,
>
> We are trying to convert some Avro data to ORC before storing it on HDFS
> for a Hive external table.
> As far as the properties are concerned, the only thing added is the value for "ORC
> Configuration Resources", which is set to
> */etc/hive/conf.cloudera.hive/hive-site.xml*
> However, we are getting the following error from the processor.
>
> Do we need to provide the Hadoop path somewhere?
>
> 2017-03-27 11:10:37,308 ERROR [Timer-Driven Process Thread-3]
> o.a.n.processors.hive.ConvertAvroToORC 
> ConvertAvroToORC[id=1016116e-1621-1188-3608-385ba348c790]
> ConvertAvroToORC[id=1016116e-1621-1188-3608-385ba348c790] failed to
> process session due to java.lang.IllegalArgumentException:
> java.net.URISyntaxException: Relative path in absolute URI:
> clickinfo-10:41:09.json: java.lang.IllegalArgumentException:
> java.net.URISyntaxException: Relative path in absolute URI:
> clickinfo-10:41:09.json
> 2017-03-27 11:10:37,310 ERROR [Timer-Driven Process Thread-3]
> o.a.n.processors.hive.ConvertAvroToORC
> java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative
> path in absolute URI: clickinfo-10:41:09.json
>         at org.apache.hadoop.fs.Path.initialize(Path.java:206) ~[na:na]
>         at org.apache.hadoop.fs.Path.<init>(Path.java:172) ~[na:na]
>         at org.apache.nifi.processors.hive.ConvertAvroToORC.lambda$
> onTrigger$0(ConvertAvroToORC.java:221) ~[na:na]
>         at org.apache.nifi.controller.repository.
> StandardProcessSession.write(StandardProcessSession.java:2578)
> ~[nifi-framework-core-1.1.1.jar:1.1.1]
>         at org.apache.nifi.processors.hive.ConvertAvroToORC.
> onTrigger(ConvertAvroToORC.java:207) ~[na:na]
>         at 
> org.apache.nifi.processor.AbstractProcessor.onTrigger(AbstractProcessor.java:27)
> ~[nifi-api-1.1.1.jar:1.1.1]
>         at org.apache.nifi.controller.StandardProcessorNode.onTrigger(
> StandardProcessorNode.java:1099) ~[nifi-framework-core-1.1.1.jar:1.1.1]
>         at org.apache.nifi.controller.tasks.ContinuallyRunProcessorTask.
> call(ContinuallyRunProcessorTask.java:136) [nifi-framework-core-1.1.1.
> jar:1.1.1]
>         at org.apache.nifi.controller.tasks.ContinuallyRunProcessorTask.
> call(ContinuallyRunProcessorTask.java:47) [nifi-framework-core-1.1.1.
> jar:1.1.1]
>         at org.apache.nifi.controller.scheduling.
> TimerDrivenSchedulingAgent$1.run(TimerDrivenSchedulingAgent.java:132)
> [nifi-framework-core-1.1.1.jar:1.1.1]
>         at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> [na:1.8.0_111]
>         at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
> [na:1.8.0_111]
>         at java.util.concurrent.ScheduledThreadPoolExecutor$
> ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
> [na:1.8.0_111]
>         at java.util.concurrent.ScheduledThreadPoolExecutor$
> ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
> [na:1.8.0_111]
>         at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> [na:1.8.0_111]
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> [na:1.8.0_111]
>         at java.lang.Thread.run(Thread.java:745) [na:1.8.0_111]
> Caused by: java.net.URISyntaxException: Relative path in absolute URI:
> clickinfo-10:41:09.json
>         at java.net.URI.checkPath(URI.java:1823) ~[na:1.8.0_111]
>         at java.net.URI.<init>(URI.java:745) ~[na:1.8.0_111]
>         at org.apache.hadoop.fs.Path.initialize(Path.java:203) ~[na:na]
>         ... 16 common frames omitted
>
>
>
>
> Thank you in advance!
>
> ______________________
>
> *Kind Regards,*
> *Anshuman Ghosh*
> *Contact - +49 179 9090964*
>
>
