It's good for you, Ryan, because there are many alternatives.

FYI, Apache Spark 3.0.0 is using Apache Hive 2.3.7.
And, everything is running in a local mode on the single container.

> Would the entire thing run within the same container and then I leverage
the Spark APIs from that in local mode?

More simply, you can generate a minimal scala script on the runtime like
the following and run via Spark shell in that container.

    $ cat hello.scala
    print("a")
    $ bin/spark-shell -I hello.scala

Bests,
Dongjoon.


On Wed, Jul 15, 2020 at 7:34 PM Matt Burgess <mattyb...@apache.org> wrote:

> Ryan,
>
> It's possible there are some changes that would cause that code not to
> compile for Hive 2, but I have done some work with porting similar
> processors to Hive 2 and as I recall it was mostly API-type breaking
> changes and not so much from the behavior side of things, more of a
> Maven and Java-package-name kind of thing.
>
> Regards,
> Matt
>
> On Wed, Jul 15, 2020 at 8:39 PM Ryan Schachte
> <coderyanschac...@gmail.com> wrote:
> >
> > Great, thanks Matt! Looking at this code now and feel this will really
> help
> > me a lot. Anything you think would break using this logic for Hive 2.3.5?
> >
> > On Wed, Jul 15, 2020 at 5:04 PM Matt Burgess <mattyb...@apache.org>
> wrote:
> >
> > > Ryan,
> > >
> > > In Apache NiFi we have a ConvertAvroToOrc processor [1], you may find
> > > code there that you can use in your Java program (take a look at line
> > > 212 and down). We had to create our own OrcFileWriter because the one
> > > in Apache ORC writes to a FileSystem where we needed to write to our
> > > own FlowFile component. But all the relevant code should be there (you
> > > can replace the createWriter() call with the normal ORC one); one
> > > caveat is that it's for Apache Hive 1.2, you may need to make changes
> > > if you're using Hive 3 libraries for example.
> > >
> > > Regards,
> > > Matt
> > >
> > > [1]
> > >
> https://github.com/apache/nifi/blob/main/nifi-nar-bundles/nifi-hive-bundle/nifi-hive-processors/src/main/java/org/apache/nifi/processors/hive/ConvertAvroToORC.java
> > >
> > > On Wed, Jul 15, 2020 at 4:51 PM Ryan Schachte
> > > <coderyanschac...@gmail.com> wrote:
> > > >
> > > > I'm writing a standalone Java process and interested in converting
> the
> > > > consumed Avro messages to ORC. I've seen a plethora of examples of
> > > writing
> > > > to ORC, but the conversion to ORC from Avro is what I can't seem to
> find
> > > a
> > > > lot of examples of.
> > > >
> > > > This is just a standard Java process running inside of a container.
> > >
>

Reply via email to