RE: KafkaSimpleJsonExtractor

2018-04-12 Thread Kidwell, Jack
Thank you for your time spent on our question. This quick start document, https://gobblin.readthedocs.io/en/latest/case-studies/Kafka-HDFS-Ingestion/, shows “source.class=org.apache.gobblin.source.extractor.extract.kafka.KafkaSimpleSource”. That class overrides method, getExtractor, to return

RE: KafkaSimpleJsonExtractor

2018-04-12 Thread Kidwell, Jack
We want to extract both key and value from a Kafka message and publish the combined information to HDFS. Keys contain numeric ids for the strings contained in values. Please explain “other subscriber”. Are you proposing a different Kafka message structure? KafkaSimpleJsonExtractor.java caught

Data Extraction from Oracle and Ingesting to HDFS

2018-04-12 Thread phani bhushan peddi
Hi, I am evaluating the Gobblin tool if we can use it for data extraction from Oracle. We have a use case to extract data from Oracle and ingest into Data lake on HDFS. Is there sample code ready which can be used for this purpose . I could not find anything related in Github. Please let me know.

Re: Data Extraction from Oracle and Ingesting to HDFS

2018-04-12 Thread Vicky Kak
Have you looked at this one? https://github.com/apache/incubator-gobblin/blob/master/gobblin-modules/gobblin-sql/src/main/java/org/apache/gobblin/source/extractor/extract/jdbc/OracleSource.java On Thu, Apr 12, 2018 at 9:56 PM, phani bhushan peddi wrote: > Hi, > I am

Re: KafkaSimpleJsonExtractor

2018-04-12 Thread Vicky Kak
Are you not having this configuration? *** writer.builder.class=org.apache.gobblin.writer.SimpleDataWriterBuilder writer.file.path.type=tablename writer.destination.type=HDFS writer.output. format=txt