Re: How to validate records in Hadoop using NiFi?

2016-01-10 Thread sudeep mishra
Thank you Joe. Sqoop to HDFS data load is outside the NiFi flow. Once the data is pushed to HDFS then I have to process each record and perform validations. By Validation i meant that we will be picking a particular column for each record store in HDFS and the performing a SQL query against anoth

Re: How to validate records in Hadoop using NiFi?

2016-01-09 Thread Joe Witt
Hello Sudeep, "Which NiFi processor can I use to split each record (separated by a new line character)" For this the SplitText processor is rather helpful if you want to split each line. I recommend you do two SplitText processors in a chain where one splits on every 1000 lines for example and

Fwd: How to validate records in Hadoop using NiFi?

2016-01-09 Thread sudeep mishra
Hi, I am pushing some database records into HDFS using Sqoop. I want to perform some validations on each record in the HDFS data. Which NiFi processor can I use to split each record (separated by a new line character) and perform validations? For validations I want to verify a particular column