something not closing correctly in a session

2020-10-26 Thread Greene (US), Geoffrey N
I have a Groovy script that I am running inside of an ExecuteGroovyScript processor (NiFi v1.12.0). I've been following the examples at https://community.cloudera.com/t5/Community-Articles/ExecuteScript-Cookbook-part-2/ta-p/249018, and I have something like: flowFile = session.get() if(!flowFi

RE: [EXTERNAL] Re: something not closing correctly in a session

2020-10-26 Thread Greene (US), Geoffrey N
wFile. Thanks, Bryan On Mon, Oct 26, 2020 at 3:49 PM Greene (US), Geoffrey N wrote: > > I have a groovy scripts that I am running inside of a > ExecuteGroovyScript processor (nifi v 1.12.0) > > I’ve been following the examples at > https://community.cloudera.com/t5/Community

horizontal merge

2020-11-17 Thread Greene (US), Geoffrey N
I'm trying to glue two flow files together HORIZONTALLY. That is, flowfile1:
ID,STARTINGLETTER
1,A
2,B
And flowfile2:
ID, WORD
1,Apple
2, Ball
3, Cat
I want it to become:
ID, STARTINGLETTER, WORD
1,A,Apple
2,B,Ball
3,,Cat
The only way I've been able to figure out how to do this is to write a cus
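
The merge described above amounts to a full outer join on the ID column. Below is a minimal sketch in plain Java, outside of NiFi: the class and method names are hypothetical, the inputs are assumed to be two-column CSV text with a header row, and no NiFi session or flow file API is involved.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class HorizontalMerge {

    // Full outer join of two two-column CSV texts on their first column (the ID).
    static String merge(String left, String right) {
        String[] l = left.trim().split("\\r?\\n");
        String[] r = right.trim().split("\\r?\\n");

        // ID -> {value from left, value from right}; insertion order is preserved.
        Map<String, String[]> joined = new LinkedHashMap<>();
        for (int i = 1; i < l.length; i++) {            // index 0 is the header
            String[] cols = l[i].split(",", -1);
            joined.computeIfAbsent(cols[0].trim(), k -> new String[] {"", ""})[0] = cols[1].trim();
        }
        for (int i = 1; i < r.length; i++) {
            String[] cols = r[i].split(",", -1);
            joined.computeIfAbsent(cols[0].trim(), k -> new String[] {"", ""})[1] = cols[1].trim();
        }

        // Header: ID plus the second column name of each input.
        StringBuilder out = new StringBuilder();
        out.append("ID,")
           .append(l[0].split(",", -1)[1].trim()).append(',')
           .append(r[0].split(",", -1)[1].trim()).append('\n');
        for (Map.Entry<String, String[]> e : joined.entrySet()) {
            out.append(e.getKey()).append(',')
               .append(e.getValue()[0]).append(',')
               .append(e.getValue()[1]).append('\n');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String flowFile1 = "ID,STARTINGLETTER\n1,A\n2,B\n";
        String flowFile2 = "ID, WORD\n1,Apple\n2, Ball\n3, Cat\n";
        System.out.print(merge(flowFile1, flowFile2));
    }
}
```

An ID that appears in only one input keeps an empty cell for the other column, which is how the row `3,,Cat` arises.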

RE: [EXTERNAL] Re: horizontal merge

2020-11-17 Thread Greene (US), Geoffrey N
s less than 2, you can rollback the session and return (possibly yielding first if you don't want to check again rapidly). Regards, Matt On Tue, Nov 17, 2020 at 4:13 PM Greene (US), Geoffrey N wrote: > > I’m trying to glue two flow files together HORIZONTALLY. That is, > > Flowfil

RE: [EXTERNAL] Re: horizontal merge

2020-11-18 Thread Greene (US), Geoffrey N
nly want to transfer just the ONE out, since the data was merged in, but I can manage with two files if I have to make # inputs = # outputs Thanks -Original Message- From: Greene (US), Geoffrey N Sent: Tuesday, November 17, 2020 8:30 PM To: users@nifi.apache.org Subject: RE: [EXTERNAL] Re:

RE: [EXTERNAL] Re: horizontal merge

2020-11-18 Thread Greene (US), Geoffrey N
want a call to `session.remove(flowFile1)` instead of transferring it. Cheers, Chris Sampson On Wed, 18 Nov 2020, 20:03 Greene (US), Geoffrey N wrote: I've gotten closer with grabbing two files and processing them. I still have something wro

RE: [EXTERNAL] horizontal merge

2020-11-18 Thread Greene (US), Geoffrey N
d. You now appear to remove 1 and 2 then transfer the 3rd. That should be fine. Thanks On Wed, Nov 18, 2020 at 1:50 PM Greene (US), Geoffrey N wrote: Session.remove()! That’s very helpful, and it makes my numbers come out correctly. I’m still

CM/CI best practices

2021-01-21 Thread Greene (US), Geoffrey N
I'm trying to figure out some CI/CM best practices. I want to be able to design a flow, test the flow on some test data, then distribute that exact same configuration (definitely flows, probably services, and so on) into production. I may have multiple engineers working in this environment, a

some questions about splits

2021-02-24 Thread Greene (US), Geoffrey N
I'm having some trouble with multiple splits/merges. Here's the idea: Big data -> split 1 -> save all the fragment.* attributes into variables -> split 2 -> save all the fragment.* attributes | Split 1 | Save fragment.* attributes into split1.fragment.* | Split 2 | Save fragment.* attributes

RE: some questions about splits

2021-02-24 Thread Greene (US), Geoffrey N
helps! -Mark [1] https://www.youtube.com/watch?v=RjWstt7nRVY On Feb 24, 2021, at 4:59 PM, Greene (US), Geoffrey N wrote: I’m having some trouble with multiple splits/merges. Here’s the idea: Big data -> split 1 -> save all the fragment.*att

ScriptedLookupService

2021-02-25 Thread Greene (US), Geoffrey N
Writing my first ScriptedLookupService in groovy, and I think I have a pretty simple question: I'd like to be able to return multiple values in one lookup, but I can't figure out what my return type needs to be. String isn't right, obviously, and returning a Map isn't right. Is lookup only able t

Some SimpleRecordSchema questions

2021-02-26 Thread Greene (US), Geoffrey N
I'm writing a LookupService in Groovy for my ScriptedLookupService. It is, as everyone had suggested, significantly faster than split/merge. I'm really very close to having it working. In fact, it works fine when my is simple; a few strings. My situation, though, is that my service returns a

RE: [EXTERNAL] Re: Some SimpleRecordSchema questions

2021-03-03 Thread Greene (US), Geoffrey N
Field("allMyIntegers", RecordFieldType.ARRAY.getArrayDataType(RecordFieldType.INT.getDataType(), true))) If I misunderstood what you're trying to do please let me know and I'll help where I can. Regards, Matt On Fri, Feb 26, 2021 at 7:04 PM Greene (US), Geoffrey N wrote: > >

nifi registry throwing NPE on viewing local changes

2021-03-12 Thread Greene (US), Geoffrey N
Brand new installation of nifi-registry here (0.8.0), and nifi 1.12.0, probably screwed something on the setup. It's a vanilla installation except that I am 1) Doing this all on windows (I know, I know) 2) Using the corporate gitlab. It uses a self-signed cert, but I put the cert in

RE: [EXTERNAL] Re: nifi registry throwing NPE on viewing local changes

2021-03-12 Thread Greene (US), Geoffrey N
is the problem, but NiFi Registry is not considered supported on Windows, and there are definitely some issues with the Git provider on Windows. https://nifi.apache.org/docs/nifi-registry-docs/html/administration-guide.html#system-requirements Thanks, Bryan On Fri, Mar 12, 2021 at 4:12 PM Greene

RecordPath...With INNER records

2021-03-22 Thread Greene (US), Geoffrey N
I'm making good progress learning to use the LookupRecord processor. I have it working so that I know how to turn [{"key": "value1"}, {"key":"value2"}, {"key":"value3"}] into [{"key": "value1","enhanced":1}, {"key":"value2","enhanced":2}, {"key":"value3","enhanced":3}] BUT what if you have inner

RecordPath...With INNER records

2021-03-24 Thread Greene (US), Geoffrey N
Sorry for the resend. Thought I'd try one more time. I'm struggling with LookupRecord. I'm making good progress learning to use the LookupRecord processor. I have it working so that I know how to turn [{"key": "value1"}, {"key":"value2"}, {"key":"value3"}] into [{"key": "value1","enhanced":1}

RE: RecordPath...With INNER records

2021-03-25 Thread Greene (US), Geoffrey N
21, at 21:59, Greene (US), Geoffrey N wrote: Sorry for the resend. Thought I’d try one more time. I’m struggling with LookupRecord. I’m making good progress learning to use the LookupRecord processor. I have it working so that I know how to turn [{“key”

Some retry flowfile questions

2021-04-22 Thread Greene (US), Geoffrey N
We have a REST endpoint that is "unreliable". It works sometimes. When it doesn't work, the solution seems to be to sleep for a while, then try again. So I put in a retry processor: http processor<- Retry | \ ^ Success Failure -| So far, so good, that loop works. B

Unable to write to container default due to archive file size constraints; waiting for archive cleanup

2021-05-12 Thread Greene (US), Geoffrey N
NiFi 1.13.2 here. I need to make about 50 REST calls to an interface, but I suspect my REST service is unreliable. So I added a ControlRate processor. I tried to slow it down so that I only process one flowfile/min. I was still getting REST failures, so I decided to slow it down to only process

validating S3 upload hashes in NiFi

2021-07-06 Thread Greene (US), Geoffrey N
I know that in the S3 CLI, you can validate an object's integrity during the upload by passing in the object's MD5 hash. Does NiFi do that automatically somehow? Or, post upload, do I need to retrieve the ETag hash from S3 and then compare it with the hash of a local file somehow? How do you v
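
The post-upload comparison mentioned above can be sketched in plain Java as follows. This is only an illustration under stated assumptions: the class and method names are hypothetical, the ETag string is assumed to have been retrieved already, and the check is only meaningful for objects uploaded in a single part without SSE-KMS or SSE-C encryption. Multipart uploads produce an ETag with a `-N` suffix that is not the MD5 of the object.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

class Md5Check {

    // Lower-case hex MD5 digest of the given bytes.
    static String md5Hex(byte[] data) {
        try {
            MessageDigest md = MessageDigest.getInstance("MD5");
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest(data)) {
                hex.append(String.format("%02x", b & 0xff));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // MD5 is a required algorithm on every Java platform.
            throw new IllegalStateException(e);
        }
    }

    // Compares local content against an ETag value; S3 returns the ETag wrapped in double quotes.
    static boolean matchesETag(byte[] data, String eTag) {
        String normalized = eTag.replace("\"", "").toLowerCase();
        return md5Hex(data).equals(normalized);
    }

    public static void main(String[] args) {
        byte[] content = "hello".getBytes(StandardCharsets.UTF_8);
        System.out.println(matchesETag(content, "\"5d41402abc4b2a76b9719d911017c592\""));
    }
}
```

For a large local file, the same digest can be built incrementally with `MessageDigest.update` while streaming the file, rather than loading it into memory.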

DistributedMapCacheServer

2022-11-02 Thread Greene (US), Geoffrey N
I make heavy use of DistributedMapCacheServer in my nifi flows (one node; not clustered). I seem to remember reading that the DistributedMapCacheServer is not to be used in production; it's a reference implementation only, and it is not really recommended for production. Unfortunately, I can n

json into a json-enabled DB

2022-12-14 Thread Greene (US), Geoffrey N
Some databases (postgres, sql server, others) support native json columns. With postgres, there's a native jsonb type, with sql server it's a string type, that you can treat as json. In any event, once you have the json in the database, one can then query it, e.g.: SELECT id,product_name,

RE: [EXTERNAL] Re: json into a json-enabled DB

2022-12-16 Thread Greene (US), Geoffrey N
in any sort of consistent way. Happy to hear > your thoughts and input, perhaps we can add some ad-hoc support for > your use case? > > Regards, > Matt > > [1] https://issues.apache.org/jira/browse/NIFI-5901 > [2] https://issues.apache.org/jira/browse/NIFI-5845 > [3] h

Out of file descriptors? but not!

2023-07-03 Thread Greene (US), Geoffrey N
So, I came back from two weeks' vacation... My NiFi (1.17.0) is misbehaving. The logs say 2023-07-03 07:04:07,668 ERROR [Index Provenance Events-1] o.a.n.p.index.lucene.EventIndexTask Failed to index Provenance Events org.apache.lucene.store.AlreadyClosedException: this IndexWriter is closed ..

RE: [EXTERNAL] Re: Large json string field errors

2023-09-18 Thread Greene (US), Geoffrey N via users
Is this what you are referring to? 20MB string values vs 20 MB document size… https://github.com/FasterXML/jackson-core/issues/1082 From: Joe Witt Sent: Monday, September 18, 2023 2:25 PM To: users@nifi.apache.org Subject: [EXTERNAL] Re: Large json string field errors EXT email: be mindful of

reference implementation of getPropertyDescriptor()

2023-10-04 Thread Greene (US), Geoffrey N via users
What is the minimum or reference implementation of getPropertyDescriptor(name) in a groovy ScriptedLookup processor? I don't have any properties, and I had been doing @Override PropertyDescriptor getPropertyDescriptor(String name) { return null } But I don't think that is cor

clustering behaving inconsistently

2023-11-03 Thread Greene (US), Geoffrey N via users
I'm seeing some rather odd behavior with clustering while trying to set up my first cluster (of three nodes). I'm running NiFi 1.17.0, everything over SSL, and I'm using the embedded ZooKeeper. My certs are all signed by my org, and my truststore DOES have my org's root cert in it. My cert has

file descriptors

2024-05-18 Thread Greene (US), Geoffrey N via users
Well, I learned something I thought I'd share: https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html#configuration-best-practices states that you should increase the number of file handles. The apache guide says to do this, you change limits.conf Nonetheless, my nifi would run o

RE: [EXTERNAL] Accessing File Size in ExecuteGroovyScript script

2024-06-17 Thread Greene (US), Geoffrey N via users
I believe you are casting ff.size as an Integer. 8 GB is significantly larger than can be held in a 32-bit integer, which tops out at just 2 GB. Suggest you use a double or a big decimal. What's happening is you are truncating the high-order bits. From: James McMahon Sent: Monday, June 17, 2024 7:51
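
The truncation described above can be reproduced in a few lines of plain Java. This is a sketch only: the class and method names are hypothetical, and it shows Java's narrowing cast, on the assumption that the Groovy cast in the original script behaves the same way. A 64-bit `long` also holds an 8 GB size exactly.

```java
class FileSizeCast {

    // Narrowing conversion: keeps only the low-order 32 bits of the value.
    static int asInt(long size) {
        return (int) size;
    }

    public static void main(String[] args) {
        long eightGiB = 8L * 1024 * 1024 * 1024;   // 8589934592 = 2^33
        long threeGiB = 3L * 1024 * 1024 * 1024;   // 3221225472

        System.out.println(asInt(eightGiB));       // 0: all set bits are above bit 31
        System.out.println(asInt(threeGiB));       // -1073741824: bit 31 becomes the sign bit
        System.out.println(eightGiB);              // 8589934592: a long keeps the full value
    }
}
```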

Which is better ListSmb vs ls

2024-06-27 Thread Greene (US), Geoffrey N via users
Which is better and why: 1. Using ListSmb/FetchSMB nifi processors to get data from a samba share 2. Mounting the samba share to my linux OS, and using regular ls/fetch File. Opinions? What are the differences, performance-wise? Are there other considerations? Geoff Greene Associate

RE: PublishKafkaRecord_2_6 produces negative partition number

2024-07-11 Thread Greene (US), Geoffrey N via users
Interesting. The largest number representable by a 32-bit signed integer is 2147483647, which is exactly one less than the number you have. That sounds like a hint... Sent via the Samsung Galaxy A54 5G, an AT&T 5G smartphone Original message From: "Richter, Robert" Date: 7/11/24 5:38
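
The hint above points at 32-bit signed overflow. The plain-Java sketch below shows two well-known ways a value of magnitude 2147483648 can appear with a negative sign. Whether either one is the actual cause in the thread is an assumption; the class and method names are hypothetical, and nothing here is taken from the Kafka or NiFi code.

```java
class IntOverflowHint {

    // Adding one to the largest int wraps around to the smallest int.
    static int maxPlusOne() {
        return Integer.MAX_VALUE + 1;
    }

    // Math.abs cannot represent +2147483648 as an int, so it returns the input unchanged.
    static int absOfMin() {
        return Math.abs(Integer.MIN_VALUE);
    }

    public static void main(String[] args) {
        System.out.println(Integer.MAX_VALUE);  // 2147483647
        System.out.println(maxPlusOne());       // -2147483648
        System.out.println(absOfMin());         // -2147483648
    }
}
```

When a non-negative index is needed from an arbitrary `int`, `Math.floorMod(value, n)` returns a result in `0..n-1` for any positive `n`, including for `Integer.MIN_VALUE`.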

varying JDBC connection parameters

2024-08-23 Thread Greene (US), Geoffrey N via users
I've been introduced to a new situation, and thought someone might have some advice. Imagine a system in which live data is stored in a SQLite database. The name of the SQLite file changes every hour. The schema is always the same, but the files roll over every 60 minutes. Now imagine I have t