Hi,

I have log files with lines that begin with a time stamp. However those
lines continue onto new lines representing Java stack traces. I want to be
able to search for the line & then pull out the corresponding stack traces.

I was thinking of using either take(n) or reduce to 'peek' ahead at the
next line in the RDD to see if belongs to the stack trace. Another approach
I considered was using accumulators.

What would be the idiomatic way to achieve this in Spark?

Thanks in advance,

Regards,

Kevin.

Reply via email to