Be aware that the CSV input format extends the delimited input format. The
delimited input format splits at the line delimiter (such as \n) without
awareness of quotes. So that character can never be part of a quote...
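
To illustrate the point above, here is a minimal sketch in plain Python (not Flink code). It compares splitting at the line delimiter first, with no awareness of quotes, against a quote-aware parser. The sample data and variable names are made up for illustration.

```python
import csv
import io

# Hypothetical input: the second record has a quoted field containing "\n".
raw = 'id,text\n1,"first line\nsecond line"\n'

# Quote-unaware splitting: cut at every line delimiter, then parse fields.
# The quoted field is cut in two, so one logical record becomes two broken ones.
naive_records = [line for line in raw.split("\n") if line]

# Quote-aware parsing: the parser tracks quote state while scanning, so the
# delimiter inside the quotes stays part of the field.
quote_aware_records = list(csv.reader(io.StringIO(raw, newline="")))

print(naive_records)
print(quote_aware_records)
```

The first approach mirrors the behaviour described above only in spirit; it makes no claim about how the Flink input formats are implemented internally.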
On Mon, Aug 24, 2015 at 11:55 AM, Tamara Mendt tammyme...@gmail.com wrote:
Hi,
Okay, then I understood correctly.
My point was something different. I never said that the approach I
suggested will produce identical results to the continuous DOP 1 top-k,
because that's impossible to parallelize.
What I suggested is to apply batch (or window) updates which would
Hi,
In the example, the result is not correct because the values for a, b, c, and d
are never forwarded from instance 2 even though they would modify the
global top-k result. It works, though, if you partition by the key field
(tuple field 0, in this case) before doing the summation and local top-k. I
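
A small sketch of why partitioning by key helps, written in plain Python rather than against the Flink API. The records, the choice k = 1, and all function names are hypothetical. Each "instance" sums values per key and forwards only its local top-k, and a final step merges what was forwarded.

```python
from collections import defaultdict
import heapq


def local_top_k(records, k):
    """Sum values per key on one instance and keep only the k largest sums."""
    sums = defaultdict(int)
    for key, value in records:
        sums[key] += value
    return heapq.nlargest(k, sums.items(), key=lambda kv: kv[1])


def global_top_k(partitions, k):
    """Merge the forwarded local top-k lists and take the overall top-k."""
    merged = defaultdict(int)
    for part in partitions:
        for key, total in local_top_k(part, k):
            merged[key] += total
    return heapq.nlargest(k, merged.items(), key=lambda kv: kv[1])


def partition_by_key(records, parallelism):
    """Send all records with the same key to the same instance."""
    parts = [[] for _ in range(parallelism)]
    for key, value in records:
        # Deterministic stand-in for a hash partitioner.
        parts[sum(key.encode()) % parallelism].append((key, value))
    return parts


# Hypothetical data: key "a" has the largest total (12), split over two instances.
arbitrary_split = [
    [("x", 10), ("a", 6)],  # instance 1
    [("y", 10), ("a", 6)],  # instance 2
]
all_records = [rec for part in arbitrary_split for rec in part]

wrong = global_top_k(arbitrary_split, 1)
right = global_top_k(partition_by_key(all_records, 2), 1)
print(wrong, right)
```

With the arbitrary split, "a" is dropped by both instances because each one only sees half of its total. After partitioning by key, every local sum is already the global sum for that key, so the true top-k is always contained in the union of the local top-k lists.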
Hi Tamara,
Quoted strings should not contain the quoting character. The way to work
around this is to escape the quote characters. However, currently there is
no option to escape quotes which pretty much forbids any use of quote
characters within quoted fields. This should be fixed. I opened a
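
For readers unfamiliar with quote escaping, the sketch below shows two common conventions using Python's standard csv module. This is not the Flink CsvReader, which according to the message above had no such option at the time. The sample lines are invented.

```python
import csv
import io

# Convention 1: a quote inside a quoted field is written as two quotes.
doubled = '1,"He said ""hi"" to me"\n'
row_doubled = next(csv.reader(io.StringIO(doubled)))

# Convention 2: a quote inside a quoted field is preceded by an escape character.
escaped = '1,"He said \\"hi\\" to me"\n'
row_escaped = next(
    csv.reader(io.StringIO(escaped), doublequote=False, escapechar="\\")
)

print(row_doubled)
print(row_escaped)
```

Both calls yield the same two fields, with the inner quotes preserved in the second field.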
Hi all,
When using the parseQuotedStrings function of the CsvReader class, I have
noticed that if the quote character also appears inside the string,
the parsing fails.
For example, if there is a field of this form:
RT @sportsguy33: New Time Warner slogan: Time Warner, where we make you