Re: Read CSV Parse Quoted Strings Function

2015-08-24 Thread Stephan Ewen
Be aware that the CSV input format extends the delimited input format. The delimited input format splits at the line delimiter (such as \n) without awareness of quotes. So that character can never be part of a quote... On Mon, Aug 24, 2015 at 11:55 AM, Tamara Mendt tammyme...@gmail.com wrote:

Re: Statefull computation

2015-08-24 Thread Gyula Fóra
Hi, Okay, than I understood correctly. My point was something different. I never said that the approach I suggested will produce identical results to the continuos DOP 1 top-k, because thats impossible to parallelize. What I suggested is to apply batch (or window) updates which would

Re: Statefull computation

2015-08-24 Thread Aljoscha Krettek
Hi, In the example the result is not correct because the values for a,b,c and d are never forwarded from instance 2 even though they would modify the global top-k result. It works, though, if you partition by the key field (tuple field 0, in this case) before doing the summation and local top-k. I

Re: Read CSV Parse Quoted Strings Function

2015-08-24 Thread Maximilian Michels
Hi Tamara, Quoted strings should not contain the quoting character. The way to work around this is to escape the quote characters. However, currently there is no option to escape quotes which pretty much forbids any use of quote characters within quoted fields. This should be fixed. I opened a

Read CSV Parse Quoted Strings Function

2015-08-24 Thread Tamara Mendt
Hi all, When using the parseQuotedStrings function for the CsvReader class, I have noticed that if the caracter of the quotes is also inside of the string, the parsing fails. For example, if there is a field of this form: RT @sportsguy33: New Time Warner slogan: Time Warner, where we make you