tenthe commented on issue #1269: URL: https://github.com/apache/streampipes/issues/1269#issuecomment-1427943977
Hello @flomickl, the problem with the CSV file is that it has no column for the "timestamp". When you replay the data, the data is streamed as fast as it can be read. This means that all events are be created and a timestamp will be appended in the adapter. It is very likely that multiple events will have the same timestamp and since this is the index in the data lake, only one of these events will be stored. One possible solution would be to add a timestamp to the raw events. Another solution is to mark one of the properties as a dimension property, since one event can be stored for each dimension even if it has the same timestamp. I hope this is helpful. Thanks a lot! Philipp -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
