re-aggregation (a mapreduce approach) #4

AltiMario · 2015-12-27T16:17:50Z

In the current scenario, when you send a stream of messages to validate, they are analyzed using multithreading. This means that no sequential order is respected during processing.
Generally this is not a problem, but in some cases it is. What happens if I have to validate the data of a CSV file where I need to preserve the sequence?
The strategy adopted for the forecasting validation is to store the data in the db, "synchronizing" this piece of code, and at the end analyze the ordered data with the forecasting algorithm.
It's a solution with too much overhead.
For full integration with SeerCore I need to aggregate the streams into a single file (because that's the standard input). This means that if I want a multithreaded solution, I need to re-aggregate the file while preserving the index (as in a mapreduce technique).
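To make the idea concrete, here is a minimal sketch of the index-preserving re-aggregation, written in Python only for illustration. It is not the project's code: `validate` is a hypothetical stand-in for the real per-message validation, and `validate_in_order` and the `workers` parameter are names made up for this example. Each record is tagged with its position before being handed to the thread pool, and the results are sorted by that position once all workers have finished.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def validate(record: str) -> str:
    """Hypothetical stand-in for the real per-message validation."""
    return record.strip().upper()


def validate_in_order(records, workers=4):
    """Validate records in parallel, then re-aggregate them by original index."""
    # "Map" input: tag every record with its position in the source.
    indexed = list(enumerate(records))

    def task(pair):
        index, record = pair
        return index, validate(record)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [pool.submit(task, pair) for pair in indexed]
        # Results are collected as tasks finish, so this order is arbitrary.
        results = [future.result() for future in as_completed(futures)]

    # "Reduce" step: sort by the saved index to restore the original sequence.
    results.sort(key=lambda pair: pair[0])
    return [value for _, value in results]
```

The returned list is in the same order as the input, so it could be written out line by line as the single aggregated file without going through the db.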

mastrogiovanni · 2016-01-05T23:17:06Z

We need to discuss the architecture....


AltiMario added the enhancement label Dec 27, 2015

AltiMario assigned mastrogiovanni Dec 27, 2015

mastrogiovanni added a commit that referenced this issue Jan 13, 2016

#4 index incremental validation clean up saved records after validation of last item

cb81db7

AltiMario unassigned mastrogiovanni Mar 15, 2019
