[Feature Request] Evaluation of tasks without running it. #127

joychugh · 2018-08-02T03:54:11Z

It would be good to have a way to evaluate the task, if all it's dependencies are OK but not run it.
Something like FloRunner.evaluateTask(task). This can be useful if the the task you want to evaluate
a) runs for a long time (data pipelines)
b) makes a permanent or hard to revert change in an external system

danielnorberg · 2018-08-27T13:01:44Z

This is interesting. One thought is that this might require Flo to gain an understanding of external dependencies. Currently Flo does not have any external dependency primitive or representation. Everything is just tasks. For example, evaluating a bigquery input dependency is done by running a lookup task. But Flo doesn't understand that it is a lookup or a dependency resolution.

To compare with Luigi, it does not have any first class external dependency primitive either, but at least it has ExternalTask.

An easy to implement Flo analog to the luigi ExternalTask could be a ExternalDependencyOperator that could be implemented by BigQueryLookupOperator, etc. This would then allow the implementation of a general mechanism for evaluating just the dependencies of a Flo dag.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Evaluation of tasks without running it. #127

[Feature Request] Evaluation of tasks without running it. #127

joychugh commented Aug 2, 2018

danielnorberg commented Aug 27, 2018

[Feature Request] Evaluation of tasks without running it. #127

[Feature Request] Evaluation of tasks without running it. #127

Comments

joychugh commented Aug 2, 2018

danielnorberg commented Aug 27, 2018