Discussion on: Consider SQL when writing your next processing pipeline

View post

Replies for: Thanks for writing this. I’m interested to learn more. expressing pipelines First the term “expressing pipelines” I am not sure if I un...

Thanks for reading my post!

To my mind, a processing pipeline is anything that reads data from a number of source(s), joins/transforms/filters those data, and outputs the results to some number of destination(s). (Note that it is rare, but occasionally the output destination is the same as the input source.) So I would say both of your examples would qualify.

I wasn't familiar with Blaze, but having had a quick look, it does look like I am suggesting a similar approach, but indeed just going straight to SQL instead.

simkimsia • Jun 28 '19

Actually when you define processing pipeline as "anything that reads data from a number of source(s), joins/transforms/filters those data, and outputs the results to some number of destination(s)."

You're talking essentially about ETL right?

BenBirt • Jun 28 '19

More or less, yes!