What we want is to run an experiment in which we inter-weave two different sets of articles to create the relevancy feed. Some constraints are:
- There are two sets of articles.
- We don’t want duplicate articles.
- We want a primary set and a secondary set.
- We want the secondary set to be interspersed within the primary set.
- How we select what is in each set is it’s own unique problem.
What follows is a quick Postgresql query that is database agnostic. Basically, can I setup a collation of two sets of data. There are some refinements to how I collate, but I’m presently satisfied with the bones of the query.
WITH integers AS ( SELECT ROW_NUMBER() OVER (PARTITION BY '1') AS collation_order, 'integers' AS table_source, ,* FROM generate_series(1,12) ), threes AS ( SELECT (ROW_NUMBER() OVER (PARTITION BY '1') * 0.75 + 0.1) AS collation_order, 'threes' AS table_source, ,* FROM generate_series(3,27,3) ) ( (SELECT * FROM integers) UNION (SELECT * FROM threes WHERE generate_series NOT IN ( SELECT generate_series FROM integers)) ) ORDER BY collation_order
The results are as follows:
SQL Results of Collation of Two Datasets from Common Domains
I’m not yet certain if this is something I’ll incorporate into the possible Feed Variants, but I wanted to put this out there.