Skip to content

Cartesian product to generate similarity pairs

Madhav Sharan edited this page Aug 26, 2016 · 1 revision

If there are N videos we have a total of [ ( N * (N-1)/2 ) + N ] video pairs.

With new sequence file format we needed to create a new FileFormat to calculate cartesian splits and then provide pairs to mapper jobs for each split. This is done in -

Better explained in -

Clone this wiki locally