References of Current Spatial Join Algorithm #707
Replies: 1 comment 1 reply
-
|
Hello! Not sure if there is name for the "spatial join" algorithm itself, it basically works the same as a regular database "hash-join", but instead of creating a hash-table we create an r-tree on the smaller side join input which we use to find candidate matches when we later pass through the left-side input. The r-tree construction and probing algorithm is inspired by the flatbush implementation, but with some smaller modifications to make it possible to "suspend" and ongoing lookup if we find more matches than we can fit into a duckdb "vector". Kyle wrote an amazing guide on understanding the original Flatbush code, which I highly recommend! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi @Maxxen , I'm very happy to see the new spatial join operator in duckdb and its improvement! I understand from the blog post that currently it's using R-index based join algorithm. I would like to understand more about the algorithm besides the bits described in the post. I wonder if there's a standard name for the algorithm that the duckdb spatial join essentially implements? It would be even better if you're aware of any academic papers describing this algorithm.
Beta Was this translation helpful? Give feedback.
All reactions