Efficient Large-scale Distance-Based Join Queries in SpatialHadoop
Identificadores
URI: http://hdl.handle.net/10835/5264
ISSN: 1384-6175
DOI: https://doi.org/10.1007/s10707-017-0309-y
ISSN: 1384-6175
DOI: https://doi.org/10.1007/s10707-017-0309-y
Compartir
Metadatos
Mostrar el registro completo del ítemAutor
García García, Francisco; Corral Liria, Antonio Leopoldo; Iribarne Martínez, Luis Fernando; Vassilakipoulos, Michael; Manolopoulos, YannisFecha
2017Resumen
Efficient processing of Distance-Based Join Queries (DBJQs) in spatial databases is of paramount importance in many application domains. The most representative and known DBJQs are the K Closest Pairs Query (KCPQ) and the ε Distance Join Query (εDJQ). These types of join queries are characterized by a number of desired pairs (K) or a distance threshold (ε) between the components of the pairs in the final result, over two spatial datasets. Both are expensive operations, since two spatial datasets are combined with additional constraints. Given the increasing volume of spatial data originating from multiple sources and stored in distributed servers, it is not always efficient to perform DBJQs on a centralized server. For this reason, this paper addresses the problem of computing DBJQs on big spatial datasets in SpatialHadoop, an extension of Hadoop that supports efficient processing of spatial queries in a cloud-based setting. We propose novel algorithms, based on plane-sweep, to perform...