Geoquery massive amounts of HDFS data from Spark processes
The geospatial dimension of data is getting revealed as the most natural, powerful and intuitive way to explore the expanding world of data and services. The ability to rely on real-world axis such as places, people, events and things can provide better answers for everyday tasks for individuals, as well as a deep understanding for businesses and administrations.
The Urban Data Analytics team at BDigital research efforts are focused on that scenario, with an offering built upon the ability to rapidly deploy pre-defined (but also arbitrary) analytic functions on geospatial time-series of data. Currently available developments already provide support for characterization, classification, clustering, anomaly detection and trajectory mining, while multivariate analytics and predictive functions (both on single and combined time-series) are targeted for the near future.
In order to enable such analytic operations on geospatially enabled data, the underlying computational infrastructure must provide the distributed computational processes with a tool to support large-speed and highly dynamic geoquerying operations on massive amounts of data. The combination of end-to-end geoquerying components and GIS enhancements for HDFS data has been implemented and tested by BDigital as the most promising solution for such requirements.