TY - GEN
T1 - EaserGeocoder
T2 - 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018
AU - Rashidian, Sina
AU - Jain, Shubham Kumar
AU - Dong, Xinyu
AU - Wang, Fusheng
N1 - Publisher Copyright:
© 2018 held by the owner/author(s).
PY - 2018/11/6
Y1 - 2018/11/6
N2 - Increased availability of large amounts of address data provides opportunities for data driven studies to improve decision making in business applications and support precision public health with high resolution geolocations. Geocoding large number of addresses is challenging due to high cost and often disclosure of sensitive data to vendors over the Web. Most geocoders take advantage of Web APIs which require sending private addresses over the Internet, which may not be an option for many applications with sensitive data including public health and geo-medicine. Meanwhile, the cost for geocoding massive number of addresses could be high and becomes a major hurdle for many users. To overcome these challenges, we developed an open source on-premise geocoding software EaserGeocoder, which uses a novel integrative geocoding model to achieve high accuracy through integrating multiple open data sources. EaserGeocoder takes advantage of machine learning based approaches to determine best answers from multiple data sources. EaserGeocoder can also be easily parallelized to achieve high scalability through parallelized search and distributed computing. EaserGeocoder is on a par with commercial geocoding systems, outperforms open source systems, and is available for free.
AB - Increased availability of large amounts of address data provides opportunities for data driven studies to improve decision making in business applications and support precision public health with high resolution geolocations. Geocoding large number of addresses is challenging due to high cost and often disclosure of sensitive data to vendors over the Web. Most geocoders take advantage of Web APIs which require sending private addresses over the Internet, which may not be an option for many applications with sensitive data including public health and geo-medicine. Meanwhile, the cost for geocoding massive number of addresses could be high and becomes a major hurdle for many users. To overcome these challenges, we developed an open source on-premise geocoding software EaserGeocoder, which uses a novel integrative geocoding model to achieve high accuracy through integrating multiple open data sources. EaserGeocoder takes advantage of machine learning based approaches to determine best answers from multiple data sources. EaserGeocoder can also be easily parallelized to achieve high scalability through parallelized search and distributed computing. EaserGeocoder is on a par with commercial geocoding systems, outperforms open source systems, and is available for free.
KW - Geocoding
KW - Geographic information system
KW - Text searching
UR - https://www.scopus.com/pages/publications/85058628202
U2 - 10.1145/3274895.3274929
DO - 10.1145/3274895.3274929
M3 - Conference contribution
AN - SCOPUS:85058628202
T3 - GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems
SP - 572
EP - 575
BT - 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018
A2 - Xiong, Li
A2 - Tamassia, Roberto
A2 - Banaei, Kashani Farnoush
A2 - Guting, Ralf Hartmut
A2 - Hoel, Erik
PB - Association for Computing Machinery
Y2 - 6 November 2018 through 9 November 2018
ER -