Header image

Introducing Geomancer: an open-source library for geospatial feature engineering

April 16, 2019 blog-post geospatial machine-learning feature-engineering geomancer open-source

Here at Thinking Machines, we work with a lot of geospatial data: we’ve identified gaps in OpenStreetMap (OSM), provided geospatial analytics for our clients, and harnessed machine learning to estimate poverty from satellite imagery. However, we realized that we were spending too much time in repetitive feature engineering tasks. So to operate on geospatial data at scale, we decided to automate our execution and delivery workflows.

Enter Geomancer, our open-source library for geospatial feature engineering! It leverages geospatial data such as OpenStreetMap (OSM) coupled with a data warehouse like BigQuery. We use this to create, share, and iterate features for our downstream machine learning tasks. This tool allows us to:

Let’s see Geomancer in action! Given a set of points, we can create a feature that gets the distance to the nearest supermarket within a 10-km radius:

Geomancer’s Core API is powered by a SQLAlchemy backend that handles the translation of a Spell into a SQL dialect. This makes the library highly-extensible, allowing you to add new feature-primitives and database backends for your specific use-case.

We hope that Geomancer can help you scale your geospatial feature engineering needs! You can get started by reading through our getting started demo and setup guide. You can find more details through the documentation. Lastly, contributions are welcome! Simply file an issue or submit a pull request through GitHub.

Are you interested in using machine learning and geospatial data to help you and your organization make better and more informed decisions? Get in touch with us at [email protected] to learn more!


How a Digital Logistics Company Used Data to Rethink Operations

A unified and cleaned data pipeline to empower meaningful analysis and optimized business decisions

Tourism in Bangkok: Road to Recovery with Mapbox Movement

While Bangkok’s COVID-19 recovery is slow as lockdowns are reinstated to contain rising cases, Mapbox Movement data shows that some areas in the capital are starting to pick up a year since the onset of the pandemic.

Effective Targeted Campaigns with Machine Intelligence

Advanced market segmentation using Machine Learning to develop targeted campaigns to improve consumer behavior around payment operations