Overture Link Data Pipeline
Year: 2025
Section: Data Engineering
Status: Complete
Organization: World Bank
Open-source ETL pipeline published for the World Bank to automate Overture Maps extraction, transformation, and publishing across operational geospatial workflows.
Outcome: Reduced manual geospatial processing time through a repeatable workflow supporting 176 countries, reusable queries, and both ArcGIS Online publishing and portable GIS exports.
Methods: Python CLI, ETL automation, Geospatial data engineering, DuckDB-based querying, ArcGIS Online integration
Problem
Teams often spend significant time repeating the same geospatial data preparation steps across countries, tools, and reporting cycles.
Work
I developed and published an open-source pipeline with modular Source -> Transform -> Publish/Export stages, including CLI commands for cached Overture dumps (overture-dump), ArcGIS Online publishing (arcgis-upload), and direct file export (export).
Delivery
The result is a reusable World Bank-ready workflow that scales across countries and sectors, shortens setup time for new analyses, and improves consistency for recurring production geospatial pipelines.