The MapScaping Podcast - GIS, Geospatial, Remote Sensing, earth observation and digital geography

MapScaping

A podcast for geospatial people. Weekly episodes that focus on the tech, trends, tools, and stories from the geospatial world. Interviews with the people that are shaping the future of GIS, geospatial as well as practitioners working in the geo industry. This is a podcast for the GIS and geospatial community subscribe or visit https://mapscaping.com to learn more

  1. JAN 19

    A5 Pentagons Are the New Bestagons

    How can you accurately aggregate and compare point-based data from different parts of the world? When analyzing crime rates, population, or environmental factors, how do you divide the entire globe into equal, comparable units for analysis?   For data scientists and geospatial analysts, these are fundamental challenges. The solution lies in a powerful class of tools called Discrete Global Grid Systems (DGGS). These systems provide a consistent framework for partitioning the Earth's surface into a hierarchy of cells, each with a unique identifier. The most well-known systems, Google's S2 and Uber's H3, have become industry standards for everything from database optimization to logistics.   However, these systems come with inherent trade-offs. Now, a new DGGS called A5 has been developed to solve some of the critical limitations of its predecessors, particularly concerning area distortion and analytical accuracy.   Why Gridding the Globe is Harder Than It Looks The core mathematical challenge of any DGGS is simple to state but difficult to solve: it is impossible to perfectly flatten a sphere onto a 2D grid without introducing some form of distortion. Think of trying to apply a perfect chessboard or honeycomb pattern to the surface of a ball; the shapes will inevitably have to stretch or warp to fit together without gaps.   All DGGS work by starting with a simple 3D shape, a polyhedron, and projecting its flat faces onto the Earth's surface. The choice of this initial shape and the specific projection method used are what determine the system's final characteristics. As a simple analogy, consider which object you’d rather be hit on the head with: a smooth ball or a spiky cube? The ball is a better approximation of a sphere. When you "inflate" a spiky polyhedron to the size of the Earth, the regions nearest the sharp vertices get stretched out the most, creating the greatest distortion.   A Quick Look at the Incumbents: S2 and H3   To understand what makes A5 different, it's essential to have some context on the most popular existing systems.   Google's S2: The Cube-Based Grid The S2 system is based on projecting a cube onto the sphere. On each face of this conceptual cube, a grid like a chessboard is applied. This approach is relatively simple but introduces significant distortion at the cube’s vertices, or "spikes." As the grid is projected onto the sphere, the cells near these vertices become stretched into diamond shapes instead of remaining square. S2 is widely used under the hood for optimizing geospatial queries in database systems like Google BigQuery.   Uber's H3: The Hexagonal Standard Uber's H3 system starts with an icosahedron—a 20-sided shape made of triangles. Because an icosahedron is a less "spiky" shape than a cube, H3 suffers from far less angular distortion. Its hexagonal cells look more consistent across the globe, making it popular for visualization. H3's immense success is also due to its excellent and user-friendly ecosystem of tools and libraries, making it easy for developers to adopt. However, H3 has one critical limitation for data analysis: it is not an equal-area system. This was a deliberate trade-off, not a flaw; H3 was built by a ride-sharing company trying to match drivers to riders, a use case where exact equal area doesn't particularly matter. To wrap a sphere in hexagons, you must also include exactly 12 pentagons—just like on a soccer ball. If you look closely at a football, you'll see the pentagonal panels are slightly smaller than the hexagonal ones. This same principle causes H3 cells to vary in size. The largest and smallest hexagons at a given resolution can differ in area by a factor of two, meaning that comparing raw counts in different cells is like comparing distances in miles and kilometers without conversion. For example, cells near Buenos Aires are smaller because of their proximity to one of the system's core pentagons, creating a potential source of error if not properly normalized.   Introducing A5: A New System Built for Accuracy A5 is a new DGGS designed from the ground up to prioritize analytical accuracy. It is based on a dodecahedron, a 12-sided shape with pentagonal faces that is, in the words of its creator, "even less spiky" than H3's icosahedron.   The motivation for A5 came from a moment of discovery. Its creator, Felix Palmer, stumbled upon a unique 2D tiling pattern made of irregular pentagons. This led to a key question: could this pattern be extended to cover the entire globe? The answer was yes, and it felt like uncovering something "very, very fundamental." This sense of intellectual curiosity, rather than a narrow business need, is the foundation upon which A5 is built. A5's single most important feature is that it is a true equal-area system. Using a specific mathematical projection, A5 ensures that every single cell at a given resolution level has the exact same area. This guarantee even accounts for the Earth's true shape as a slightly flattened ellipsoid, not a perfect sphere.   This is a game-changer for analysis. By providing cells of identical size, A5 eliminates the need for analysts to perform complex area-based normalization. This prevents a common source of error and dramatically simplifies workflows when calculating metrics like population density, risk exposure, or any other value that depends on a consistent spatial unit.   A5 vs. H3 vs. S2: A Head-to-Head Comparison The choice of base polyhedron and projection method results in significant differences between the major DGGS. Here is a direct comparison of their key technical characteristics. Metric A5 H3 S2 Base Polyhedron Dodecahedron (12 pentagonal faces) Icosahedron (20 triangular faces) Cube (6 square faces) Equal-Area Cells Yes (Exact) No (Up to 2x area variation) No Max Resolution ~30 square millimeters ~1 square meter ~1 square centimeter Global Hierarchy Yes (Single top-level world cell) No (122 top-level cells) Yes (6 top-level cells)   The A5 Ecosystem and its "Polyglot Mirroring" Approach   The success of H3 proves that a powerful mathematical system is not enough; it needs a rich ecosystem of accessible tools to gain adoption. A5 is being built with this principle in mind, but with a novel development strategy.   This approach is called "polyglot mirroring." Instead of building a single core library in C and creating language bindings, A5 maintains separate, complete, and equivalent codebases in multiple languages, including TypeScript, Python, and Rust. To keep these distinct codebases synchronized, Large Language Models (LLMs) are used to port changes and new features from one language to another. This strategy makes the system more accessible and maintainable for developers within each language's native community.   The power of this approach was proven in a true "wow moment" during A5's development. The creator, having never written a single line of Rust, fed the existing TypeScript and Python versions and a comprehensive test suite to an LLM. After about a week of guided iteration, the model produced a complete, working, high-performance Rust library. This demonstrates how modern tools can enable a single developer to build and maintain a truly multi-lingual ecosystem, something that would have been impossible just a few years ago.   Conclusion: When Should You Choose A5?   A5 offers a powerful and precise alternative to existing global grid systems. Its primary advantages make it the ideal choice for specific, demanding use cases.   • Statistical Validity: Any analysis where equal-area cells are paramount for accuracy is a prime candidate for A5. This includes density mapping, demographic studies, environmental modeling, and financial risk assessment. • Extreme Resolution: For applications requiring precision beyond what H3 or S2 can offer, A5's ability to index down to cells of approximately 30 square millimeters provides unmatched granularity. • Efficient Global Hierarchy: Workflows that need to query data at a global scale benefit from A5's simple hierarchy, which starts from a single cell representing the entire world. In contrast, loading global data with H3's 122 top-level cells could require 122 separate requests, creating unnecessary complexity and inefficiency. To explore the A5 system, see detailed visualizations, and understand the technical comparisons in more depth, visit the official website at a5geo.org.

    37 min
  2. 12/09/2025

    From Data Dump to Data Product

    This conversation with Jed Sundwall, Executive Director of Radiant Earth, starts with a simple but crucial distinction: the difference between data and data products. And that distinction matters more than you might think. We dig into why so many open data portals feel like someone just threw up a bunch of files and called it a day. Sure, the data's technically "open," but is it actually useful? Jed argues we need to be way more precise with our language and intentional about what we're building. A data product has documentation, clear licensing, consistent formatting, customer support, and most importantly - it'll actually be there tomorrow. From there, we explore Source Cooperative, which Jed describes as "object storage for people who should never log into a cloud console." It's designed to be invisible infrastructure - the kind you take for granted because it just works. We talk about cloud native concepts, why object storage matters, and what it really means to think like a product manager when publishing data. The conversation also touches on sustainability - both the financial kind (how do you keep data products alive for 50 years?) and the cultural kind (why do we need organizations designed for the 21st century, not the 20th?). Jed introduces this idea of "gazelles" - smaller, lighter-weight institutions that can move together and actually get things done. We wrap up talking about why shared understanding matters more than ever, and why making data easier to access and use might be one of the most important things we can do right now.

    46 min
  3. 12/02/2025

    Reflections from FOSS4G 2025

    Reflections from the FOSS4G 2025 conference    Processing, Analysis, and Infrastructure (FOSS4G is Critical Infrastructure) The high volume of talks on extracting meaning from geospatial data—including Python workflows, data pipelines, and automation at scale—reinforced the idea that FOSS4G represents critical infrastructure. AI Dominance: AI took up a lot of space at the conference. I was particularly interested in practical, near-term impact talks like AI assisted coding and how AI large language models can enhance geospatial workflows in QGIS. Typically, AI discussions focus on big data and earth observation, but these topics touch a larger audience. I sometimes wonder if adding "AI" to a title is now like adding a health warning: "Caution, a machine did this". Python Still Rules (But Rust is Chatting): Python remains the pervasive, default geospatial language. However, there was chatter about Rust. One person suggested rewriting QGIS in Rust might make it easier to attract new developers. Data Infrastructure, Formats, and Visualization When geospatial people meet, data infrastructure—the "plumbing" of how data is stored, organized, and accessed—always dominates. Cloud Native Won: Cloud native architecture captured all the attention. When thinking about formats, we are moving away from files on disk toward objects in storage and streaming subsets of data. Key cloud-native formats covered included COGs (Cloud Optimized GeoTIFFs), Zarr, GeoParquet, and PMTiles. A key takeaway was the need to choose a format that best suits the use case, defined by who will read the file and what they will use the data for, rather than focusing solely on writing it. The Spatial Temporal Asset Catalog (STAC) "stole the show" as data infrastructure, and DuckDB was frequently mentioned. Visualization is moving beyond interactive maps and toward "interactive experiences". There were also several presentations on Discrete Global Grid Systems (DGGS). Standards and Community Action Standards Matter: Standards are often "really boring," but they are incredibly important for interoperability and reaping the benefits of network effects. The focus was largely on OGC APIs replacing legacy APIs like WMS and WFS (making it hard not to mention PyGeoAPI). Community Empowerment: Many stories focused on community-led projects solving real-world problems. This represents a shift away from expert-driven projects toward community action supported by experts. Many used OSM (OpenStreetMap) as critical data infrastructure, highlighting the need for locals to fill in large empty chunks of the map. High-Level Takeaways for the Future If I had to offer quick guidance based on the conference, it would be: Learn Python. AI coding is constantly improving and worth thinking about. Start thinking about maps as experiences. Embrace the Cloud and understand cloud-native formats. Standards matter. AI is production-ready and will be an increasingly useful interface to analysis. Reflections: What Was Missing? The conference was brilliant, but a few areas felt underrepresented: Sustainable Funding Models: I missed a focus on how organizations can rethink their business models to maintain FOSS4G as critical infrastructure without maintainers feeling their time is an arbitrage opportunity. Niche Products: I would have liked more stories about side hustles and niche SAS products people were building, although I was glad to see the "Build the Thing" product workshop on the schedule. Natural Language Interface: Given the impact natural language is having on how we interact with maps and geo-data, I was surprised there wasn't more dedicated discussion around it. I believe it will be a dominant way we interact with the digital world. Art and Creativity: Beyond cartography and design talks, I was surprised how few talks focused on creative passion projects built purely for the joy of creation, not necessarily tied to making a part of something bigger.

    14 min
  4. 11/17/2025

    I have been making AI slop and you should too

    AI Slop: An Experiment in Discovery Solo Episode Reflection: I'm back behind the mic after about a year-long break. Producing this podcast takes more time than you might imagine, and I was pretty burnt out. The last year brought some major life events, including moving my family back to New Zealand from Denmark, dealing with depression, burying my father, starting a new business with my wife, and having a teenage daughter in the house. These events took up a lot of space. The Catalyst for Return: Eventually, you figure out how to deal with grief, stop mourning the way things were, and focus on the way things could be. When this space opened up in my life, AI came into the picture. AI got me excited about ideas again because for the first time, I could just build things myself without needing to pitch ideas or spend limited financial resources. On "AI Slop": I understand why some content is called "slop," but for those of us who see AI as a tool, I don't think the term is helpful. We don't refer to our first clumsy experiments with other technologies—like our first map or first lines of code—as slop. I believe that if we want to encourage curiosity and experimentation, calling the results of people trying to discover what's possible "slop" isn't going to help.   My AI Experimentation Journey My goal in sharing these experiments is to encourage you to go out and try AI yourself. Phase 1: SEO and Content Generation My experimentation began with generating SEO-style articles as a marketing tool. As a dyslexic person, I previously paid freelancers thousands of dollars over the years to help create content for my website because it was too difficult or time-consuming for me to create myself. Early Challenges & Learning: My initial SEO content wasn't great, and Google recognized this, which is why those early experiments don't rank in organic search. However, this phase taught me about context windows, the importance of prompting (prompt engineering), and which models and tools to use for specific tasks. Automation and Agents: I played around with automation platforms like Zapier, make.com, and n8n. I built custom agents, starting with Claude projects and custom GPTs. I even experimented with voice agents using platforms like Vappy and 11 Labs. Unexpected GIS Capabilities: During this process, I realized you can ask platforms like ChatGPT to perform GIS-related data conversions (e.g., geojson to KML or shapefile using geopandas), repro data, create buffers around geometries, and even upload a screenshot of a table from a PDF and convert it to a CSV file. While I wouldn't blindly trust an LLM for critical work, it's been interesting to learn where they make mistakes and what I can trust them for. AI as a Sparring Partner: I now use AI regularly to create QGIS plugins and automations. Since I often work remotely as the only GIS person on certain projects, I use AI—specifically talking to ChatGPT via voice on my phone—as a sparring partner to bounce ideas off of and help me solve problems when I get stuck. Multimodal Capabilities: The multimodal nature of Gemini is particularly interesting; if you share your screen while working in QGIS, Gemini can talk you through solving a problem (though you should consider privacy concerns).   The Shift to Single-Serve Map Applications I noticed that the digital landscape was changing rapidly. LLMs were becoming "answer engines," replacing traditional search on Google, which introduced AI Overviews. Since these models no longer distribute traffic to websites like mine the way they used to, I needed a new strategy. The Problem with Informational Content: Informational content on the internet is going to be completely dominated by AI. The Opportunity: Real Data: AI is great at generating content, but if you need actual data—like contours for your specific plot of land in New Zealand—you need real data, not generated data. New Strategy: My new marketing strategy is to create targeted, single-serve map applications and embed them in my website. These applications do one thing and one thing only, using open and valuable data to solve very specific problems. This allows me to rank in organic search because these are problems that LLMs have not yet mastered. Coding with AI: I started by using ChatGPT to code small client-side map applications, then moved to Claude, which is significantly better than OpenAI's models and is still my coding model of choice. Currently, I use Cursor AI as a development environment, swapping between Claude code, OpenAI's Codex, and other models. A Caveat: Using AI for coding can be incredibly frustrating. The quality of the code drops dramatically once it reaches a certain scale. However, even with flaws, it’s a thousand times better and faster than what I could do myself, making my ideas possible. Crucially, I believe that for the vast majority of use cases, mediocre code is good enough.   Success Story: GeoHound After practicing and refining my methods, I decided to build a Chrome extension. Every GIS professional can relate to the pain point of sifting through HTTP calls in the developer tools networking tab to find the URL for a web service to use in QGIS or ArcGIS. The Impossible Idea Made Possible: I had pitched this idea to multiple developers in the past, who were either uninterested or quoted between $10,000 and $15,000 to build it. The AI Result: Using AI, I had a minimum viable Chrome extension—GeoHound—that filtered out common geo web services within 3 hours. It took a few days of intermittent work before it was published to the Chrome and Edge web stores. Current Use: GeoHound has thousands of users (my own statistics suggest closer to or over 3,000 users, compared to the 1,000 shown on the Chrome store). While not perfect, it is clearly good enough, and this was something that was impossible for me just six months ago.   My Point: Now is the Time to Experiment AI is here, and it will lead to profound change. Experimenting with it is vital because it will: Help you develop the skills and knowledge needed to meet the needs of the people you serve. Help you better understand what is hype and what is not, allowing you to decipher which voices to listen to. We are moving from a world where information is ubiquitous to a world where knowledge is ubiquitous. Now is the time to be making sloppy mistakes. Don't let perfection stop you from learning how to make stuff that is going to be good enough. If your work consists of repetitive tasks that follow step-by-step recipes, that's going to be a tough gig going forward. Long-term, there will be new opportunities, but you need to be experimenting now to be in a position to take advantage of them. Resources Mentioned You will find a list of the tools I've been experimenting with in the show notes. Automation: make.com, n8n, Zapier Voice/Agents: 11 Labs, Vappy, custom GPT (MCP servers) Coding Models: Claude (current choice), OpenAI's Codex, ChatGPT Development Environment: Cursor AI LLMs/Multimodal: Gemini (studio.google.com) Browser Extension: GeoHound (for Chrome and Edge) https://chromewebstore.google.com/detail/nooldeimgcodenhncjkjagbmppdinhfe?utm_source=item-share-cb If you build anything interesting with these tools, please let me know! I'd love to hear about your own experiments.

    19 min
4.7
out of 5
114 Ratings

About

A podcast for geospatial people. Weekly episodes that focus on the tech, trends, tools, and stories from the geospatial world. Interviews with the people that are shaping the future of GIS, geospatial as well as practitioners working in the geo industry. This is a podcast for the GIS and geospatial community subscribe or visit https://mapscaping.com to learn more

You Might Also Like