Global-Scale 3D Mapping: The Data Challenge

To localize anywhere, you need maps everywhere. Building 3D maps at global scale is a data problem before it's an algorithm problem.

Data Sources

Meta has access to billions of geolocated images from Facebook, Instagram, and user-shared content.

Potential: Massive coverage, especially in populated areas.
Challenges: Variable quality, privacy constraints, not uniformly distributed.

Vehicles or pedestrians carrying calibrated camera rigs capture specific areas.

Potential: High quality, controlled conditions, known accuracy.
Challenges: Expensive, doesn't scale to everywhere.

Third-party sources include partnerships with mapping companies, government data, and open-source maps.

Potential: Professional quality, existing coverage.
Challenges: Licensing, update frequency, format compatibility.

AR device users contribute mapping data during normal use.

Potential: Always fresh, covers where users actually go.
Challenges: Quality control, privacy, opt-in rates.

High-quality mapping (survey-grade equipment):

Crowd-sourced mapping (user photos):

We need both: crowd-sourced for coverage, high-quality for validation.
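One way to picture "high-quality for validation" is to compare landmark positions from a crowd-sourced reconstruction against the same landmarks measured with survey-grade equipment. The sketch below is only an illustration: the function name `rmse`, the point lists, and the assumption that both sets are already matched one-to-one and expressed in the same metric coordinate frame are all hypothetical, not taken from the pipeline described here.

```python
import math

def rmse(estimated, reference):
    """Root-mean-square 3D distance between matched points.

    Assumes estimated[i] and reference[i] describe the same landmark,
    in the same coordinate frame and units (hypothetical setup).
    """
    if not estimated or len(estimated) != len(reference):
        raise ValueError("need two non-empty point lists of equal length")
    squared = [
        sum((e - r) ** 2 for e, r in zip(p, q))
        for p, q in zip(estimated, reference)
    ]
    return math.sqrt(sum(squared) / len(squared))

# Illustrative values only: crowd-sourced estimates vs. survey-grade reference.
crowd = [(10.2, 5.1, 0.0), (20.0, 7.9, 1.1)]
survey = [(10.0, 5.0, 0.0), (20.0, 8.0, 1.0)]
print(f"RMSE: {rmse(crowd, survey):.3f} m")
```

A single error figure like this hides where the map is wrong, so in practice it would likely be reported per area rather than globally.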

80% of the world's photos are of 1% of locations.

Tourist sites: millions of photos
Suburban neighborhoods: almost nothing

For a visual positioning system (VPS) to be useful everywhere, we need maps everywhere. That means solving the coverage long tail.
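The long tail can be made measurable by binning geotagged photos into grid cells and asking what share of photos lands in the busiest cells, and how many cells have no photos at all. This is a minimal sketch under stated assumptions: the fixed-size latitude/longitude grid, the cell size, and the names `cell_of` and `coverage_stats` are illustrative choices, not the method used in production.

```python
import math
from collections import Counter

def cell_of(lat, lon, cell_deg=0.01):
    """Map a coordinate to a grid cell (hypothetical fixed-size lat/lon grid)."""
    return (math.floor(lat / cell_deg), math.floor(lon / cell_deg))

def coverage_stats(photos, total_cells, top_fraction=0.01, cell_deg=0.01):
    """Return (share of photos in the busiest cells, fraction of empty cells).

    photos: list of (lat, lon) pairs.
    total_cells: number of cells in the region of interest, including empty ones.
    """
    if not photos or total_cells <= 0:
        raise ValueError("need at least one photo and a positive cell count")
    counts = Counter(cell_of(lat, lon, cell_deg) for lat, lon in photos)
    # Number of cells that make up the "top" slice, at least one.
    k = max(1, round(total_cells * top_fraction))
    top_share = sum(c for _, c in counts.most_common(k)) / len(photos)
    empty_fraction = 1 - len(counts) / total_cells
    return top_share, empty_fraction

# Illustrative data: eight photos of one landmark, two photos elsewhere.
photos = [(48.8584, 2.2945)] * 8 + [(40.1234, -75.5678), (34.0567, -118.2433)]
top_share, empty_fraction = coverage_stats(photos, total_cells=100)
print(f"top 1% of cells hold {top_share:.0%} of photos; "
      f"{empty_fraction:.0%} of cells are empty")
```

Tracking the empty-cell fraction over time gives a direct view of whether the long tail is actually shrinking.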

Approaches:

Using public photos for mapping raises questions:

Protections:

Privacy isn't a feature; it's a constraint on everything we build.

Raw images → map requires:

At billions of images, every step is an infrastructure challenge.

Current state: processing capacity is 10M images/day. We need 10x that, roughly 100M images/day.
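To put those rates in perspective, a quick back-of-the-envelope calculation helps. The 10M images/day figure and the 10x target come from the text. The one-billion-image backlog is an assumed round number standing in for "billions of images", and the function names are illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60

def days_to_process(total_images, images_per_day):
    """Days needed to work through a backlog at a constant daily rate."""
    if images_per_day <= 0:
        raise ValueError("images_per_day must be positive")
    return total_images / images_per_day

def images_per_second(images_per_day):
    """Sustained per-second throughput implied by a daily rate."""
    return images_per_day / SECONDS_PER_DAY

current_rate = 10_000_000        # from the text: 10M images/day
target_rate = 10 * current_rate  # from the text: 10x the current capacity
backlog = 1_000_000_000          # assumption: one billion images

for label, rate in (("current", current_rate), ("target", target_rate)):
    print(f"{label}: {images_per_second(rate):,.0f} images/s sustained, "
          f"{days_to_process(backlog, rate):,.0f} days per billion images")
```

At the current rate, each billion images takes 100 days; at the target rate, 10 days. This ignores reprocessing and newly arriving images, both of which would add to the load.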