Stops matching
Matching Stages
| Stage | Count | % of Total | Many-to-One | Not Closest | Mean (m) | Median (m) | P95 (m) |
|---|---|---|---|---|---|---|---|
| Exact (UIC ref) | 21,540 | 34.1% | 13,573 | 5,568 | 20.93 | 8.65 | 78.55 |
| Name matching | 445 | 0.7% | 6 | 161 | 48.66 | 14.36 | 120.61 |
| Route matching (total) | 14,484 | 22.9% | 11 | 3,468 | 7.26 | 5.28 | 21.68 |
| GTFS | 14,484 | 22.9% | 11 | 3,468 | 7.26 | 5.28 | 21.68 |
| Distance (total) | 12,823 | 20.3% | 2,422 | 3,726 | - | - | - |
| Stage 0: Trio side matching | 2,386 | 3.8% | 2,386 | 830 | 8.75 | 4.48 | 23.93 |
| Stage 1: Group proximity (50m) | 9,448 | 15.0% | 26 | 2,454 | - | - | - |
| Stage 1a: UIC ref | 9,140 | 14.5% | 23 | 2,383 | 7.71 | 5.15 | 25.16 |
| Stage 1b: UIC name | 39 | 0.1% | 0 | 7 | 9.04 | 6.46 | 37.1 |
| Stage 1c: Name | 269 | 0.4% | 3 | 64 | 12.11 | 8.14 | 37.87 |
| Stage 1b: Group proximity (150m) | 501 | 0.8% | 5 | 377 | - | - | - |
| Stage 1d: UIC ref (150m) | 472 | 0.7% | 3 | 353 | 81.64 | 74.66 | 136.25 |
| Stage 1e: UIC name (150m) | 5 | 0.0% | 0 | 4 | 121.32 | 123.62 | 149.23 |
| Stage 1f: Name (150m) | 24 | 0.0% | 2 | 20 | 74.29 | 60.74 | 126.95 |
| Stage 2: Local ref | 75 | 0.1% | 4 | 26 | 17.29 | 13.84 | 39.9 |
| Stage 3a: Single candidate | 365 | 0.6% | 1 | 39 | - | - | - |
| 3a pass 1 | 331 | 0.5% | 1 | 34 | 12.5 | 8.49 | 43.94 |
| 3a pass 2 | 34 | 0.1% | 0 | 5 | 10.85 | 5.49 | 44.52 |
| Stage 3b: Ratio test | 48 | 0.1% | 0 | 0 | 2.85 | 2.42 | 6.62 |
| Post-processing: ATLAS Duplicate propagation | 327 | 0.5% | 327 | 120 | 23.55 | 8.7 | 100.28 |
| Post-processing: OSM stop partner propagation | 13,502 | 21.4% | 1,663 | 7,990 | 9.11 | 5.47 | 27.52 |
Distance Quality
Cross-Predicate Consistency
If we had matched every pair to its nearest OSM node instead:
Many-to-One Analysis
OSM platform/stop_position groups are counted as one OSM stop in this section.
Distribution: 1 ATLAS to N OSM stops
Ratio key: 1A:2O = one ATLAS platform matched to two OSM stops.
Distribution: N ATLAS to 1 OSM stop
Ratio key: 2A:1O = two ATLAS platforms matched to one OSM stop.
OSM Stops
OSM Stop Inventory
operator:wikidatanetwork:wikidataNon-node OSM Stops (Ways)
OSM Groups (Platform / Stop Position Pairs)
Groups by Type
| Group Type | Count |
|---|---|
| osm_trio | 1,194 |
| osm_pair_uic_equal_15m | 9,813 |
| osm_pair_uic | 2,787 |
| osm_pair_name | 668 |
| osm_pair_name_equal_15m | 30 |
| osm_pair_tram | 317 |
| osm_pair_tram_equal_15m | 12 |
Unmatched
Unmatched Analysis
Unmatched ATLAS (5,792)
| No OSM within 50m | 3,020 |
| Has nearby OSM | 2,772 |
Unmatched OSM (9,101)
| No ATLAS within 50m | 3,345 |
| Has nearby ATLAS | 5,756 |
| Has Attributes | Count |
|---|---|
| UIC & Local Ref & Routes | 157 |
| UIC & Local Ref | 112 |
| UIC & Routes | 1,907 |
| Local Ref & Routes | 85 |
| UIC only | 3,052 |
| Local Ref only | 88 |
| Routes only | 984 |
| None | 2,716 |
ATLAS
ATLAS Duplicates
| Total duplicate SLOIDs | 625 |
| Matched duplicates | 629 |
| Unmatched duplicates | 42 |
Problems
Problem Detection Statistics
| Problem Type | Count |
|---|---|
| Distance — Too far from OSM node | 1,275 |
| Unmatched — Isolated ATLAS / OSM node | 4,541 |
| Attributes — Missing or mismatched data | 973 |
| Problem Type | Count |
|---|---|
| Distance — Too far from OSM node | 4,710 |
| Unmatched — Isolated ATLAS / OSM node | 7,347 |
| Attributes — Missing or mismatched data | 341 |
| Duplicates — OSM or ATLAS side collisions | 330 |
| Problem Type | Count |
|---|---|
| Distance — Too far from OSM node | 5,702 |
| Unmatched — Isolated ATLAS / OSM node | 3,005 |
| Attributes — Missing or mismatched data | 11,161 |
| Duplicates — OSM or ATLAS side collisions | 1,440 |
Routes
Route Coverage
| ATLAS stops matched by GTFS routes | 42,530 / 55,034 | 77.3% |
| OSM stops with routes coverage | 31,930 | 64.9% |
Route-Route Matching
| Metric | Count |
|---|---|
| Route-route links (ATLAS ↔ OSM) | 2,034 |
| Linked ATLAS route IDs | 2,034 |
| Linked OSM route IDs | 2,034 |
| Coverage | Count | Percent |
|---|---|---|
| ATLAS route IDs with link | 2,034 / 3,995 | 50.9% |
| OSM route IDs with link | 2,034 / 2,866 | 71.0% |
| ATLAS itineraries with link | 1,050 / 14,004 | 7.5% |
| OSM itineraries with link | 1,050 / 8,157 | 12.9% |
| ATLAS route directions (total) | 7,523 | - |
| OSM route directions (total) | 4,892 | - |
Match Method Breakdown
| Method | Count | % of Total |
|---|---|---|
| Exact GTFS route ID | 1,555 | 76.5% |
| Normalized GTFS route ID | 479 | 23.5% |
| OSM display route ID matches ATLAS route ID | 0 | 0.0% |
Route Pipeline Shape
| Metric | Mean | Median |
|---|---|---|
| ATLAS itineraries per family | 3.51 | 3.0 |
| OSM itineraries per family | 2.85 | 2.0 |
| GTFS trips collapsed into one ATLAS itinerary | 109.53 | 16.0 |
| Matched family itinerary gap (ATLAS - OSM) | 0.94 | 0.0 |
| Matched Family Comparison | Count |
|---|---|
| Matched families where ATLAS has more itineraries | 880 |
| Matched families with equal itinerary counts | 805 |
| Matched families where OSM has more itineraries | 349 |
OSM Route Family Grouping Method
How OSM route relations are grouped into line families. Priority order: route_master → GTFS route ID → (route, ref, operator, network) → per-relation fallback.
| Grouping Method | Families | % of Families | Relations |
|---|---|---|---|
| route_master (explicit OSM grouping) | 2,660 | 92.8% | 7,905 |
| Normalized GTFS route ID | 8 | 0.3% | 10 |
| Synthetic key (route, ref, operator, network) | 198 | 6.9% | 242 |
| Per-relation fallback (no grouping key) | 0 | 0.0% | 0 |
| Total | 2,866 | 100% | 8,157 |
GTFS stop_id → ATLAS sloid Mapping
Matching Paths
① Direct SLOID match
Matches where original_stop_id directly contains a valid ATLAS SLOID.
② Strict matching
Matches on (UIC number, local ref) = (number, designation) in ATLAS.
③ Coordinate proximity
For remaining: ATLAS stops with the same UIC number within 0.5m (must be unambiguous).
④ Unique-number fallback
For remaining: if a UIC number maps to exactly one ATLAS row, use that sloid.
Cardinality
GTFS stop_id side
ATLAS sloid side
Unmatched Reasons
| Reason | Count |
|---|---|
| No ATLAS candidate for this UIC number | 645 |
| Non-unique ATLAS number (strict miss, fallback not applicable) | 32,362 |
| Coordinate proximity conflict (multiple candidates within radius) | 306 |