About
Data & methodology
Every figure on Gazetteer resolves to a source within one click. The list below is generated
automatically from content/sources.json — attribution strings and licence
terms are not hardcoded anywhere else in the codebase.
Indigenous Data Sovereignty: Gazetteer treats Indigenous Census variables as outcomes of colonisation, not overlays of difference. Where a number describes a community, the framing credits that community. ABS Census 2021 was released under CC BY 4.0 for public use; the responsibility for respectful framing is editorial, not legal.
Sources
ABS Census 2021
Australian Bureau of Statistics (Commonwealth Government of Australia)
Caveats
- Small-count suppression: ABS replaces counts <3 with zeros / randomised perturbation to protect privacy.
- Population floor: SA2s with population <200 are excluded from gazetteer rankings to reduce noise.
- 'Not stated' responses inflate denominators for percentage fields — affects unemployment rate, % no internet, % born overseas, etc.
- Census cadence means refresh is essentially one-shot until the 2026 Census release lands.
- Indigenous status undercount is documented by ABS — figures are 'persons who identified as Indigenous in the Census', not a comprehensive count.
- Some SA2 boundaries shifted between 2016 and 2021 ASGS editions — gazetteer uses 2021 boundaries throughout for consistency.
Fields
-
population(persons) -
Total population
-
median_age(years) -
Median age
-
median_household_income_weekly(AUD/week) -
Median household income (weekly)
Method:
G02— Selected Medians and AveragesEquivalised total household income excluded for v0; raw weekly median used as the visceral number.
-
median_rent_weekly(AUD/week) -
Median rent (weekly)
-
unemployment_rate(percent) -
Unemployment rate
Method:
G43— Labour Force Status by Age by SexDerivation: ABS's own Percent_Unem_loyment_P column from G43 (unemployed / labour force). Excludes 'not in labour force' by construction.
-
pct_renting(percent) -
Households renting
Method:
G37— Tenure Type and Landlord Type by Dwelling StructureDerivation: R_Tot_Total (all rental subcategories summed by ABS) / Total_Total (all tenure types including 'not stated').
-
avg_persons_per_bedroom(persons/bedroom) -
Average persons per bedroom
Method:
G02— Selected Medians and Averagesv0 overcrowding proxy. ABS-published median; values >1.0 indicate more persons than bedrooms on average. Will be superseded by overcrowding_rate (G32 / CNOS) once a dedicated resolver lands.
-
overcrowding_rate(percent) -
Overcrowding rate (Canadian National Occupancy Standard)
Method:
G32— Number of Bedrooms by Number of Persons Usually ResidentDerivation: Households requiring one or more additional bedrooms per CNOS / total occupied private dwellings.
-
pct_no_internet(percent) -
Dwellings with no internet connection
Method:
— -
pct_no_car(percent) -
Dwellings with no motor vehicle
-
pct_single_parent_families(percent) -
Single-parent families (share of all families)
Method:
G29— Family CompositionDerivation: OPF_Total_P (one-parent families, all persons) / Total_P (all families).
-
pct_no_post_school_qualification(percent) -
Adults (15+) with no post-school qualification
Method:
G49— Non-School Qualification: Level of Education by Age by SexDerivation: Persons 15+ where qualification is 'no qualification' or 'not applicable' / total persons 15+ excluding 'not stated'.
-
pct_indigenous(percent) -
Aboriginal and/or Torres Strait Islander population
Method:
G07— Indigenous Status by Age by SexCross-checked against the Aboriginal and Torres Strait Islander Peoples Profile (IP01) where available. Framing per Indigenous Data Sovereignty principle: outcomes-of-colonisation, not 'Indigenous presence as overlay'.
-
pct_born_overseas(percent) -
Persons born overseas
Method:
G01— Selected Person Characteristics by Sex (Birthplace aggregate)Derivation: Birthplace_Elsewhere_P / Tot_P_P. Sourced from G01 rather than G09 (which splits countries across files A–G); G01 carries the Australia/Elsewhere aggregate directly.
-
languages_top_3(language names + counts) -
Top 3 languages used at home (other than English)
Method:
G13— Language Used at Home by SexDerivation: Top 3 by speaker count, excluding 'English only', 'not stated', and 'inadequately described'.
ABS ASGS Edition 3 (2021)
Australian Bureau of Statistics (Commonwealth Government of Australia)
Caveats
- SA2 boundaries are designed for statistical purposes, not administrative — they don't always align with how locals perceive 'suburb' boundaries.
- SAL (Suburb and Locality) layer provides the human-readable suburb names but doesn't tile cleanly to SA2 — gazetteer uses SAL only for labelling, SA2 for data joins.
- Coastline polygons include very large SA2s for sparsely populated regions — visual choropleth needs careful classification (quantile or natural breaks) to avoid the map being dominated by empty outback SA2s.
PlanningAlerts
OpenAustralia Foundation (Australian non-profit)
Caveats
- Coverage is council-by-council, not all-AU — absence of DAs in an SA2 may mean no development OR may mean the council does not contribute to PlanningAlerts.
- DA descriptions are free-text from council planning portals and vary wildly in quality and detail.
- DA status (lodged, approved, refused) is a snapshot — councils update on different cadences.
- PlanningAlerts API rate limits constrain backfill: full historical reload takes multiple days at 1,000 req/day. Gazetteer uses rolling 90-day windows to stay under limits.
- Some councils only publish summary data without precise lat/lng — these DAs are dropped from the spatial overlay and noted in the coverage page.
NPI 2023-24
Department of Climate Change, Energy, the Environment and Water (DCCEEW), Commonwealth Government of Australia
NPI: pending data verification — see methodology.
Caveats
- PENDING VERIFICATION: do not render NPI data on the live site until the endpoint is confirmed and a sample retrieval succeeds. About page should render 'NPI: pending data verification' until then.
- NPI captures facilities above reporting thresholds only — small emitters are excluded by design.
- Self-reported by facility operators; quality varies. Substance categories follow NPI's own taxonomy (~93 substances).
- Emissions are annual totals — no temporal granularity within a reporting year.
- Some facilities aggregate multiple sub-sites into a single report at one lat/lng — geographic precision is operator-dependent.
Attribution grammar
Every per-field stat on the site is stored as a triple { source_id, method_id,
retrieved_at }. When a stat is tapped or hovered, that triple surfaces and links
back to the relevant entry above. This keeps attribution honest: no number appears without a
source, and no source appears only on the About page.