OAs and SOAs - Stick or Twist?

Posted by Nick Stripe on November 24th 2006

The key issue to be decided is whether we should aim for stability with OAs and SOAs through the next Census and beyond.  The consultation paper describes the issues surrounding this decision.  I know that there are many different views held.  Here are a selection of the comments made to me in the last couple of weeks, as I've been trailing the consultation:

  • redraw the geography completely using the same algorithm but based on 2011 Census data and 2011 postcodes;
  • keep SOAs, but redraw OAs within fixed SOA boundaries using 2011 Census data;
  • the fundamental building blocks need to be real not abstract: redraw the whole lot and build it up from street blocks;
  • we have to have ward, community, and parish level data too;
  • now it's established you must keep it fixed, that's what it's for;
  • as long as you've got grid referenced data you can give us any geography we want.  The disclosure risk from overlapping geographies is not so great.

To me, the message that has always come through most consistently, from members of all sectors of our user community, has been the stability one.  

Whether we can release data for more than one small area geography or not, the main geography must be consistent and stable.  That's what we've never had before.  Do you agree?  Let's open up the debate.  What should we do, and why?

SOA Census Output

Posted by Geoff Mole (not verified) on 08/02/2007 - 10:55

I use SOA data as building blocks for gp practice data. In East London where I work ethnic data for practices is of great interest.

SOAs would have much more interest currently, if the 2001 Census had been re-analysed at this level for richer tables than OA based aggregates provide. For my work this is particularly the case in tabulation of population by age, sex, and ethnic group.

Technically the 2001 Census could be reanalysed, but at too great a cost. It would be interesting though to have an idea what the 2011 output for SOAs could be -if an SOA level of table is introduced?
This must have a big effect on how greatly we should value SOAs.

OAs and SOAs

Posted by Simon Whalley (not verified) on 22/01/2007 - 15:56

The development of sensible geographical units which describe the area that they are trying to represent was one of the huge benefits of the 2001 Census from research pioneered by David Martin and Stan Openshaw. They allow the comparison of different areas knowing that the results you get are not biased by the areas having a big difference in the underlying population of the different areas.

Being at a finer scale has allowed much more detailed analysis to be carried out. I would argue that the principal of these geographical units should not be dropped for the next Census. They are a great idea and seeing that individual records can never be released, the best compromise available.

Changing the definition of Output Areas or Super Output Areas will annoy any researcher who wants to carry out time series analysis. However, areas do change over time, urban decline or new housing developments as a result of government planning. Where significant change has occurred, surely these areas should be re-computed so that at the next Census, the geographical units best describe the areas they are meant to represent. This will give researchers or anyone who is interested in using these data the idea that they can sensibly analyse these data.

Wards are a throw back to the dark ages when geographical units were created for ease of use rather than any idea of allowing sensible analysis to be carried out. Super Output Areas are far more sensible units to use instead and can be used as compariable units. The Index of Multiple Deprivation is created at this level. Other data sets like mid-year population estimates are now produced at this level. Government policy seems to be releasing data sets at this level at quite regular intervals (biannual). With wards consistently changing for non-geographical reasons these are all strong arguments for using Super Output Areas, and ignoring Wards.

OA/SOA boundaries from a health data perspective

Posted by Anonymous (not verified) on 26/01/2007 - 10:19

I work with health data and have had to adjust to using the SOA boundaries from ward boundaries. The benefit to them is that if national data is published at a consistent geograhy things can be compared. This isn't just about census data - all ONS data where possible was meant to be available at SOA level. Time series for health (and for many other areas) is really important for prioritising services.
I would like to see lower layer SOA boundaries remain the same and OA below that change to match areas. That way people can still create their own boundaries from lower layer SOA boundaries to suit their work. All organisations and services are interested in different geographical areas and this seems like a reasonable way of keeping a level of consistancy and giving people some flexibility in the areas they work with.

Fit for purpose

Posted by Richard Price (not verified) on 19/01/2007 - 10:48

It seems to me that the most important feature of Output Areas for 2011 is that they are for reporting of output from the 2011 census data. To restrict them to a geography that will be 10 years out of date by the time the new census data starts to become available and 20 years by the time the data is replaced, undermines the value available from the census. Very little other official data is available at OA level, so time series analaysis is restricted anyway, for that it would be better to allow the OA's to be redrawn but restrict them to middle level SOA boundaries.
I would also say that this is why I totally agree with David Martin's comments that the use of ward boundaries cannot be seriously, considered they are simply too unstable.
I am also concerned that there does not appear to be details of how OA change will be allowed under the 'stable' option and whether there is a commitment to keep average OA size the same. Will OA's that are now too small for disclosure control be added to one other neighbouring OA (increasing the average size of OA's) or split between several (in which case what will be the basis for doing this)? Similarly if an OA grows and is now double the limit will two OA's be created by splitting the original? Whilst what happens if two neighbouring OA's grow, but not sufficiently to split will they be sub-divided to create a third?

Re - Fit for purpose

Posted by Nick Stripe on 19/01/2007 - 12:29

Thanks for your thoughts Richard. I'll just quickly clarify thinking on handling OA change under the stable option:

- where OAs become too small they will simply be amalgamated with a neighbour, not split between several;
- where OAs become too large, they will be split within their existing boundary;
- if two neighbouring OAs grow, but neither grow sufficiently to split, they will not be.

We will be working within a preferred OA size range. The average size of OAs is likely to increase, but only slightly. The tightness of the grouping around this mean size is likely to become slightly less pronounced as well.

Geography to drive policy

Posted by Neil Bendel (Health Intelligence Manager, Manchester) (not verified) on 11/01/2007 - 09:43

The debate about geography is always interesting but in the drive to achieve the 'purest' statistical geography people tend to forget that a key purpose of creating data for small geographies is to influence where and how services are delivered and in doing so to make a difference to people's life. Hence there has to be a connection between the geography units at which data is OUTPUT and the areas at which services are DELIVERED. Wards may be imperfect from a theoretical point of view (and I agree with much of what is said) but in many local authorities and PCTs they are used as basis for service delivery and democratic accountability through ward co-ordination arrangements etc. The need to monitor the effectiveness and impact of these services means that ward level geography is, and should continue, to be a cornerstone of ONS policy.

What is the breakdown?

Posted by Philip Jones (not verified) on 13/12/2006 - 10:05

From what I've asked, it's traditional areas that people are interested in. Traditional counties, dioceses and parishes are what many people are interested in when it comes to census data.

Geographical stability over time

Posted by Paul Williamson (not verified) on 11/12/2006 - 12:41

Personally I'm delighted to hear that ONS are finally considering prioritising geographical comparability over time ahead of capturing "this year's" ward geography. (In 2001, of course, the decision was made the other way around.) Given where we are at, keeping the current SOAs for mid-layer 2011 outputs is the obvious way to go - if nothing else, it allows a more direct hook into the wealth of non-census information now made for these boundaries. Probably the same arguement applies for 2001 OAs, although I would be interested to see how many have to be changed/merged in order to keep populations above disclosure thresholds - or to reflect the building of new roads that cleave existing 'neighbourhoods' in half. The downside of going with SOAs and OAs is that, over time, they will become an increasingly irrational basis for an output geography due to population and other changes (ward bounardy changes, road and house building etc) that undermine the rationale for their original creation. For this reason, I still continue to believe that the 'real' solution to all of these problems is to adopt a grid-based geography, with sub-division/ aggregation of grid cells over time to keep cell populations only just above threshold, and with look-up tables (and utilities) provided to allow users to derive best estimates of their desired output geography(ies). A helpful spin-off is that the digitised boundaries of a cell-based geography would be cheap and easy to create and disseminate without the current troublesome copyright issues that attach to OA/SOA boundaries. From this point of view, going with SOAs for 2011 merely puts of the 'inevitable' for another few years...

Stick or twist?

Posted by David Martin (not verified) on 06/12/2006 - 21:30

As, perhaps, the person who can lay most claim to being the originator of the current output areas (but not working for ONS) I'd like to comment on a couple of threads emerging from this discussion:

Firstly, I would be quite willing to see us creating an output geography again from scratch, but only in return for some major, unambiguous benefits all round . In that category I would consider a really good match to Mastermap, or perhaps to a set of stable postcodes (neither of which we are sure can be delivered at present), but not just so that we can have a different set of small areas with fundamentally the same characteristics as the ones we already have. That route guarantees no clear benefit to anyone and cannot ultimately resolve any of our likes or dislikes about the present OAs. On the contrary it would prevent geographically consistent comparison over time (a massive lost opportunity) and would mean that users would have to reconstruct all their GIS applications based on the census (costly and inefficient), without any gains of corresponding magnitude.

Secondly, the issue of wards has of course cropped up and they have even been suggested as the geography to hold onto for the sake of consistency. Contemporary wards are the single most important geography for local government. However, they are renamed, recoded and redrawn continually and at different dates in different parts of the country; they range in size from under 1,000 to over 30,000 and there is no freely available, updated, boundary set. Perhaps their greatest failing is that people think they understand them when they don't, leading to massive confusion as to whether ward data are being matched to the correct codes and boundaries. I have no problem with the production of ward statistics as a secondary output product, but they are deficient in almost all important regards as the basis for an official statistical geography.

Geography

Posted by Census Data User (not verified) on 01/12/2006 - 14:27

As with all geographies whoever conceives it thinks it works. Rarely does it end up going ahead without interventions that go against the ethos of the original idea thus compromising the results from the beginning.

There are two points I'd like to make are.
1. Measuring change over time is an admirable aim, but what are you trying to measure?
• the change in land usage or
The main components here are land type, tenure and building types.

• the change in the composition in people.

Land doesn't move but people do. If you are measuring the change of people over time and not using linked records you are likely to be measuring different people using different snapshots in time.

Areas of deprivation can be measured and re-measured, most research points to a churning effect in undesirable areas with one group of residents being replaced by another group. An increase in affluence allows movement from these undesirable areas, to better areas.

As areas around water, transport infrastructure and inner cities increases in desirability the people who are in these areas change.

Mixed tenure developments suffer from similar issues with the less desirable housing usually being provided as affordable housing.

2. How much emphasis should be put on historical boundaries?
Are we more concerned in the here and now or historical aspect?
Should we not invest in algorithms to convert old data to new boundaries, rather than locking ourselves into something that might have been good when conceived but doesn't fit now.

I think that there would be less resistance in identify people from historical data that current.

Reconfiguring data from former areas

Posted by Richard Cooper (not verified) on 12/02/2007 - 17:37

The coment "Should we not invest in algorithms to convert old data to new boundaries, rather than locking ourselves into something that might have been good when conceived but doesn't fit now."
Is an interesting one -THe idea is worth considering, but as with David Martins post, I would ask how the benefits balance out.

Certainly the value of older data is lessened if you consider Censusses. This makes restructuring on the face of it a good option, but the number of datasets that would be restructured, leading to confusion when looking at older data reports, would be significant. In addition, the increasing use of regular and more frequent data means that more frequent reconfiguring could be very costly, and the decision on when it should be done a difficult one.

Consequently the 2001 Census makes sense as a 'review point'. This is something that I hadn't thought sensible until considering this, but the practicality seems to outweigh the advantage.

And what of the advantage - the reason is driven by a geography that 'doesn't fit now'. I'd ask the question 'Why?' if it is changes to population, etc. the polciy would handle that. If it is impractical boundaries, then that is nothing new, they always were problematic (no fault of David's work).

Previously a stumbling block in restructuring was the confidentiality restrictions from overlaid geographies. Is that still a problem?

Finally, the point OAs was flexibility, in order to create different user areas. In theory this would overcome the problem of becoming out-of-date, but it appears not to have done.

The question remains 'Why change?' and the answer remains, it seems, because they were not sufficiently fit for purpose in the first place, rather than things have changed around them.

Measuring change in social and domestic cohesion

Posted by Nick Gulliford (not verified) on 03/12/2006 - 10:41

There are 8 indicators of deprivation listed at the SEU web site and 7 indices published in the ONS neighbourhood statistics pages.

The missing indicator is 'family breakdown'.

I am not a statistician, but I should have thought that an index could be compiled for 'social and domestic cohesion' based on domestic violence, abortion, teenage pregnancies, STI's, out of wedlock births, single parents, marriage/divorce ratio, truancy, etc.

I'm interested in local community and faith leaders - GP's, school governors, parish councillors, clergy etc. - being able to measure easily whether their neighbourhoods are becoming more or less cohesive.

This would enable them to assess better whether local policies, programmes and educational courses addressing STI's, teenage pregnancy, truancy, family breakdown etc. are being effective.

Re:

Posted by Nick Stripe on 07/12/2006 - 09:33

Thanks for the comment, but I'm not sure what your concern is in respect of the geography?

Re - Geography

Posted by Nick Stripe on 01/12/2006 - 15:07

Some interesting comments here. Thank you. I'll ruminate on them a bit more and may return to them. But my initial thoughts surround:
1. Emphasising our desire to assess whether there may be methods that could be employed to protect confidentiality with overlapping small area geographies for the Census.
2. Re-iterating that we need a geography that works for non Census datasets too - e.g. all the administrative datasets on Neighbourhood Stats. We don't have access to unit records for these, they arrive pre aggregated - usually to relatively large building blocks (SOAs not OAs). This brings a problem in trying to convert these data to alternative boundaries.

OAs/SOAs

Posted by Phil Smith (not verified) on 01/12/2006 - 14:25

Understand the need for a consistent geography, but I've never found OAs/SOAs very useful. Normally my clients want to put a name to a location, so wards are much better, even if its simply area n in ward x.

I though OAs and SOAs were are move towards PostCode geography? Given that most people using GIS will have PostCode Admin Boundary converters, why not accept PostCodes as inevitable?

OAs/SOAs

Posted by Alison Peacock (not verified) on 12/12/2006 - 15:02

The availability of free digital boundaries for OAs/SOAs has opened up the potential of GIS for many organisations other than government, academic and commercial. Not all of these currently have access to digital postal geographies or postcode/admin boundary converters - if a decision is made to move in the direction of postcodes, will digital boundaries still be freely available?

Free Digital Boundaries

Posted by Richard Price (not verified) on 09/02/2007 - 10:57

Whilst boundaries are free for end users, they are not free for comercial VAR's to supply on to end users!? Note this is an OS issue but needs resolving before any decision is made about boundary sets.

Stability, consistency, democracy

Posted by Simon Dickson (not verified) on 30/11/2006 - 12:16

There's absolutely no doubt in my mind that a stable approach to geography would have a huge beneficial impact on official statistics generally, and their use across the entire population.

I spent more than two years working at ONS, leading web development efforts. We were ambitious, and I don't regret that: but every time we worked up an idea to make statistics more accessible to the general public, someone would throw the geography spanner into the works, and things would become too complicated to ever deliver. (If you need a case study, just look at Neighbourhood Statistics - sorry folks.) I felt it was becoming a stock excuse, and it made people start projects with an expectation of failure.

My memory may be failing me here, but weren't OAs and SOAs conceived with the intention of being a consistent future-proof standard? I have been out of the stats business for some time, so I don't know what has happened in the meanwhile.

But I do know one thing: a bad standard is still much, much better than no standard at all.

To redraw or not to redraw?

Posted by LSC Planning Manager (not verified) on 29/11/2006 - 15:33

I think the previous comment usefully summarises the issues. Planning and monitoring change and progress year on year are difficult to carry out without fixed borders to ensure comparisons are meaningful. However, populations change anyway (regardless of the problems caused to planners thereby!)so we are never really comparing like with like in population terms, merely geographical terms - and geographical/spatial research and planning are rarely carried out without consideration of the resident population.
So I would suggest that the areas be redrawn to reflect reality, but to enable tracking, let's also have ward level data.

A stable geography?

Posted by Analyst (not verified) on 28/11/2006 - 08:27

A stable geography supports analysis over time which is a good thing. However the super output areas do not always match the areas we wish to analyse and that can cause a problem so should there be a redraw?
I think there are two points to consider
(1) If SOAs were redrawn now to reflect local communities how long would it be before there were calls to redraw again because communities had changed?
(2) If it were possible to produce statistics for specific communities from SOAs would people be more inclined to forget to take into account other factor such as year on year variability and changes to the underlying population when making comparisons?
I think the current boundaries should be kept - what may be needed is more help and guidance for people who are unfamiliar with them and so still want figures for administrative areas such as wards.
I look forward to seeing what other people think.

Timer series and geography

Posted by Jayne Mills (not verified) on 23/01/2007 - 16:41

We cannot fall in into the trap of thinking that as long as we keep geographies stable then time series analysis between censues is straight forward. Definitional changes, changing census categories, census undercounting, etc etc will make comparisons difficult/flawed.

OAs are the building blocks and so should reflect the current situation on the ground if they are to be useful. Ideally LSOAs should remain stable. Still need ward data for all the reasons outlined earlier.

Time Series analysis

Posted by Steven Ward (not verified) on 09/02/2007 - 14:54

So most geographies change over time, but most people want to analyse data over time.

The need to analyse areas (small or otherwise) over time is for me driven by the continuing geographical inequalities that exist between areas and the desire of governments (at whatever level) to try and reduce these inequalities.

ONS, as a government body, surely needs to produce outputs that can effectively measure the effect of government policies.

I take on board the comments re population changes over time (ie that you are never measuring the same people), but this applies at every level, right up to a global level - the people on the planet today aren't the same as they were yesterday.

Maybe what we need is a consistent small area geography, from which other geographies can be built and changed as necessary ... maybe we could call them output areas?