Rules
Rules we follow to make decisions when designing data standards.
Date formats
Dates can be written in many different ways. To avoid confusion, all our dates comply with the internationally recognised ISO 8601 standard.
A date should be represented as YYYY-MM-DD
, so for example, 4 June 2018 would be written as 2018-06-04
.
Identifiers and references
When we create a unique identifier we:
- avoid the use of personal or private data in the identifier
- create an ID that will always be unique
- never reuse this ID
- assign an ID to all records
Creating unique identifiers can be complex, therefore we try not to ask data providers for true identifiers. Instead we ask for a reference
which we hope is at least unique to their organisation.
A reference
is more human readable. It should be something planners recognise, can be used in document and something a citizen can call up and talk about.
A reference
should be persisent and something the publisher issues and manages.
What we need are persistent, managed references planners would recognise, which appear in documents, and which you can ring up and talk to the call centre about, eg “CA01”, or the planning application reference.
For example, a unique id might be 10003-10927832-18389-183890
whereas a reference
might be CA01
for a conservation area.
A record with a document-url should also have a documentation-url
When we capture a document URL we want to know where it was published. We use a documentation-url to record this.
Knowing where it was published allows us to locate updated versions of the document, particularly if it becomes unavailable at the original URL.
Examples of where we use this rule
- The development plan document dataset in the Development Plan Standard
For geospatial data include both geometry and point fields
In some cases the geometry might not yet exist as data. In these cases, it is better to ask for a point than nothing at all.
Point data, although not as useful as the full geometry data, is still very useful. It still indicates where the entity exists and can be used in geography based searches.
All schemas must include entry, start and end dates
These dates provide information about the validity of the record.
They should not be used to include data that makes up part of the record, we use meaningfully named fields for dates that make up part of the record. Then, these dates are reserved for data about the record.
They are used to tell us:
entry-date
the date when the record was produced. Entry = the date that information has been entered as a record.start-date
the date when the record was known to be true from. More often than not it is the same as theentry-date
. For historical records it will be different. Start = the date the validity of the record starts.end-date
the date when the record is no longer true, current or valid. End = the date the validity of the record ends.
Take tree preservation orders (TPOs) as an example. For TPO records, the date fields are:
About the TPO
made-date
- the date the tree preservation order was “made”confirmed-date
- the date the tree preservation order was “confirmed”
About the record
entry-date
= the date this TPO record was created/provided.start-date
= the date the data in this TPO record was known to be true.end-date
= the date the TPO record is no longer true. This would be the date the TPO expired or was revoked, or the date the record was modified.
Include address-text field as well as urpn field
If you include a uprn
field also include an address-text
field. This protects against cases where no UPRN is available.
When no UPRN exists, a data provider can use the address-text
field to provide a textual address or a description of the location.
Examples of where we use this rule
We follow this rule in the Planning application standard and Tree preservation order standard