The strayr package provides tools to make working with Australian data
easier. This includes:
-
tidy versions of common structures used by the Australian Bureau of Statistics (ABS), like ANZSIC, ANZSCO, and OSCA:
-
a function to tidy up state names (
clean_states()); -
a function that knows whether particular dates are public holidays (
is_holiday()); and -
a table containing the start and end dates of school terms in each state and territory, back to 1978 (
school_terms).
This package is currently in development and subject to change. The
lifecycle badge will be changed to stable when it is stable (should be
relatively soon).
Contribute to this package: people are actively encouraged to contribute to this package.
You can install the current version of strayr with:
remotes::install_github("runapp-aus/strayr")Current structures stored in strayr are:
- Occupation Standard Classification for Australia (OSCA):
osca2024: occupation levels of OSCA, 2024
- Australian and New Zealand Standard Classification of Occupations
(ANZSCO), Cat. 1220.0:
anzsco2022: occupation levels of ANZSCO, 2022.anzsco2021: occupation levels of ANZSCO, 2021.anzsco2019: occupation levels of ANZSCO, 2013, Version 1.3.anzsco2013: occupation levels of ANZSCO, 2013, Version 1.2.anzsco2009: occupation levels ANZSCO, First Edition, Revision 1, 2009.
- Australian and New Zealand Standard Industrial Classification
(ANZSIC), Cat. 1292.0:
anzsic2006: industry levels of ANZSIC, 2006 (Revision 2.0).
- Australian Standard Classification of Education (ASCED), Cat. 1272.0:
The clean_state() function makes it easy to wrangle vectors of State
names and abbreviations - which might be in different forms and possibly
misspelled.
This package includes the auholidays dataset from the Australian
Public Holidays Dates Machine Readable
Dataset
as well as a helper function is_holiday.
This package includes a dataset with the start and end dates of school terms in each Australian state and territory from 1978 to 2024:
school_terms
#> # A tibble: 1,504 × 5
#> state year term start end
#> <chr> <int> <int> <date> <date>
#> 1 NSW 1978 1 1978-02-01 1978-05-05
#> 2 Vic 1978 1 1978-02-06 1978-05-12
#> 3 Qld 1978 1 1978-01-23 1978-04-28
#> 4 SA 1978 1 1978-02-06 1978-05-12
#> 5 WA 1978 1 1978-02-06 1978-05-12
#> 6 Tas 1978 1 1978-02-21 1978-05-26
#> 7 NT 1978 1 1978-02-06 1978-05-12
#> 8 ACT 1978 1 1978-02-01 1978-05-05
#> 9 NSW 1978 2 1978-05-22 1978-08-25
#> 10 Vic 1978 2 1978-05-29 1978-08-25
#> # ℹ 1,494 more rowsThe parse_income_range function provides some tools for extracting
numbers from income ranges commonly used in Australian data. For
example:
parse_income_range("$1-$199 ($1-$10,399)", limit = "lower")
#> [1] 1The strayr package also provides tools to access sf objects
contained in absmapsdata.
See ?strayr::read_absmap for more information.
read_absmap("sa42021")