Martin Westgate
about: who we are
code: what we build
workflows: what we recommend
future: where we’re going
who we are
Dr Shandiya Balasubramaniam
Dr Dax Kellie
Dr Martin Westgate
Dr Amanda Buyan
Ms Juliet Seers
principles
(science & decision) support
modern, reproducible workflows require code
community-building for data cleaning, publication and re-use
data life-cycle
publish, check, re-use
what we build
open science package suite
galah
potions
galaxias
corella
delma
galah: access data from the GBIF node network
galah: tidy syntax
# A tibble: 5 × 2
  species                count
  <chr>                  <int>
1 Perameles nasuta         967
2 Perameles fasciata       344
3 Perameles gunnii         106
4 Perameles pallescens      89
5 Perameles bougainville     6
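A table like the one above could come from a galah query along these lines. This is a minimal sketch, assuming the dplyr-style verbs available in galah 2.x. The search term "Perameles" is inferred from the species names shown, and whatever filters the original slide applied are not known, so the counts returned will differ.

```r
library(galah)
library(dplyr)  # generics for group_by(), count() and collect()

# Build the query lazily: nothing is sent to the atlas yet
query <- galah_call() |>
  identify("Perameles") |>  # restrict to one genus (assumed search term)
  group_by(species) |>      # one row per species
  count()                   # ask for record counts rather than records

# collect() sends the query and returns a tibble (needs network access)
species_counts <- collect(query)
species_counts
```

Any `filter()` step, for example by year or region, would sit between `identify()` and `group_by()`.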
galah: reproducible workflows
# A tibble: 3 × 4
  occurrenceID                      eventDate           species occurrenceStatus
  <chr>                             <dttm>              <chr>   <chr>
1 https://naturemapr.org/sightings… 2025-01-14 06:47:00 Perame… PRESENT
2 https://naturemapr.org/sightings… 2025-01-12 08:17:00 Perame… PRESENT
3 https://naturemapr.org/sightings… 2025-01-09 09:27:00 Perame… PRESENT
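An occurrence table with these four columns could be requested roughly as follows. This is a sketch under assumptions, not the query from the original slide: the taxon and the year filter are guesses based on the rows shown, and the email address is a placeholder that must be replaced with one registered with the ALA.

```r
library(galah)
library(dplyr)  # generics for filter(), select() and collect()

# Occurrence downloads need a registered email (placeholder shown here)
galah_config(email = "your-email@example.com")

# Build the query lazily: nothing is sent to the atlas yet
query <- galah_call() |>
  identify("Perameles") |>   # assumed search term
  filter(year == 2025) |>    # assumed filter; the dates above are from 2025
  select(occurrenceID, eventDate, species, occurrenceStatus)
```

Calling `collect(query)` then sends the request and returns the records as a tibble. Keeping the whole query in a script is what makes the download repeatable.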
galah: access sensitive data (coming soon)
Request access via https://www.rasd.org.au
galaxias: format & publish biodiversity data
galaxias: assumptions about scientists
may want to publish data, but not have the tools to do so
don’t want to learn an unfamiliar data format
should retain control over what data is published, when, and by whom
galaxias: format to Darwin Core
library(corella)    # set_*() functions, composite_id(), sequential_id()
library(lubridate)  # dmy()

# df: raw observations with location_id, latitude, longitude, date, species
my_data_dwc <- df |>
  set_occurrences(occurrenceID = composite_id(location_id, sequential_id()),
                  basisOfRecord = "humanObservation") |>
  set_coordinates(decimalLatitude = latitude,
                  decimalLongitude = longitude) |>
  set_datetime(dmy(date)) |>
  set_scientific_name(scientificName = species,
                      taxonRank = "species")

my_data_dwc
# A tibble: 2 × 8
  location_id occurrenceID basisOfRecord    decimalLatitude decimalLongitude
  <chr>       <chr>        <chr>                      <dbl>            <dbl>
1 A           A-01         humanObservation           -35.3             149.
2 B           B-02         humanObservation           -35.3             149.
# ℹ 3 more variables: eventDate <date>, scientificName <chr>, taxonRank <chr>
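The occurrenceID values in the output (A-01, B-02) join each row's location_id to a zero-padded row counter. The base R sketch below reproduces that pattern for illustration only. It is not corella's implementation, and the two-digit padding is an assumption read off the output.

```r
# Hypothetical input, matching the location_id values in the output above
location_id <- c("A", "B")

# Zero-padded row counter: "01", "02", ...
row_counter <- sprintf("%02d", seq_along(location_id))

# Join the parts with a hyphen to give one identifier per row
occurrence_id <- paste(location_id, row_counter, sep = "-")
occurrence_id
```

Because the counter differs on every row, the identifier stays unique even when several records share a location.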
what we recommend
ALA labs: home
ALA labs: posts
data cleaning
where we’re going
themes
artificial intelligence and machine learning
biodiversity indicators
data quality
artificial intelligence & machine learning
biodiversity indicators
data quality
The ALA Science & Decision Support Team are:
https://labs.ala.org.au
These slides were made with Quarto, R, and:
Martin Westgate | Atlas of Living Australia