INBO CODING CLUB

26 April, 2018

Welcome

Intro: What have I done?

stations <- get_stations("air_pressure") %>%
    filter(stringr::str_detect(station_no, "03"))

air_pressure <- stations %>%
    group_by(ts_id) %>%
    do(get_timeseries_tsid(.$ts_id, period = "P1D", 
                           to = lubridate::today())) %>%
    ungroup() %>%
    left_join(stations, by = "ts_id")

air_pressure %>% 
    ggplot(aes(x = Timestamp, y = Value)) + 
    geom_point() + xlab(format(lubridate::today() - 1, format="%B %d %Y")) + 
    facet_wrap(c("station_name", "stationparameter_name")) + 
    scale_x_datetime(date_labels = "%H:%M",
                     date_breaks = "6 hours")

-> get data of yesterday on Waterinfo.be using the https://inbo.github.io/wateRinfo/ package

If you want to share your code snippet, copy paste your snippet within a section of three backticks (```):

As an example:

library(tidyverse)

...

(you can copy paste this example and add your code further down, but do not fill in your code in this section)

Your snippets:

c(…,…) and | do the same? NOT!

visdata <- read.csv(file = "Copy of 20180426_visdata_cleaned.csv", sep = ",")

soorten1 <- visdata %>%
  filter(str_detect(soort, c("garnaal", "krab", "kreeft")))
soorten2 <- visdata %>%
  filter(str_detect(soort, "garnaal|krab|kreeft"))
anti_join(soorten2, soorten1)
anti_join(soorten1, soorten2)

… and anti-join is a good way of checking the differences!

with str_c(c("garnaal", "krab", "kreeft"), collapse="|") you actually achieve the same…

soorten1 <- visdata %>%
  filter(str_detect(soort, str_c(c("garnaal", "krab", "kreeft"), collapse="|"))
soorten2 <- visdata %>%
  filter(str_detect(soort, "garnaal|krab|kreeft"))
anti_join(soorten2, soorten1)
anti_join(soorten1, soorten2)

the glue::glue() usage

The default date print format:

soorten2 <- visdata %>%
  mutate(meetpuntomschrijving = str_to_lower(meetpuntomschrijving)) %>%
  filter(str_detect(soort, "garnaal|krab|kreeft")) %>%
  mutate(description = 
           glue::glue("{soort} bij {meetpuntomschrijving} op {format(datum, '%A, %B %d, %Y')}"))

Defining a custom date print format (vb. https://www.statmethods.net/input/dates.html to see the meaning of the %x symbols):

soorten2 <- visdata %>%
  mutate(meetpuntomschrijving = str_to_lower(meetpuntomschrijving)) %>%
  filter(str_detect(soort, "garnaal|krab|kreeft")) %>%
  mutate(description = 
           glue::glue("{soort} bij {meetpuntomschrijving} op {format(datum, '%A, %B %d, %Y')}"))

which day of the week? solution

The usage of label provides a label instead of a number and with the locale you can define a language:

my_date <- "August 2nd, 2018 14:00"
wday(mdy_hm(my_date), label = TRUE,
     locale = "English")

check your conflicts in the namespace

When different packages have the same function, this can give problems. To see the potential issues on overlap, check:

conflicts()

Read surveys file and add Date field; solution

read data… surveys <- read_csv("data/20180222_surveys.csv")

cfr.

surveys$date <- dmy(str_c(surveys$day, surveys$month, surveys$year, sep = "-"))

versus:

surveys %>%
    mutate(date = dmy(str_c(day, month, year, sep = "-")))

Remark: `separate` is a tidyr function

fish %>% 
    separate(meetpuntomschrijving, into = c("place_1", "place_2"), sep = " ")

About `lubridate::pretty_dates` and `ggplot`

ggplot(daily_counts, aes(x = day, y = n, group = 1)) +
    geom_line(stat = "identity") +
    scale_x_datetime(breaks = lubridate::pretty_dates(daily_counts$day, n = 5)) +
    ylab("visitors") +
    xlab("")

User stats grofwild; solution

grofwild <- read_delim(file = "../data/20180316_grofwild_logs.csv", delim = " ")

grofwild %>%
  filter(type == "AppStart") %>%
  mutate(hours = hour(time)) %>%
  count(hours) %>%
  complete(data.frame(hours = 0:23), fill = list(n = 0)) %>%
  ggplot() +
  geom_bar(aes(x = hours, y = n), stat = "identity") +
  scale_x_continuous("hour of the day", breaks = seq(0, 23, 2)) +
  ylab("number of visitors")

Reminder about data import:

old-skool R (it's better to not use it)

gent <- read.csv(...)

comma separated values

gent <- readr::read_csv("../data/20180222_survey_data_spreadsheet_tidy.csv")

semicolon separated values

gent <- readr::read_csv2("../data/20180123_gent_groeiperwijk.csv")

Syntax	Example	Reference
# Header	Header	基本排版
- Unordered List	Unordered List
1. Ordered List	Ordered List
- [ ] Todo List	Todo List
> Blockquote	Blockquote
Bold font	Bold font
Italics font	Italics font
~~Strikethrough~~	~~Strikethrough~~
19^th^	19^th
H~2~O	H₂O
++Inserted text++	Inserted text
==Marked text==	Marked text
[link text](https:// "title")	Link
![image alt](https:// "title")	Image
`Code`	`Code`	在筆記中貼入程式碼
```javascript var i = 0; ```	`var i = 0;`	在筆記中貼入程式碼
:smile:		Emoji list
{%youtube youtube_id %}	Externals
$L^aT_eX$	L^aT_eX
:::info This is a alert area. :::	This is a alert area.