knitr::opts_chunk$set(echo = TRUE, warning = FALSE)

The data used in this analysis were obtained from peer-reviewed articles and trial network data after systematically reviewing Brazilian literature on Fusarium head bligh (FHB)after 2000. The experiments were conducted in the south Brazil, the main wheat growing region and where FHB causes significant losses due to warm and wet springs. The disease and yield data used in our study has been published as a summary means for different treatments within experiments with replicates. There were two response variables of our interest in the studies:

  • FHB index (%, sev): proportion of diseased spikelets in the sample of spikes
  • Crop yield (kg/ha, yield): grain weight at 13% humidity

Data from each trial were extracted from the publication or spreasheets and organized in a single spreadsheet where each row represents a treatment within a trial (study).

Data import

Let’s import the dataset and select only the important variables for performing our analysis.

library(tidyverse) # for data exploration, wrangling and plottting

fhb_dat <- read_csv("data/fhb_data.csv")
head(fhb_dat)

Let’s have a quick look on all variables available and their types.

dplyr::glimpse(fhb_dat)
## Observations: 317
## Variables: 23
## $ trial       <dbl> 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2…
## $ treatment   <dbl> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 1…
## $ year        <dbl> 2000, 2000, 2000, 2000, 2000, 2000, 2000, 2000, 2000…
## $ location    <chr> "Passo Fundo", "Passo Fundo", "Passo Fundo", "Passo …
## $ state       <chr> "RS", "RS", "RS", "RS", "RS", "RS", "RS", "RS", "RS"…
## $ lon         <dbl> -52.40667, -52.40667, -52.40667, -52.40667, -52.4066…
## $ lat         <dbl> -28.26278, -28.26278, -28.26278, -28.26278, -28.2627…
## $ sowing      <dbl> 36703, 36703, 36703, 36703, 36703, 36703, 36703, 367…
## $ cultivar    <chr> "BRS23", "BRS23", "BRS23", "BRS23", "BRS23", "BRS23"…
## $ resistance  <chr> "S", "S", "S", "S", "S", "S", "S", "S", "S", "S", "S…
## $ reaction    <chr> "S", "S", "S", "S", "S", "S", "S", "S", "S", "S", "S…
## $ n           <dbl> 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9…
## $ fungicide   <chr> "AACHECK", NA, "Folicur 200 CE", "Folicur 200 CE", "…
## $ AI          <chr> "Check", NA, "Tebuconazole", "Tebuconazole", "Tebuco…
## $ rep         <dbl> 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4…
## $ inc_check   <dbl> 73.3, 73.3, 73.3, 73.3, 73.3, 73.3, 73.3, 73.3, 73.3…
## $ inc         <dbl> 73.3, 63.2, 44.6, 48.3, 54.8, 36.3, 42.9, 34.2, 12.8…
## $ sev_check   <dbl> 38.8, 38.8, 38.8, 38.8, 38.8, 38.8, 38.8, 38.8, 38.8…
## $ sev         <dbl> 38.80, 31.20, 16.60, 18.90, 19.50, 11.40, 13.90, 12.…
## $ yield_check <dbl> 2838, 2838, 2838, 2838, 2838, 2838, 2838, 2838, 2838…
## $ yield       <dbl> 2838, 3149, 3366, 3484, 3600, 3564, 3735, 3802, 4312…
## $ author      <chr> "Panisson et al.2002", "Panisson et al.2002", "Panis…
## $ publication <chr> "Artigo", "Artigo", "Artigo", "Artigo", "Artigo", "A…

Let’s reduce the number of columns by selecting the most important variables for our analysis.

fhb_dat <- fhb_dat %>%
  select(1:7, 12, 19, 21)

Create yield class

Here we will group the yield data into two production situations, low or high yield, based on median of maximum yields across trials.

# Summarizing the yield max by study
yield_max <- fhb_dat %>%
  group_by(trial) %>% 
  summarize(max_yield = max(yield))

# Grouping the yield max informationnew information in a new column
fhb_dat_selec <- left_join(fhb_dat, yield_max)

summary(yield_max$max_yield) # the median value of the yield mx by trials was 3631 kg/ha
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    1460    2919    3631    3676    4312    5692
# Create a new categorical variable - Baseline yield

fhb_dat_selec <- fhb_dat_selec %>%
  mutate(yield_class = case_when(max_yield <= 3631 ~ "low",
                         max_yield > 3631 ~ "high"))

Export dataset

library(readr)
write_csv(fhb_dat_selec, "data/fhb_data_new.csv")
Copyright 2017 Emerson Del Ponte