03-survey.Rmd


# Survey about movement ecology

```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE, fig.path = "posts/moveco-images/")
library(tidyverse)
library(ggplot2)
library(ggrepel)
library(dplyr)
library(reshape)
library(RColorBrewer)
library(kableExtra)
library(cowplot)
source("./R/multiplot.R")
data_dir <- "./survey/"
data <- read.csv(paste0(data_dir,"MoveEcoSurvey.csv"),stringsAsFactors = FALSE) 
data <- data %>% 
  filter(Status != 1)
cols_data <- (grep("Q3", colnames(data))[1]:grep("Q19", colnames(data)))
answers <- dim(data)[1] - sum(sapply(1:dim(data)[1],function(x){
  all(is.na(data[x,cols_data]))
}))
cols_data_real <- (grep("Q5", colnames(data))[1]:grep("Q19", colnames(data)))
answers_real <- dim(data)[1] - sum(sapply(1:dim(data)[1],function(x){
  all(is.na(data[x,cols_data_real]))
}))
```

## Description of the survey

As a complementary source of information for our review, we elaborated a survey to get the perspectives of movement ecologists about the field and how it is evolving. 

The exact formulation of the questions in the survey can be downloaded [here](survey/Survey.pdf).


## Participation in the survey 

The survey was designed to be completely anonymous. There was no previous selection of the participants and no probabilistic sampling was involved. The survey was advertised by mailing lists (r-sig-geo and r-sig-ecology), individual emails to researchers, the lab's website <https://mablab.org>, the Gordon Research Conference and Seminar on Movement Ecology of Animals, and Twitter. It took place during the Winter/Spring of 2019.

The survey got exemption from the Institutional Review Board at University of Florida (IRB02 Office, Box 112250, University of Florida, Gainesville, FL 32611-2250).

A total of `r answers` people participated in the survey, and `r answers_real` answered at least one question concerning the field (and not just their years of experience in research or the field). No question was mandatory, so participants could opt to not answer some questions. 


```{r years-research, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_1 <- data %>% 
  select(Q3) %>% 
  filter(!is.na(Q3))
wrong_answer <- which(data_question_1$Q3>1900)
if (length(wrong_answer)>0){
  data_question_1$Q3[wrong_answer] <- 2019-data_question_1$Q3[wrong_answer]
}
# boxplot(data_question_1)

p1 <- ggplot(data = data_question_1, mapping = aes(x = Q3)) +
  geom_histogram(binwidth = 2*diff(quantile(data_question_1$Q3,probs=c(0.25,0.75)))/(nrow(data_question_1)^(1/3)),
                 aes(y=..density..), colour="#999999",fill="white") +
  geom_density(colour="black") +
   theme_classic() +
  theme(axis.text.y = element_text( hjust = 1,vjust=0.5),
                  axis.title =element_text(size=20),
                  axis.text =element_text(size=15)) +
  labs(x = "Years in research")

# show(p1)
```


```{r years-moveco, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Histogram of the years in research of all participants (left), and histogram of the years working on movement (right). Both histograms are skewed to the right.", fig.cap = " "}
data_question_2 <- data %>% 
  select(Q4) %>% 
  filter(!is.na(Q4))

p2 <- ggplot(data = data_question_2, mapping = aes(x = Q4)) +
  geom_histogram(binwidth = 2*diff(quantile(data_question_2$Q4,probs=c(0.25,0.75)))/(nrow(data_question_2)^(1/3)),
                 aes(y=..density..), colour="#999999",fill="white") +
  geom_density(colour="black") +
   theme_classic() +
  theme(axis.text.y = element_text( hjust = 1,vjust=0.5),
                  axis.title =element_text(size=20),
                  axis.text =element_text(size=15)) +
  labs(x = "Years in movement")

multiplot(p1,p2,cols = 2)

```

We first asked the participants how many years they have been doing research, and how many years they have been working on animal movement. They have spent a median of `r round(median(data_question_1$Q3),0)` years in research, and a median of `r round(median(data_question_2$Q4),0)` years in animal movement. 


## Movement ecology framework

In a 2008 article, “A movement ecology paradigm for unifying organismal movement research”, Nathan et al. defined a movement ecology framework where the movement propagation process is produced by the motion and the navigation processes, with internal and external factors affecting movement. We asked the participants:

* Would you say that most research articles in movement ecology analyze these components of the movement ecology framework?

* Would you say that the these components of the movement ecology framework are currently being more, less or equally studied compared to 10 years ago?

The results are shown in the graphs below. Most participants perceived that external factors are being addressed in most movement ecology papers, that motion is being addressed in at least half of the literature, and that navigation and external factors are addressed in less than half. Interestingly, when asked which components were most studied now than 10 years ago, most participants agreed on internal factors. 


```{r moveco-frame, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_3 <- data[,grep("Q5", colnames(data))]
# data_question_3 <- data_question_3 %>% filter(!is.na(Q5))
# boxplot(data_question_2)

# data_question_3 <- lapply(data_question_3, factor)  
categories <- c("Studied in most papers", "Studied in about half of papers", "Studied in a few papers", "I have no idea")
frame_counts <- t(sapply(1:ncol(data_question_3),function(x){
  data_line <- factor(data_question_3[which(!is.na(data_question_3[,x])),x],levels=categories)
  count_p_use <- as.numeric(table(data_line))
  return(count_p_use)
}))  
frame_counts<-data.frame(frame_counts)

new_categories <- c(">50% papers", "~50% of papers", "<50% papers", "No idea")


colnames(frame_counts) <- new_categories
rownames(frame_counts) <- c("Motion","Navigation","Int. Factors", "Ext. Factors")

total_frame <- rowSums(frame_counts)
frame_counts$component<-row.names(frame_counts)

df1<-melt(frame_counts[, ],id.vars='component',variable_name='response')
g<-unlist(by(df1, df1$component, function(x) sum(x$value)))
color.pallete<-brewer.pal(4,'YlGnBu')
color.pallete[1]<-'lightgray'
df1$component<-factor(df1$component,levels=rownames(frame_counts)[order(frame_counts[,1],frame_counts[,2])])
df1$response<-factor(df1$response,levels=rev(new_categories))

p3 <- ggplot(data=df1) + geom_col(aes(x=component, y=value,fill=response)) + ggtitle("Focus on components") +
  ylab('Responses') +coord_flip() +
  scale_fill_manual(values=color.pallete) +theme_classic() +
  theme(axis.text.y = element_text( hjust = 1,vjust=0.5),
        axis.title =element_text(size=20),
        axis.text =element_text(size=16),
        legend.text=element_text(size=10),
        legend.title=element_blank(),
        legend.position="bottom")

# show(p3)
```

```{r moveco-frame past, echo=FALSE, message=F, eval=TRUE, include=TRUE,fig.width = 10, fig.alt = "Horizontal stacked bar plots of the participants opinion on how much the community is focusing on each MEF component (left), and on how much focus there is compared to ten years ago (right). Results described in the text above."}
data_question_8 <- data[,grep("Q10", colnames(data))]
# data_question_3 <- data_question_3 %>% filter(!is.na(Q5))
# boxplot(data_question_2)

# data_question_3 <- lapply(data_question_3, factor)  
categories <- c("More Studied", "Equally studied", "Less studied", "I have no idea")
frame_counts <- t(sapply(1:ncol(data_question_8),function(x){
  data_line <- factor(data_question_8[which(!is.na(data_question_8[,x])),x],levels=categories)
  count_p_use <- as.numeric(table(data_line))
  return(count_p_use)
}))  
frame_counts<-data.frame(frame_counts)

new_categories <- c("> studied", "= studied", "< studied", "No idea")

colnames(frame_counts) <- new_categories
rownames(frame_counts) <- c("Motion","Navigation","Int. Factors", "Ext. Factors")

total_frame <- rowSums(frame_counts)
frame_counts$component<-row.names(frame_counts)

df2<-melt(frame_counts[, ],id.vars='component',variable_name='response')
g2<-unlist(by(df2, df2$component, function(x) sum(x$value)))
color.pallete<-brewer.pal(4,'YlGnBu')
color.pallete[1]<-'lightgray'
df2$component<-factor(df2$component,levels=rownames(frame_counts)[order(frame_counts[,1],frame_counts[,2])])
df2$response<-factor(df2$response,levels=rev(new_categories))

p8 <- ggplot(data=df2) + geom_col(aes(x=component, y=value,fill=response)) + ggtitle("Compared to 10 years ago") +
  ylab('Responses') +coord_flip() +
  scale_fill_manual(values=color.pallete) +theme_classic() +
  theme(axis.text.y = element_text( hjust = 1,vjust=0.5),
        axis.title =element_text(size=20),
        axis.text =element_text(size=16),
        legend.text=element_text(size=10),
        legend.title=element_blank(),
        legend.position="bottom")

multiplot(p3,p8,cols=2)


```

## Taxa

We asked the participants which taxa they considered to be studied the most in movement ecology, and to select up to 3 taxa. The number of votes per taxon are shown in the graph below. Birds and then mammals were indicated as the most studied.


```{r taxa, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Horizontal bar plot of the participants vote on which taxonomical group they considered to be the most studied in movement ecology. In decreasing order of votes: Aves, Mammalia, Sharks and Rays, Bony-fish, Homo Sapiens, Arthropoda, Reptilia, and Amphibia."}
data_question_4 <- data[,grep("Q6", colnames(data))]

categories <- c("Mammalia (excluding Human)", "Aves", "Reptilia (excluding Aves)", "Arthropoda", "Homo Sapiens", 
                "Amphibia", "Osteichthyes (Bony-fish)", "Chondrichthyes (Sharks and Rays)") # NO OTHERS

taxa_counts <- sapply(1:(ncol(data_question_4)-2),function(x){
  sum(!is.na(data_question_4[,x]))
})
  
new_categories <- c("Mammalia", "Aves", "Reptilia", "Arthropoda", "Homo Sapiens", 
                "Amphibia", "Bony-fish", "Sharks and Rays") # NO OTHERS


taxa_df <- data.frame(taxa = new_categories, responses = taxa_counts)

taxa_df <- taxa_df %>% 
  arrange(responses)
taxa_df$taxa <- factor(taxa_df$taxa,levels=taxa_df$taxa)


p4 <- ggplot(data = taxa_df, mapping = aes(x = taxa, y = responses)) +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Taxa") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=20),axis.text.y = element_text(size=16))
show(p4)
```

## Tracking devices

We asked the participants the following questions:

* Which tracking device do you consider to be used the most in movement ecology? (up to 3)

* Which tracking devices do you think are used more often now compared to 10 years ago? (up to 3)

* Which tracking devices do you think are used less often now compared to 10 years ago? (up to 3)


```{r device, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Horizontal bar plots indicating participants' votes on which devices are the most used. In decreasing order of votes: GPS, satellite tags, radio tags, encounter, light loggers, acoustic telemetry, accelerometer, data storage tags, and images."}
data_question_5 <- data[,grep("Q7", colnames(data))]

categories <- c("Light Loggers (e.g. GLS)", "Satellite Tags (e.g. ARGOS, PSAT, PTT, etc.)", "Radio tags (VHF or UHF based)", "GPS", "Images (Video and static)", "Acoustic telemetry", "Accelerometer", "Encounter (e.g capture mark recapture, banding, direct observation)", "data storage tags") # ONE OTHER

device_counts <- sapply(1:(ncol(data_question_5)),function(x){
  sum(!is.na(data_question_5[,x]))
})
device_counts <- device_counts[-(ncol(data_question_5)-1)]


new_categories <- c("Light Loggers", "Satellite Tags", "Radio tags", "GPS", "Images", "Acoustic telemetry", "Accelerometer", "Encounter", "data storage tags") # ONE OTHER
  
device_df <- data.frame(device = new_categories, responses = device_counts)

device_df <- device_df %>% 
  arrange(responses)
device_df$device <- factor(device_df$device,levels=device_df$device)

p5 <- ggplot(data = device_df, mapping = aes(x = device, y = responses)) + ggtitle("Most used") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Device") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
show(p5)
```

```{r device more, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_9 <- data[,grep("Q11", colnames(data))]

categories <- c("Light Loggers (e.g. GLS)", "Satellite Tags (e.g. ARGOS, PSAT, PTT, etc.)", "Radio tags (VHF or UHF based)", "GPS", "Images (Video and static)", "Acoustic telemetry", "Accelerometer", "Encounter (e.g capture mark recapture, banding, direct observation)", "Radar") # ONE OTHER

device_counts <- sapply(1:(ncol(data_question_9)),function(x){
  sum(!is.na(data_question_9[,x]))
})
device_counts <- device_counts[-(ncol(data_question_9)-1)]


new_categories <- c("Light Loggers", "Satellite Tags", "Radio tags", "GPS", "Images", "Acoustic telemetry", "Accelerometer", "Encounter", "data storage tags") # ONE OTHER
  
device_df <- data.frame(device = new_categories, responses = device_counts)

device_df <- device_df %>% 
  arrange(responses)
device_df$device <- factor(device_df$device,levels=device_df$device)

p9 <- ggplot(data = device_df, mapping = aes(x = device, y = responses)) + ggtitle("More used compared to 10 years ago") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Device") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p9)
```


```{r device less, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.width=10, fig.alt = "Left: horizontal bar plots indicating participants' votes on which devices are more used now compared to ten years ago. In decreasing order of votes: accelerometer, gps, acoustic telemetry, satellite tags, light loggers, images, data storage tags, encounter, radio tags. Right: horizontal bar plots indicating participants' votes on which devices are less used now compared to ten years ago. In decreasing order of votes: radio tags, encounter, satellite tags, light loggers, acoustic telemetry, images, none, and gps."}
data_question_10 <- data[,grep("Q12", colnames(data))]

categories <- c("Light Loggers (e.g. GLS)", "Satellite Tags (e.g. ARGOS, PSAT, PTT, etc.)", "Radio tags (VHF or UHF based)", "GPS", "Images (Video and static)", "Acoustic telemetry", "Accelerometer", "Encounter (e.g capture mark recapture, banding, direct observation)", "Other (describe)") # ONE OTHER

device_counts <- sapply(1:(ncol(data_question_10)),function(x){
  sum(!is.na(data_question_10[,x]))
})
device_counts <- device_counts[-(ncol(data_question_10)-1)]


new_categories <- c("Light Loggers", "Satellite Tags", "Radio tags", "GPS", "Images", "Acoustic telemetry", "Accelerometer", "Encounter", "None") # ONE OTHER
  
device_df <- data.frame(device = new_categories, responses = device_counts)

device_df <- device_df %>% 
  arrange(responses) %>% 
  filter(responses > 0)
device_df$device <- factor(device_df$device,levels=device_df$device)

p10 <- ggplot(data = device_df, mapping = aes(x = device, y = responses)) + ggtitle("Less used compared to 10 years ago") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Device") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p10)

# show(p5) 

multiplot(p9,p10,cols=2)
```

The results are shown in the graphs above. GPS and satellite tags (e.g. PSAT, PTT) are the most used devices, according to the participants. When compared to 10 years ago, they expressed that accelerometers and GPS devices are more used than before, while radio tags and encounter techniques (e.g. capture mark recapture, banding, direct observation) are less used. 

## Software

We asked the participants the following questions:

* Which software do you think is used the most for movement analysis? (up to 3)

* For movement analysis, which software do you think are used more often now compared to 10 years ago?  (up to 3)

* For movement analysis, which software do you think are used less often now compared to 10 years ago? (up to 3)

```{r software, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Horizontal bar plots indicating participants' votes on which software are the most used. In decreasing order of votes: R, ArcGIS, QGIS, Python, Matlab, RDBMs."}
data_question_6 <- data[,grep("Q8", colnames(data))]

categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "Relational databases (RDBMs)", "QGIS") # NO OTHER

soft_counts <- sapply(1:(ncol(data_question_6)-2),function(x){
  sum(!is.na(data_question_6[,x]))
})

new_categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "RDBMs", "QGIS") # NO OTHER
  
soft_df <- data.frame(software = new_categories, responses = soft_counts)

soft_df <- soft_df %>% 
  arrange(responses) %>% 
  filter(responses > 0)
soft_df$software <- factor(soft_df$software,levels=soft_df$software)

p6 <- ggplot(data = soft_df, mapping = aes(x = software, y = responses)) + ggtitle("Most used") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Software") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
show(p6)
```


```{r software more, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_11 <- data[,grep("Q13", colnames(data))]

categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "Relational databases (RDBMs)", "QGIS") # NO OTHER

soft_counts <- sapply(1:(ncol(data_question_11)-2),function(x){
  sum(!is.na(data_question_11[,x]))
})

new_categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "RDBMs", "QGIS") # NO OTHER
  
soft_df <- data.frame(software = new_categories, responses = soft_counts)

soft_df <- soft_df %>% 
  arrange(responses) %>% 
  filter(responses > 0)
soft_df$software <- factor(soft_df$software,levels=soft_df$software)

p11 <- ggplot(data = soft_df, mapping = aes(x = software, y = responses)) + ggtitle("More used compared to 10 years ago") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Software") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p11)
```


```{r software less, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.width=10, fig.alt = "Left: horizontal bar plots indicating participants' votes on which software are more used now compared to ten years ago. In decreasing order of votes: R, QGIS, Python, Matlab, ArcGIS, RDBMs. Right: horizontal bar plots indicating participants' votes on which software are less used now compared to ten years ago. In decreasing order of votes: Matlab, ArcGIS, SPSS, SAS, RDBMs, Python."}
data_question_12 <- unlist(data[,grep("Q14", colnames(data))])

data_question_12 <- data_question_12[!is.na(data_question_12)]

new_categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "RDBMs", "QGIS") 

soft <- factor(unlist(sapply(1:length(data_question_12),function(i){
   words <- unlist(strsplit(data_question_12[i], " "))
  unlist(sapply(1:length(words),function(j){
    ind <- grep(words[j],new_categories)
    if (length(ind) > 0){
    return(new_categories[ind])
  }
  })
  )
  
})))

soft_counts <- as.numeric(table(soft))

soft_df <- data.frame(software = levels(soft), responses = soft_counts)

soft_df <- soft_df %>% 
  arrange(responses) %>% 
  filter(responses > 0)
soft_df$software <- factor(soft_df$software,levels=soft_df$software)

p12 <- ggplot(data = soft_df, mapping = aes(x = software, y = responses)) + ggtitle("Less used compared to 10 years ago") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Software") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p12)

multiplot(p11,p12,cols=2)

```


The results are shown in the graphs above. R, and in the a lesser degree, ArcGIS, are the most used software, according to the participants. When compared to 10 years ago, they expressed that R is more used than before, while Matlab, ArcGIS, SPSS and SAS are less used. 


## Methods

We asked the participants the following questions:

* Which statistical/mathematical methods do you consider to be used the most for movement analysis? 

* Which methods do you think are used more often now compared to 10 years ago? (up to 3)

* Which methods do you think are used less often now compared to 10 years ago? (up to 3)

There are many analytical tools applied/developed in movement ecology. To keep it simple, we provided the participants with 9 (arbitrarily selected) options: generalized linear models (GLMs) and generalized additive models (GAMs), machine learning, model selection criteria, multivariate exploratory methods, net squared displacement (NSD), spatial point processes, state-space and Hidden Markov models (SSMs and HMMs, respectively), step and resource selection functions (SSFs and RSFs, respectively), and test statistics and p-values. We provided an "Other" option for researchers to indicate methods that would not be within the 9 other options. In the end, the "Other" option was poorly used.


```{r methods, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Horizontal bar plots indicating participants' votes on which statistical/mathematical methods are the most used. In decreasing order of votes: SSF and RSF, SSM and HMM, GLMs and GAMs, model selection, tests, NSD, spatial point, multivariate, other, and machine learning."}
data_question_7 <- data[,grep("Q9", colnames(data))]

categories <- c("Net squared displacement", "Test statistics and p-values", "Model selection criteria (e.g AIC)", 
                "GLMs and GAMs", "Multivariate exploratory methods", "Machine learning", "Step and resource selection functions", "State-space and Hidden Markov models", "Spatial point processes", "Other") # NO OTHER

methods_counts <- sapply(1:(ncol(data_question_7)),function(x){
  sum(!is.na(data_question_7[,x]))
})
methods_counts <- methods_counts[-(ncol(data_question_7))]

new_categories <- c("NSD", "Tests", "Model selection", 
                "GLMs and GAMs", "Multivariate", "Machine learning", "SSF and RSF", "SSM and HMM", "Spatial point", "Other") 
  
methods_df <- data.frame(method = new_categories, responses = methods_counts)

methods_df <- methods_df %>% 
  arrange(responses) %>% 
  filter(responses > 0)
methods_df$method <- factor(methods_df$method,levels=methods_df$method)

p7 <- ggplot(data = methods_df, mapping = aes(x = method, y = responses)) + ggtitle("Most used") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Method") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
show(p7)
```


```{r methods more, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_13 <- data[,grep("Q15", colnames(data))]

categories <- c("Net squared displacement", "Test statistics and p-values", "Model selection criteria (e.g AIC)", 
                "GLMs and GAMs", "Multivariate exploratory methods", "Machine learning", "Step and resource selection functions", "State-space and Hidden Markov models", "Spatial point processes", "Other") # NO OTHER

methods_counts <- sapply(1:(ncol(data_question_13)),function(x){
  sum(!is.na(data_question_13[,x]))
})
methods_counts <- methods_counts[-(ncol(data_question_13))]

new_categories <- c("NSD", "Tests", "Model selection", 
                "GLMs and GAMs", "Multivariate", "Machine learning", "SSF and RSF", "SSM and HMM", "Spatial point", "Other") 
  
methods_df <- data.frame(method = new_categories, responses = methods_counts)

methods_df <- methods_df %>% 
  arrange(responses) %>% 
  filter(responses > 0)
methods_df$method <- factor(methods_df$method,levels=methods_df$method)

p13 <- ggplot(data = methods_df, mapping = aes(x = method, y = responses)) + ggtitle("More used compared to 10 years ago") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Method") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p13)
```


```{r methods less, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.width=10, fig.alt = "Left: horizontal bar plots indicating participants' votes on which statistical/mathematical methods are more used now compared to ten years ago. In decreasing order of votes: SSM and HMM, machine learning, SSF and RSF, model selection, multivariate, spatial point, GLMs and GAMs, NSD, and other. Right: horizontal bar plots indicating participants' votes on which statistical/mathematical methods are less used now compared to ten years ago. In decreasing order of votes: tests, NSD, multivariate, spatial point, SSF and RSF, model selection, other, and GLMs and GAMs."}
data_question_14 <- data[,grep("Q16", colnames(data))]

categories <- c("Net squared displacement", "Test statistics and p-values", "Model selection criteria (e.g AIC)", 
                "GLMs and GAMs", "Multivariate exploratory methods", "Machine learning", "Step and resource selection functions", "State-space and Hidden Markov models", "Spatial point processes", "Other") # NO OTHER

methods_counts <- sapply(1:(ncol(data_question_14)),function(x){
  sum(!is.na(data_question_14[,x]))
})
methods_counts <- methods_counts[-(ncol(data_question_14))]

new_categories <- c("NSD", "Tests", "Model selection", 
                "GLMs and GAMs", "Multivariate", "Machine learning", "SSF and RSF", "SSM and HMM", "Spatial point", "Other") 
  
methods_df <- data.frame(method = new_categories, responses = methods_counts)

methods_df <- methods_df %>% 
  arrange(responses) %>% 
  filter(responses > 0)
methods_df$method <- factor(methods_df$method,levels=methods_df$method)

p14 <- ggplot(data = methods_df, mapping = aes(x = method, y = responses)) + ggtitle("Less used compared to 10 years ago") +
  geom_bar(stat = "identity",position="identity") +
coord_flip() +  
   xlab("Method") + ylab("Number of responses") +
  theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p14)

multiplot(p13,p14,cols=2)
```


The results are shown in the graphs above. The most used methods in movement ecology, according to the participants, are SSFs and RSFs, SSMs and HMMs, GLMs and GAMs. When compared to 10 years ago, they expressed that SSMs and HMMs, as well as machine learning, and in a lesser degree, SSFs and RSFs are more used, while hypothesis tests are less used. 

The "Other" option was used for speed threshold (1) and mininum convex polygon and kernel density utilization (1), for the first question; visual analytics (1), and continuous time movement modeling (1) for the second question; descriptive metrics (1) and "none" (1) for the third. 

## Big changes in the field

We asked the participants three final open questions.

```{r open questions, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_15 <- data %>% 
  select(Q17) %>% 
  filter(!is.na(Q17))

data_question_16 <- data %>% 
  select(Q18) %>% 
  filter(!is.na(Q18))

data_question_17 <- data %>% 
  select(Q19) %>% 
  filter(!is.na(Q19))

```


* In your opinion, what has revolutionized the field in the last 10 years? (Please keep it to three topics)

`r nrow(data_question_15)` participants answered this first question. 31 answers were related to tagging devices (smaller, long-lasting, cheaper, battery-saving devices) that gave rise to longer or higher resolution data on many species. 15 answers were related to software development and computational power, 6 to statistical methods, 2 to the availability of remote sensing data, 2 to availability of tracking data online (in open data portals like movebank), 1 to molecular markers, 1 to citizen science, and there were other references to "mechanistic thinking and approaches" (1), the increase in the number of studies (1) and Nathan et al. 2008 paper in PNAS (1).


* Compared to 10 years ago, what would you be able to work on now that you could not do 10 years ago? (Please keep it to three topics; your answer may involve taxa, devices, methods or others)

`r nrow(data_question_16)` participants answered this question. 28 answers were related to tagging devices that allowed working with smaller species, obtaining longer datasets, tracking more individuals and having high resolution data on their movement; among them, 4 mentioned the opportunity to work with accelerometry data. 5 answers were related to working with environmental data to link to the movement data (one mentioned it to do habitat selection analysis), 4 people mentioned studies in physiology, 3 of behavior, 3 of movement models and analysis in general, 1 studies on the mechanisms underlying movement, 1 multitaxa comparison, 1 citizen science thanks to apps on the phone. 2 people mentioned computational simulation and 2 processing data. 


* In your opinion, what will revolutionize the field in the next 10 years? (Please keep it to three topics)

`r nrow(data_question_17)` participants answered this question. 21 gave answers related to tagging devices (including use of ICARUS technology, 3D monitoring, development of more reliable and less invasive devices, improvement of accelerometers, and combined use of devices). 5 people referred to the development of statistical and mathematical methods to study movement processes (including machine learning techniques); 3 mentioned the availability of high resolution remote sensing data; 2 answers were related to software and computing power, 2 to connecting movement ecology to physiology, 2 to evolution, 1 to ontogeny, 1 to large-scale navigation experiments, 1 to integrating navigation, genetics, environment, physiology, life history and population dynamic studies; 1 to the development of data platforms, and 1 to "improved conceptualization of the field".


## Summary 

* Most participants in the survey had less than a decade of experience in the field, so their farthest point of comparison in the past is from less than 10 years. 

* Participants identified external factors as the dominant component in movement ecology papers, though recognizing that there is an increasing but still weaker interest in the link between internal factors and movement. 

* Birds and mammals were identified as the most studied taxa in the field. 

* GPS tags were identified as the most used tags, while the use of accelerometers is increasing and the use of radio tags, decreasing. 

* The R software is the most used software according to the participants, and it is far more used than 10 years ago. On the other hand, Matlab, ArcGIS, SPSS and SAS use have decreased in the last decade.

* RSFs, SSFs, SSMs, HMMs, as well as GLMs and GAMs, are the most used statistical methods in movement ecology. SSM, HMM and machine learning techniques have experienced a notable increase in their use in the last decade, while the use of statistical tests, mainly, have decreased. 

* When asked for what has revolutionized the field and what will play a major role in the next 10 years, most of the answers were related to tracking devices (miniaturization, reliability, battery saving, combined use of them). In a lesser degree, software, data and methods to analyze movement were mentioned. Most participants identified tracking technology as the game changer in the field.