-
Notifications
You must be signed in to change notification settings - Fork 0
/
03-survey.Rmd
594 lines (403 loc) · 31 KB
/
03-survey.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
# Survey about movement ecology
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE, fig.path = "posts/moveco-images/")
library(tidyverse)
library(ggplot2)
library(ggrepel)
library(dplyr)
library(reshape)
library(RColorBrewer)
library(kableExtra)
library(cowplot)
source("./R/multiplot.R")
data_dir <- "./survey/"
data <- read.csv(paste0(data_dir,"MoveEcoSurvey.csv"),stringsAsFactors = FALSE)
data <- data %>%
filter(Status != 1)
cols_data <- (grep("Q3", colnames(data))[1]:grep("Q19", colnames(data)))
answers <- dim(data)[1] - sum(sapply(1:dim(data)[1],function(x){
all(is.na(data[x,cols_data]))
}))
cols_data_real <- (grep("Q5", colnames(data))[1]:grep("Q19", colnames(data)))
answers_real <- dim(data)[1] - sum(sapply(1:dim(data)[1],function(x){
all(is.na(data[x,cols_data_real]))
}))
```
## Description of the survey
As a complementary source of information for our review, we elaborated a survey to get the perspectives of movement ecologists about the field and how it is evolving.
The exact formulation of the questions in the survey can be downloaded [here](survey/Survey.pdf).
## Participation in the survey
The survey was designed to be completely anonymous. There was no previous selection of the participants and no probabilistic sampling was involved. The survey was advertised by mailing lists (r-sig-geo and r-sig-ecology), individual emails to researchers, the lab's website <https://mablab.org>, the Gordon Research Conference and Seminar on Movement Ecology of Animals, and Twitter. It took place during the Winter/Spring of 2019.
The survey got exemption from the Institutional Review Board at University of Florida (IRB02 Office, Box 112250, University of Florida, Gainesville, FL 32611-2250).
A total of `r answers` people participated in the survey, and `r answers_real` answered at least one question concerning the field (and not just their years of experience in research or the field). No question was mandatory, so participants could opt to not answer some questions.
```{r years-research, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_1 <- data %>%
select(Q3) %>%
filter(!is.na(Q3))
wrong_answer <- which(data_question_1$Q3>1900)
if (length(wrong_answer)>0){
data_question_1$Q3[wrong_answer] <- 2019-data_question_1$Q3[wrong_answer]
}
# boxplot(data_question_1)
p1 <- ggplot(data = data_question_1, mapping = aes(x = Q3)) +
geom_histogram(binwidth = 2*diff(quantile(data_question_1$Q3,probs=c(0.25,0.75)))/(nrow(data_question_1)^(1/3)),
aes(y=..density..), colour="#999999",fill="white") +
geom_density(colour="black") +
theme_classic() +
theme(axis.text.y = element_text( hjust = 1,vjust=0.5),
axis.title =element_text(size=20),
axis.text =element_text(size=15)) +
labs(x = "Years in research")
# show(p1)
```
```{r years-moveco, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Histogram of the years in research of all participants (left), and histogram of the years working on movement (right). Both histograms are skewed to the right.", fig.cap = " "}
data_question_2 <- data %>%
select(Q4) %>%
filter(!is.na(Q4))
p2 <- ggplot(data = data_question_2, mapping = aes(x = Q4)) +
geom_histogram(binwidth = 2*diff(quantile(data_question_2$Q4,probs=c(0.25,0.75)))/(nrow(data_question_2)^(1/3)),
aes(y=..density..), colour="#999999",fill="white") +
geom_density(colour="black") +
theme_classic() +
theme(axis.text.y = element_text( hjust = 1,vjust=0.5),
axis.title =element_text(size=20),
axis.text =element_text(size=15)) +
labs(x = "Years in movement")
multiplot(p1,p2,cols = 2)
```
We first asked the participants how many years they have been doing research, and how many years they have been working on animal movement. They have spent a median of `r round(median(data_question_1$Q3),0)` years in research, and a median of `r round(median(data_question_2$Q4),0)` years in animal movement.
## Movement ecology framework
In a 2008 article, “A movement ecology paradigm for unifying organismal movement research”, Nathan et al. defined a movement ecology framework where the movement propagation process is produced by the motion and the navigation processes, with internal and external factors affecting movement. We asked the participants:
* Would you say that most research articles in movement ecology analyze these components of the movement ecology framework?
* Would you say that the these components of the movement ecology framework are currently being more, less or equally studied compared to 10 years ago?
The results are shown in the graphs below. Most participants perceived that external factors are being addressed in most movement ecology papers, that motion is being addressed in at least half of the literature, and that navigation and external factors are addressed in less than half. Interestingly, when asked which components were most studied now than 10 years ago, most participants agreed on internal factors.
```{r moveco-frame, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_3 <- data[,grep("Q5", colnames(data))]
# data_question_3 <- data_question_3 %>% filter(!is.na(Q5))
# boxplot(data_question_2)
# data_question_3 <- lapply(data_question_3, factor)
categories <- c("Studied in most papers", "Studied in about half of papers", "Studied in a few papers", "I have no idea")
frame_counts <- t(sapply(1:ncol(data_question_3),function(x){
data_line <- factor(data_question_3[which(!is.na(data_question_3[,x])),x],levels=categories)
count_p_use <- as.numeric(table(data_line))
return(count_p_use)
}))
frame_counts<-data.frame(frame_counts)
new_categories <- c(">50% papers", "~50% of papers", "<50% papers", "No idea")
colnames(frame_counts) <- new_categories
rownames(frame_counts) <- c("Motion","Navigation","Int. Factors", "Ext. Factors")
total_frame <- rowSums(frame_counts)
frame_counts$component<-row.names(frame_counts)
df1<-melt(frame_counts[, ],id.vars='component',variable_name='response')
g<-unlist(by(df1, df1$component, function(x) sum(x$value)))
color.pallete<-brewer.pal(4,'YlGnBu')
color.pallete[1]<-'lightgray'
df1$component<-factor(df1$component,levels=rownames(frame_counts)[order(frame_counts[,1],frame_counts[,2])])
df1$response<-factor(df1$response,levels=rev(new_categories))
p3 <- ggplot(data=df1) + geom_col(aes(x=component, y=value,fill=response)) + ggtitle("Focus on components") +
ylab('Responses') +coord_flip() +
scale_fill_manual(values=color.pallete) +theme_classic() +
theme(axis.text.y = element_text( hjust = 1,vjust=0.5),
axis.title =element_text(size=20),
axis.text =element_text(size=16),
legend.text=element_text(size=10),
legend.title=element_blank(),
legend.position="bottom")
# show(p3)
```
```{r moveco-frame past, echo=FALSE, message=F, eval=TRUE, include=TRUE,fig.width = 10, fig.alt = "Horizontal stacked bar plots of the participants opinion on how much the community is focusing on each MEF component (left), and on how much focus there is compared to ten years ago (right). Results described in the text above."}
data_question_8 <- data[,grep("Q10", colnames(data))]
# data_question_3 <- data_question_3 %>% filter(!is.na(Q5))
# boxplot(data_question_2)
# data_question_3 <- lapply(data_question_3, factor)
categories <- c("More Studied", "Equally studied", "Less studied", "I have no idea")
frame_counts <- t(sapply(1:ncol(data_question_8),function(x){
data_line <- factor(data_question_8[which(!is.na(data_question_8[,x])),x],levels=categories)
count_p_use <- as.numeric(table(data_line))
return(count_p_use)
}))
frame_counts<-data.frame(frame_counts)
new_categories <- c("> studied", "= studied", "< studied", "No idea")
colnames(frame_counts) <- new_categories
rownames(frame_counts) <- c("Motion","Navigation","Int. Factors", "Ext. Factors")
total_frame <- rowSums(frame_counts)
frame_counts$component<-row.names(frame_counts)
df2<-melt(frame_counts[, ],id.vars='component',variable_name='response')
g2<-unlist(by(df2, df2$component, function(x) sum(x$value)))
color.pallete<-brewer.pal(4,'YlGnBu')
color.pallete[1]<-'lightgray'
df2$component<-factor(df2$component,levels=rownames(frame_counts)[order(frame_counts[,1],frame_counts[,2])])
df2$response<-factor(df2$response,levels=rev(new_categories))
p8 <- ggplot(data=df2) + geom_col(aes(x=component, y=value,fill=response)) + ggtitle("Compared to 10 years ago") +
ylab('Responses') +coord_flip() +
scale_fill_manual(values=color.pallete) +theme_classic() +
theme(axis.text.y = element_text( hjust = 1,vjust=0.5),
axis.title =element_text(size=20),
axis.text =element_text(size=16),
legend.text=element_text(size=10),
legend.title=element_blank(),
legend.position="bottom")
multiplot(p3,p8,cols=2)
```
## Taxa
We asked the participants which taxa they considered to be studied the most in movement ecology, and to select up to 3 taxa. The number of votes per taxon are shown in the graph below. Birds and then mammals were indicated as the most studied.
```{r taxa, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Horizontal bar plot of the participants vote on which taxonomical group they considered to be the most studied in movement ecology. In decreasing order of votes: Aves, Mammalia, Sharks and Rays, Bony-fish, Homo Sapiens, Arthropoda, Reptilia, and Amphibia."}
data_question_4 <- data[,grep("Q6", colnames(data))]
categories <- c("Mammalia (excluding Human)", "Aves", "Reptilia (excluding Aves)", "Arthropoda", "Homo Sapiens",
"Amphibia", "Osteichthyes (Bony-fish)", "Chondrichthyes (Sharks and Rays)") # NO OTHERS
taxa_counts <- sapply(1:(ncol(data_question_4)-2),function(x){
sum(!is.na(data_question_4[,x]))
})
new_categories <- c("Mammalia", "Aves", "Reptilia", "Arthropoda", "Homo Sapiens",
"Amphibia", "Bony-fish", "Sharks and Rays") # NO OTHERS
taxa_df <- data.frame(taxa = new_categories, responses = taxa_counts)
taxa_df <- taxa_df %>%
arrange(responses)
taxa_df$taxa <- factor(taxa_df$taxa,levels=taxa_df$taxa)
p4 <- ggplot(data = taxa_df, mapping = aes(x = taxa, y = responses)) +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Taxa") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=20),axis.text.y = element_text(size=16))
show(p4)
```
## Tracking devices
We asked the participants the following questions:
* Which tracking device do you consider to be used the most in movement ecology? (up to 3)
* Which tracking devices do you think are used more often now compared to 10 years ago? (up to 3)
* Which tracking devices do you think are used less often now compared to 10 years ago? (up to 3)
```{r device, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Horizontal bar plots indicating participants' votes on which devices are the most used. In decreasing order of votes: GPS, satellite tags, radio tags, encounter, light loggers, acoustic telemetry, accelerometer, data storage tags, and images."}
data_question_5 <- data[,grep("Q7", colnames(data))]
categories <- c("Light Loggers (e.g. GLS)", "Satellite Tags (e.g. ARGOS, PSAT, PTT, etc.)", "Radio tags (VHF or UHF based)", "GPS", "Images (Video and static)", "Acoustic telemetry", "Accelerometer", "Encounter (e.g capture mark recapture, banding, direct observation)", "data storage tags") # ONE OTHER
device_counts <- sapply(1:(ncol(data_question_5)),function(x){
sum(!is.na(data_question_5[,x]))
})
device_counts <- device_counts[-(ncol(data_question_5)-1)]
new_categories <- c("Light Loggers", "Satellite Tags", "Radio tags", "GPS", "Images", "Acoustic telemetry", "Accelerometer", "Encounter", "data storage tags") # ONE OTHER
device_df <- data.frame(device = new_categories, responses = device_counts)
device_df <- device_df %>%
arrange(responses)
device_df$device <- factor(device_df$device,levels=device_df$device)
p5 <- ggplot(data = device_df, mapping = aes(x = device, y = responses)) + ggtitle("Most used") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Device") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
show(p5)
```
```{r device more, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_9 <- data[,grep("Q11", colnames(data))]
categories <- c("Light Loggers (e.g. GLS)", "Satellite Tags (e.g. ARGOS, PSAT, PTT, etc.)", "Radio tags (VHF or UHF based)", "GPS", "Images (Video and static)", "Acoustic telemetry", "Accelerometer", "Encounter (e.g capture mark recapture, banding, direct observation)", "Radar") # ONE OTHER
device_counts <- sapply(1:(ncol(data_question_9)),function(x){
sum(!is.na(data_question_9[,x]))
})
device_counts <- device_counts[-(ncol(data_question_9)-1)]
new_categories <- c("Light Loggers", "Satellite Tags", "Radio tags", "GPS", "Images", "Acoustic telemetry", "Accelerometer", "Encounter", "data storage tags") # ONE OTHER
device_df <- data.frame(device = new_categories, responses = device_counts)
device_df <- device_df %>%
arrange(responses)
device_df$device <- factor(device_df$device,levels=device_df$device)
p9 <- ggplot(data = device_df, mapping = aes(x = device, y = responses)) + ggtitle("More used compared to 10 years ago") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Device") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p9)
```
```{r device less, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.width=10, fig.alt = "Left: horizontal bar plots indicating participants' votes on which devices are more used now compared to ten years ago. In decreasing order of votes: accelerometer, gps, acoustic telemetry, satellite tags, light loggers, images, data storage tags, encounter, radio tags. Right: horizontal bar plots indicating participants' votes on which devices are less used now compared to ten years ago. In decreasing order of votes: radio tags, encounter, satellite tags, light loggers, acoustic telemetry, images, none, and gps."}
data_question_10 <- data[,grep("Q12", colnames(data))]
categories <- c("Light Loggers (e.g. GLS)", "Satellite Tags (e.g. ARGOS, PSAT, PTT, etc.)", "Radio tags (VHF or UHF based)", "GPS", "Images (Video and static)", "Acoustic telemetry", "Accelerometer", "Encounter (e.g capture mark recapture, banding, direct observation)", "Other (describe)") # ONE OTHER
device_counts <- sapply(1:(ncol(data_question_10)),function(x){
sum(!is.na(data_question_10[,x]))
})
device_counts <- device_counts[-(ncol(data_question_10)-1)]
new_categories <- c("Light Loggers", "Satellite Tags", "Radio tags", "GPS", "Images", "Acoustic telemetry", "Accelerometer", "Encounter", "None") # ONE OTHER
device_df <- data.frame(device = new_categories, responses = device_counts)
device_df <- device_df %>%
arrange(responses) %>%
filter(responses > 0)
device_df$device <- factor(device_df$device,levels=device_df$device)
p10 <- ggplot(data = device_df, mapping = aes(x = device, y = responses)) + ggtitle("Less used compared to 10 years ago") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Device") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p10)
# show(p5)
multiplot(p9,p10,cols=2)
```
The results are shown in the graphs above. GPS and satellite tags (e.g. PSAT, PTT) are the most used devices, according to the participants. When compared to 10 years ago, they expressed that accelerometers and GPS devices are more used than before, while radio tags and encounter techniques (e.g. capture mark recapture, banding, direct observation) are less used.
## Software
We asked the participants the following questions:
* Which software do you think is used the most for movement analysis? (up to 3)
* For movement analysis, which software do you think are used more often now compared to 10 years ago? (up to 3)
* For movement analysis, which software do you think are used less often now compared to 10 years ago? (up to 3)
```{r software, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Horizontal bar plots indicating participants' votes on which software are the most used. In decreasing order of votes: R, ArcGIS, QGIS, Python, Matlab, RDBMs."}
data_question_6 <- data[,grep("Q8", colnames(data))]
categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "Relational databases (RDBMs)", "QGIS") # NO OTHER
soft_counts <- sapply(1:(ncol(data_question_6)-2),function(x){
sum(!is.na(data_question_6[,x]))
})
new_categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "RDBMs", "QGIS") # NO OTHER
soft_df <- data.frame(software = new_categories, responses = soft_counts)
soft_df <- soft_df %>%
arrange(responses) %>%
filter(responses > 0)
soft_df$software <- factor(soft_df$software,levels=soft_df$software)
p6 <- ggplot(data = soft_df, mapping = aes(x = software, y = responses)) + ggtitle("Most used") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Software") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
show(p6)
```
```{r software more, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_11 <- data[,grep("Q13", colnames(data))]
categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "Relational databases (RDBMs)", "QGIS") # NO OTHER
soft_counts <- sapply(1:(ncol(data_question_11)-2),function(x){
sum(!is.na(data_question_11[,x]))
})
new_categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "RDBMs", "QGIS") # NO OTHER
soft_df <- data.frame(software = new_categories, responses = soft_counts)
soft_df <- soft_df %>%
arrange(responses) %>%
filter(responses > 0)
soft_df$software <- factor(soft_df$software,levels=soft_df$software)
p11 <- ggplot(data = soft_df, mapping = aes(x = software, y = responses)) + ggtitle("More used compared to 10 years ago") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Software") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p11)
```
```{r software less, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.width=10, fig.alt = "Left: horizontal bar plots indicating participants' votes on which software are more used now compared to ten years ago. In decreasing order of votes: R, QGIS, Python, Matlab, ArcGIS, RDBMs. Right: horizontal bar plots indicating participants' votes on which software are less used now compared to ten years ago. In decreasing order of votes: Matlab, ArcGIS, SPSS, SAS, RDBMs, Python."}
data_question_12 <- unlist(data[,grep("Q14", colnames(data))])
data_question_12 <- data_question_12[!is.na(data_question_12)]
new_categories <- c("R", "Python", "ArcGIS", "Matlab", "SAS", "SPSS", "RDBMs", "QGIS")
soft <- factor(unlist(sapply(1:length(data_question_12),function(i){
words <- unlist(strsplit(data_question_12[i], " "))
unlist(sapply(1:length(words),function(j){
ind <- grep(words[j],new_categories)
if (length(ind) > 0){
return(new_categories[ind])
}
})
)
})))
soft_counts <- as.numeric(table(soft))
soft_df <- data.frame(software = levels(soft), responses = soft_counts)
soft_df <- soft_df %>%
arrange(responses) %>%
filter(responses > 0)
soft_df$software <- factor(soft_df$software,levels=soft_df$software)
p12 <- ggplot(data = soft_df, mapping = aes(x = software, y = responses)) + ggtitle("Less used compared to 10 years ago") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Software") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p12)
multiplot(p11,p12,cols=2)
```
The results are shown in the graphs above. R, and in the a lesser degree, ArcGIS, are the most used software, according to the participants. When compared to 10 years ago, they expressed that R is more used than before, while Matlab, ArcGIS, SPSS and SAS are less used.
## Methods
We asked the participants the following questions:
* Which statistical/mathematical methods do you consider to be used the most for movement analysis?
* Which methods do you think are used more often now compared to 10 years ago? (up to 3)
* Which methods do you think are used less often now compared to 10 years ago? (up to 3)
There are many analytical tools applied/developed in movement ecology. To keep it simple, we provided the participants with 9 (arbitrarily selected) options: generalized linear models (GLMs) and generalized additive models (GAMs), machine learning, model selection criteria, multivariate exploratory methods, net squared displacement (NSD), spatial point processes, state-space and Hidden Markov models (SSMs and HMMs, respectively), step and resource selection functions (SSFs and RSFs, respectively), and test statistics and p-values. We provided an "Other" option for researchers to indicate methods that would not be within the 9 other options. In the end, the "Other" option was poorly used.
```{r methods, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.alt = "Horizontal bar plots indicating participants' votes on which statistical/mathematical methods are the most used. In decreasing order of votes: SSF and RSF, SSM and HMM, GLMs and GAMs, model selection, tests, NSD, spatial point, multivariate, other, and machine learning."}
data_question_7 <- data[,grep("Q9", colnames(data))]
categories <- c("Net squared displacement", "Test statistics and p-values", "Model selection criteria (e.g AIC)",
"GLMs and GAMs", "Multivariate exploratory methods", "Machine learning", "Step and resource selection functions", "State-space and Hidden Markov models", "Spatial point processes", "Other") # NO OTHER
methods_counts <- sapply(1:(ncol(data_question_7)),function(x){
sum(!is.na(data_question_7[,x]))
})
methods_counts <- methods_counts[-(ncol(data_question_7))]
new_categories <- c("NSD", "Tests", "Model selection",
"GLMs and GAMs", "Multivariate", "Machine learning", "SSF and RSF", "SSM and HMM", "Spatial point", "Other")
methods_df <- data.frame(method = new_categories, responses = methods_counts)
methods_df <- methods_df %>%
arrange(responses) %>%
filter(responses > 0)
methods_df$method <- factor(methods_df$method,levels=methods_df$method)
p7 <- ggplot(data = methods_df, mapping = aes(x = method, y = responses)) + ggtitle("Most used") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Method") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
show(p7)
```
```{r methods more, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_13 <- data[,grep("Q15", colnames(data))]
categories <- c("Net squared displacement", "Test statistics and p-values", "Model selection criteria (e.g AIC)",
"GLMs and GAMs", "Multivariate exploratory methods", "Machine learning", "Step and resource selection functions", "State-space and Hidden Markov models", "Spatial point processes", "Other") # NO OTHER
methods_counts <- sapply(1:(ncol(data_question_13)),function(x){
sum(!is.na(data_question_13[,x]))
})
methods_counts <- methods_counts[-(ncol(data_question_13))]
new_categories <- c("NSD", "Tests", "Model selection",
"GLMs and GAMs", "Multivariate", "Machine learning", "SSF and RSF", "SSM and HMM", "Spatial point", "Other")
methods_df <- data.frame(method = new_categories, responses = methods_counts)
methods_df <- methods_df %>%
arrange(responses) %>%
filter(responses > 0)
methods_df$method <- factor(methods_df$method,levels=methods_df$method)
p13 <- ggplot(data = methods_df, mapping = aes(x = method, y = responses)) + ggtitle("More used compared to 10 years ago") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Method") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p13)
```
```{r methods less, echo=FALSE, message=F, eval=TRUE, include=TRUE, fig.width=10, fig.alt = "Left: horizontal bar plots indicating participants' votes on which statistical/mathematical methods are more used now compared to ten years ago. In decreasing order of votes: SSM and HMM, machine learning, SSF and RSF, model selection, multivariate, spatial point, GLMs and GAMs, NSD, and other. Right: horizontal bar plots indicating participants' votes on which statistical/mathematical methods are less used now compared to ten years ago. In decreasing order of votes: tests, NSD, multivariate, spatial point, SSF and RSF, model selection, other, and GLMs and GAMs."}
data_question_14 <- data[,grep("Q16", colnames(data))]
categories <- c("Net squared displacement", "Test statistics and p-values", "Model selection criteria (e.g AIC)",
"GLMs and GAMs", "Multivariate exploratory methods", "Machine learning", "Step and resource selection functions", "State-space and Hidden Markov models", "Spatial point processes", "Other") # NO OTHER
methods_counts <- sapply(1:(ncol(data_question_14)),function(x){
sum(!is.na(data_question_14[,x]))
})
methods_counts <- methods_counts[-(ncol(data_question_14))]
new_categories <- c("NSD", "Tests", "Model selection",
"GLMs and GAMs", "Multivariate", "Machine learning", "SSF and RSF", "SSM and HMM", "Spatial point", "Other")
methods_df <- data.frame(method = new_categories, responses = methods_counts)
methods_df <- methods_df %>%
arrange(responses) %>%
filter(responses > 0)
methods_df$method <- factor(methods_df$method,levels=methods_df$method)
p14 <- ggplot(data = methods_df, mapping = aes(x = method, y = responses)) + ggtitle("Less used compared to 10 years ago") +
geom_bar(stat = "identity",position="identity") +
coord_flip() +
xlab("Method") + ylab("Number of responses") +
theme_cowplot() + theme(text = element_text(size=14),axis.text.y = element_text(size=16))
# show(p14)
multiplot(p13,p14,cols=2)
```
The results are shown in the graphs above. The most used methods in movement ecology, according to the participants, are SSFs and RSFs, SSMs and HMMs, GLMs and GAMs. When compared to 10 years ago, they expressed that SSMs and HMMs, as well as machine learning, and in a lesser degree, SSFs and RSFs are more used, while hypothesis tests are less used.
The "Other" option was used for speed threshold (1) and mininum convex polygon and kernel density utilization (1), for the first question; visual analytics (1), and continuous time movement modeling (1) for the second question; descriptive metrics (1) and "none" (1) for the third.
## Big changes in the field
We asked the participants three final open questions.
```{r open questions, echo=FALSE, message=F, eval=TRUE, include=TRUE}
data_question_15 <- data %>%
select(Q17) %>%
filter(!is.na(Q17))
data_question_16 <- data %>%
select(Q18) %>%
filter(!is.na(Q18))
data_question_17 <- data %>%
select(Q19) %>%
filter(!is.na(Q19))
```
* In your opinion, what has revolutionized the field in the last 10 years? (Please keep it to three topics)
`r nrow(data_question_15)` participants answered this first question. 31 answers were related to tagging devices (smaller, long-lasting, cheaper, battery-saving devices) that gave rise to longer or higher resolution data on many species. 15 answers were related to software development and computational power, 6 to statistical methods, 2 to the availability of remote sensing data, 2 to availability of tracking data online (in open data portals like movebank), 1 to molecular markers, 1 to citizen science, and there were other references to "mechanistic thinking and approaches" (1), the increase in the number of studies (1) and Nathan et al. 2008 paper in PNAS (1).
* Compared to 10 years ago, what would you be able to work on now that you could not do 10 years ago? (Please keep it to three topics; your answer may involve taxa, devices, methods or others)
`r nrow(data_question_16)` participants answered this question. 28 answers were related to tagging devices that allowed working with smaller species, obtaining longer datasets, tracking more individuals and having high resolution data on their movement; among them, 4 mentioned the opportunity to work with accelerometry data. 5 answers were related to working with environmental data to link to the movement data (one mentioned it to do habitat selection analysis), 4 people mentioned studies in physiology, 3 of behavior, 3 of movement models and analysis in general, 1 studies on the mechanisms underlying movement, 1 multitaxa comparison, 1 citizen science thanks to apps on the phone. 2 people mentioned computational simulation and 2 processing data.
* In your opinion, what will revolutionize the field in the next 10 years? (Please keep it to three topics)
`r nrow(data_question_17)` participants answered this question. 21 gave answers related to tagging devices (including use of ICARUS technology, 3D monitoring, development of more reliable and less invasive devices, improvement of accelerometers, and combined use of devices). 5 people referred to the development of statistical and mathematical methods to study movement processes (including machine learning techniques); 3 mentioned the availability of high resolution remote sensing data; 2 answers were related to software and computing power, 2 to connecting movement ecology to physiology, 2 to evolution, 1 to ontogeny, 1 to large-scale navigation experiments, 1 to integrating navigation, genetics, environment, physiology, life history and population dynamic studies; 1 to the development of data platforms, and 1 to "improved conceptualization of the field".
## Summary
* Most participants in the survey had less than a decade of experience in the field, so their farthest point of comparison in the past is from less than 10 years.
* Participants identified external factors as the dominant component in movement ecology papers, though recognizing that there is an increasing but still weaker interest in the link between internal factors and movement.
* Birds and mammals were identified as the most studied taxa in the field.
* GPS tags were identified as the most used tags, while the use of accelerometers is increasing and the use of radio tags, decreasing.
* The R software is the most used software according to the participants, and it is far more used than 10 years ago. On the other hand, Matlab, ArcGIS, SPSS and SAS use have decreased in the last decade.
* RSFs, SSFs, SSMs, HMMs, as well as GLMs and GAMs, are the most used statistical methods in movement ecology. SSM, HMM and machine learning techniques have experienced a notable increase in their use in the last decade, while the use of statistical tests, mainly, have decreased.
* When asked for what has revolutionized the field and what will play a major role in the next 10 years, most of the answers were related to tracking devices (miniaturization, reliability, battery saving, combined use of them). In a lesser degree, software, data and methods to analyze movement were mentioned. Most participants identified tracking technology as the game changer in the field.