Distance sampling in the Real World We've talked a lot about models We've also talked about assumptions Our example is relatively well-behaved What can we do about all the nasty real world stuff?
Some days...
Aims Here we want to cover common questions Not definitive answers Some guidance on where to look for answers
What should my sample size be?
What do we mean by "sample size"? Number of animal (groups) recorded detection function Number of segments spatial model Number of segments with observations spatial model
Re-frame
How would we know when we have enough samples? We don't Heavily context-dependent Go back to assumptions
"How many data?"
Pilot studies and "you get what you pay for" Designing surveys is hard Designing surveys is essential Better to fail one season than fail for 5, 10 years Get information early, get it cheap Inform design from a pilot study
Avoiding rules of thumb Think about assumptions Detection function Spatial model Think about design Spatial coverage Covariate coverage
Spatial coverage (IWC POWER)
Covariate coverage
Sometimes things are complicated Weather has a big effect on detectability Need to record during survey Disambiguate between distribution/detectability Potential confounding can be BAD
Visibility during POWER 2014
Thanks to Hiroto Murase and co. for this data!
Covariates can make a big difference!
Disappointment Sometimes you don't have enough data Or, enough coverage Or, the right covariates
Sometimes, you can't build a spatial model
@kitabet
"Which of options X, Y, Z is correct?"
Alternatives problem When faced with options, try them. Where does the sensitivity lie? What's really going on? What is your objective?
"How big should our segments be?"
Segment size If you think it's an issue test it Resolution of covariates also important Maybe species-/domain-dependent? (Solutions on the horizon to avoid this)
"Is our model right?"
Model validation Some variety of cross-validation Temporal replication Leave out 1 year, fit to others, predict, assess Spatial “pseudo-jackknife” th
Leave out every n segment, refit, … (Maybe leave out 2, 3 etc…)
Modelling philosophy
Which covariates should we include? Dynamic vs static variables Spatial terms? Habitat models?
Getting help
Resources Bibliography has pointers to these topics Distance sampling Google Group Friendly, helpful, low traffic see distancesampling.org/distancelist.html
Advanced topics
This is a whirlwind tour...
...and some of this is experimental
Smoother zoo
Cyclic smooths What if things “wrap around”? (Time, angles, …) Match value and derivative Use bs="cc" See ?smooth.construct.cs.smooth.spec
Smoothing in complex regions Edges are important Whales don't live on land Bad things happen when we don't account for this Include boundary info in smoother ?soap
Multivariate smooths Thin plate splines are isotropic 1 unit in any direction is equal Fine for space, not for other things
Tensor products sx,z (x, z) = ∑k1 ∑k2 βk sx (x)sz (z) As many covariates as you like! (But takes time) te() or ti() (instead of s())
Black bears like to sunbathe
Random effects normal random effects exploits equivalence of random effects and splines ? gam.vcomp useful when you just have a “few” random effects ?random.effects
Making things faster
Parallel processing Some models are very big/slow Run on multiple cores Use engine="bam"! Some constraints in what you can do Wood, Goude and Shaw (2015)
Summary Lots of complicated problems Lots of potential solutions (see also “other approaches” mini-lecture) Need to get simple things right first Trade assumptions for data
Real survey data is messy ... Weather has a big effect on detectability. Need to record during survey. Disambiguate ... Parallel processing. Some models are very ...
Provide foundation for domain ontologies with spatially extended objects. ⢠Applications in geography, activity recognition, robotics, NL, biologyâ¦
This chapter also presents a system that generates reports combining automatically generated ... in different circumstances, our system converts each kernel expression into a standard, simplified ..... (2013) developed an analytic method for ...
refer to [9], [12], [13] for the details), but we describe the basic concepts. In a bond .... and the ground) have collided, and they are now in contact over some area ..... as one dynamical system, instead of looking at separate direc- tions (three
1. 2. 3. 4. B. A. 3. 2. 1. 5. C. D. 4. 6. 7. 8. A. A. SHEET 1 OF 1. Alarm Clock. TITLE. Display (Model B) - Dots PCB. REV. PART #. CLK-PC-06. DOCUMENT #.
just an inference engine, but also a way to construct new models and a way to check ... 3. A model comparison procedure. Search strategies requires an objective to ... We call this system the automatic Bayesian covariance discovery (ABCD).
Aug 11, 2005 - There is a great deal of literature on Bayesian model comparison for nonspatial .... structure of the explanatory variables in X into account. ...... Further computational savings can be achieved by noting that the grid can be.
One can multiply any number of kernels together in this way to produce kernels combining several ... Figure 1.3 illustrates the SE-ARD kernel in two dimensions. Ã. = â ...... We'll call a kernel which enforces these symmetries a Möbius kernel.
logical spatial logics [10], whereas temporal information is described by a Kripke ..... minutes, depending on the formula, on a quite standard laptop computer.
Aug 11, 2005 - represents a cross-section of regions located in space, for example, counties, states, or countries. y ¼ rWy Ñ ... If the sample data are to determine the posterior model probabilities, the prior probabilities ..... averaged estimate
Jun 30, 2014 - The new economic geography models presented in Fujita et al (2001) are also related to our analysis. In their ...... Afghanistan. Bangladesh. Brazil. Chile. China. Dominican Republic. India. Indonesia. Korea, Rep. Malaysia. Mexico. Bhu
soil surveys are a major source of soil spatial information. There are ...... paid to sample the small draws when the 64 sites were distributed over the study area.
An up-to-date pdf version of this tutorial is maintained for teaching purposes in the file ... 1. Introduction: provides a guide to R's syntax and preparing for the tutorial .... To check the classes of all the variables in a spatial dataset, you can
towns which rely on a properly dimensioned sewage system to collect water run-off. Fig. ... As weather predictions are considered reliable up to 1 week ahead, we ..... (Available from http://www.abi.org.uk/Display/File/Child/552/Financial-Risks-.