[Ipmpack-users] Bootstrapping IPMs with Discrete Stages -- Troubleshooting High Estimates of Lambda

Baer, Kathryn kathryn.baer at umconnect.umt.edu
Thu Feb 23 05:56:57 CET 2017


Hello IPMpack users!

I have a fairly specific question, although I imagine the answer might be useful to others, as well.

I am attempting to build distributions of estimated lambda from 500 IPMs, each constructed from a bootstrapped dataset of demographic data for a perennial plant. The goal is to use these models in randomization tests to determine the population-scale significance of an experimental treatment. Each bootstrapped dataset was built by sampling equal numbers of individuals at random from each of the four quartiles of the size range in the original dataframe (to avoid size distributions skewed by chance). I then built an IPM for each bootstrapped dataset and calculated an estimate of lambda, using the minimum and maximum ln-transformed sizes in that dataset to define the size range for building that dataset's IPM. This generally produced reasonable values of lambda, but a few models (roughly 15-50 of the 500) gave astronomically high estimates.
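In case it clarifies the setup, here is a minimal sketch of the resampling scheme described above (`df` and its `size` column are placeholders for my actual dataframe):

```r
# Quartile-stratified bootstrap: resample equal numbers of rows,
# with replacement, from each quartile of ln(size).
boot_dataset <- function(df, n_per_quartile) {
  breaks <- quantile(df$size, probs = seq(0, 1, 0.25), na.rm = TRUE)
  quart  <- cut(df$size, breaks = breaks, include.lowest = TRUE)
  rows   <- unlist(lapply(split(seq_len(nrow(df)), quart),
                          function(idx) sample(idx, n_per_quartile,
                                               replace = TRUE)))
  df[rows, ]
}

# Per-dataset size range used to set that dataset's IPM limits
size_range <- function(df) range(df$size, na.rm = TRUE)
```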

My question is: how do I troubleshoot this? I've checked the diagnostics on the models giving high estimates of lambda, but they appear to fit well and their P matrices all look good (and their size distributions look very similar to those of models giving reasonable estimates). There are no gaps in size, given the way I bootstrapped the data. I've tried scaling the maximum size up by a constant and increasing the number of bins. I've played around with different model structures (adding or dropping the squared and cubed size terms), and I've tried increasing and decreasing the number of draws per bootstrapped dataset. None of these has helped much, although the number of models estimating unrealistic values of lambda did decrease when I set each model's minimum and maximum sizes from its own dataset's extremes rather than from the original data.
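For reference, this is roughly the kind of check I've been running on the P matrices (a sketch, not IPMpack code; `Pmatrix`, `surv_prob`, and `meshpts` are placeholders for the discretized kernel, the survival probability predicted at each mesh point, and the bin midpoints):

```r
# Sanity check on a discretized growth/survival kernel: each column
# of P should sum to (at most) the survival probability at that size.
# A large shortfall suggests eviction at the size limits; an excess
# suggests a numerical problem in the kernel.
check_kernel <- function(Pmatrix, surv_prob, meshpts) {
  colsum <- colSums(Pmatrix)
  data.frame(size   = meshpts,
             colSum = colsum,
             surv   = surv_prob,
             diff   = colsum - surv_prob)
}
```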

Any ideas of other things to try in order to address this problem? I'm stumped!

A very abbreviated form of the models is as follows (all measures of size are natural log-transformed):

P Matrix:

survival~size

growth~size+size^2+size^3

- There are also 2 discrete stages: a seed bank (which receives most seeds that do not germinate the spring after they fall) and a seedling stage, which can be entered either from the seed bank or directly the spring after seeds fall. All seedlings are roughly the same size, so the seedling stage is not part of the continuous model; only surviving seedlings enter the continuous stage, and their size when they do so is modeled by a constant mean and standard deviation.
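To show what I mean by the discrete stages, here is a toy sketch of the block structure of the full projection matrix (all the rates and the tiny mesh here are made-up placeholders, not my fitted values):

```r
# Toy block matrix: states are [seed bank, seedling, continuous bins]
n <- 5                         # continuous size bins (tiny, for illustration)
P_cont  <- diag(0.5, n)        # placeholder continuous survival/growth kernel
sb_stay <- 0.8                 # seed bank survival (remains in bank)
sb_germ <- 0.1                 # germination from bank -> seedling
sdl_srv <- 0.3                 # seedling survival into the continuous stage
recruit <- dnorm(seq(-1, 1, length.out = n), 0, 0.5)
recruit <- recruit / sum(recruit)   # size distribution of new recruits

A <- matrix(0, n + 2, n + 2)
A[1, 1] <- sb_stay                     # seed bank -> seed bank
A[2, 1] <- sb_germ                     # seed bank -> seedling
A[3:(n + 2), 2] <- sdl_srv * recruit   # seedling -> continuous sizes
A[3:(n + 2), 3:(n + 2)] <- P_cont      # continuous survival/growth

lambda <- Re(eigen(A)$values[1])       # dominant eigenvalue
```

(The real model also has fecundity entries feeding the seed bank and seedling rows; they are omitted here for brevity.)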

F Matrix:

probability of flowering (fec.0)~size

number of seeds if the plant did flower (fec.1)~size+size^2+size^3

- Constants describe the proportion of seeds that germinate the following spring vs. entering the seed bank.
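In case it helps, the vital-rate regressions above correspond roughly to the following plain glm/lm fits (a sketch with simulated placeholder data, and assuming a log-link count model for fec.1, which is where I suspect the cubic could blow up at the size extremes):

```r
set.seed(1)  # fake data just so the sketch runs; sizes are ln-transformed
df <- data.frame(size = rnorm(300))
df$surv     <- rbinom(300, 1, plogis(df$size))
df$sizeNext <- df$size + rnorm(300, 0.2, 0.5)
df$fec0     <- rbinom(300, 1, plogis(df$size - 0.5))
df$fec1     <- rpois(300, exp(1 + 0.5 * df$size))

# Vital-rate regressions matching the structure above
surv_mod <- glm(surv ~ size, family = binomial, data = df)
grow_mod <- lm(sizeNext ~ size + I(size^2) + I(size^3), data = df)
fec0_mod <- glm(fec0 ~ size, family = binomial, data = df)
fec1_mod <- glm(fec1 ~ size + I(size^2) + I(size^3),
                family = poisson, data = subset(df, fec0 == 1))

# With a log link, a cubic can explode at the ends of the size range;
# fitted seed counts at the extremes seem worth checking:
predict(fec1_mod, newdata = data.frame(size = range(df$size)),
        type = "response")
```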


Thanks in advance for any ideas or suggestions you might have!

Cheers,

Katie Baer
kathryn.baer at umconnect.umt.edu
PhD Candidate - Maron Lab<http://cas.umt.edu/dbs/labs/maron/>
Department of Biological Sciences<http://cas.umt.edu/dbs/>
University of Montana