[Biomod-commits] Error in newx$cptable[max(keep), 1L] <- cp : subscript out of bounds

jason b mackenzie jasonbmackenzie at gmail.com
Fri Feb 10 09:42:48 CET 2012


Hi Wilfried

Thanks for the suggestions to drop low performing techniques (CTA, ANN and SRE) and redundant evaluation methods (KAPPA), and to choose another pseudo absence strategy (random). I look forward to reading the paper, and that's all great practical guidance. 

I ran all the models mostly for exploratory purposes, with the assumption that i could identify the worst performers with evaluation scores and drop them from Ensemble.Forecasting ( ) with flags such as weight.method='proportional', decay = 1.6, qual.th = 0.5. With that said, it makes since to drop weak approaches up front, so thanks again for the advice.

In case there are other methods producing curious errors besides CTA, a different species fell over yesterday on FDA, with the following error:
Model=Flexible Discriminant Analysis 
Error in family$linkfun(mustart) : 
  Argument mu must be a nonempty numeric vector

Cheers, 
Jason



On Feb 10, 2012, at 6:11 PM, Wilfried Thuiller wrote:

> Dear Jason,
> 
> The problem is related to the classification tree. It seems the tree has no branches and BIOMOD does not manage to get the optimal number of leaves. 
> My take on that is that your species is very difficult to model for CTA and it does not manage to find any good combination of variables explaining the distribution. 
> Perhaps we need to make an error trap around this call as this is not the first time we got this error, which is not a BIOMOD bug, but rather the failure of CTA to find an optimal number of leaves. 
> I will suggest removing the model for this species.
> 
> Looking at the code you sent with this message and the other one, I would also suggest:
> 
> Removing some techniques. Some generally performed quite poorly compared to the other ones. CTA and ANN are the weakest ones. SRE is more like a null model, the most liberal. I would drop the three. 
> Same for evaluation, you do not need to use both TSS and KAPPA, they are quite redundant. Just keep TSS, less influenced by prevalence and ROC. 
> 
> I would recommend using "random" for pseudo absence instead of SRE or circle which tends to over fit a bit the data.
> I take the liberty to send you this paper in press with Methods in Ecology and Evolution that discuss the different strategies for selecting pseudo-absence:
> PDF
> 
> Hope it helps,
> 
> Wilfried
> 
> 
> 
> 
> Le 7 févr. 2012 à 05:06, jason b mackenzie a écrit :
> 
>> Dear all,
>> 
>> I recently had a BIOMOD run crash mid-stream after 20+ species appear to have run successfully. The error message is pasted below. Any ideas or workarounds? 
>> 
>> ...
>> #####			 ERFA2 			#####
>> #####		   pseudo-absence run 1        		#####
>> Model=Artificial Neural Network 
>> 	 3 Fold Cross Validation + 3 Repetitions 
>> Calibration and evaluation phase: Nb of cross-validations:  3 
>> Evaluating Predictor Contributions in  ANN ... 
>> Model=Classification tree 
>> 	 50 Fold Cross-Validation 
>> Error in newx$cptable[max(keep), 1L] <- cp : subscript out of bounds
>> > warnings()
>> Warning messages:
>> 1: In predict.lm(object, newdata, se.fit, scale = 1, type = ifelse(type ==  ... :
>>   prediction from a rank-deficient fit may be misleading
>> 2: In cor(g.pred[, ], as.integer(.Rescaler4(as.numeric(predict(model.sp,  ... :
>>   the standard deviation is zero
>> ...
>> 
>> 
>> FYI, not sure if this matters, but my starting conditions were set as follows:
>> 
>> Initial.State(Response=SP.ENV[,12:75], Explanatory=SP.ENV[,4:11], IndependentResponse=NULL, 
>> 		IndependentExplanatory=NULL)
>> 
>> Models(GLM = FALSE, GAM = TRUE, Spline=4, GBM = TRUE, No.trees = 3000, CTA = TRUE, CV.tree = 50, ANN = TRUE, CV.ann = 3, 
>> 		SRE = TRUE, quant = 0.025, FDA = TRUE, MARS = TRUE, RF = TRUE, NbRunEval = 3, DataSplit = 70, 
>> 		Yweights = NULL, Roc = TRUE, Optimized.Threshold.Roc = TRUE, Kappa = TRUE, TSS = TRUE, 
>> 		KeepPredIndependent = FALSE, VarImport = 5, NbRepPA = 2, strategy = "sre", nb.absences = 1000)
>> 
>> 
>> Thanks in advance for any help!
>> 
>> Cheers, 
>> Jason
>> 
>> 
>> --
>> 
>> jason b mackenzie
>> 80 banambila street
>> aranda, act 2614
>> 
>> 0447 002 629  [mobile]
>> jasonbmackenzie    [skype]
>> 
>> 
>> 
>> 
>> _______________________________________________
>> Biomod-commits mailing list
>> Biomod-commits at lists.r-forge.r-project.org
>> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/biomod-commits
> 
> --------------------------
> Dr. Wilfried Thuiller
> Laboratoire d'Ecologie Alpine, UMR CNRS 5553
> Université Joseph Fourier
> BP53, 38041 Grenoble cedex 9, France
> tel: +33 (0)4 76 51 44 97
> fax: +33 (0)4 76 51 42 79
> 
> Email: wilfried.thuiller at ujf-grenoble.fr
> Personal website: http://www.will.chez-alice.fr
> Team website: http://www-leca.ujf-grenoble.fr/equipes/emabio.htm
> 
> ERC Starting Grant TEEMBIO project: http://www.will.chez-alice.fr/Research.html
> FP6 European EcoChange project: http://www.ecochange-project.eu
> 
> 
> 
> 
> 
> 
> 


--

jason b mackenzie
80 banambila street
aranda, act 2614

0447 002 629  [mobile]
jasonbmackenzie    [skype]




-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/biomod-commits/attachments/20120210/643a6c44/attachment-0001.html>


More information about the Biomod-commits mailing list