From avraham.adler at gmail.com Mon Aug 1 19:49:06 2016
From: avraham.adler at gmail.com (Avraham Adler)
Date: Mon, 1 Aug 2016 13:49:06 -0400
Subject: [datatable-help] Timing on development/next release version for Windows
Message-ID:

Firstly, thank you for an extremely useful and well-constructed package.

As someone restricted to Windows, I am looking forward to the next release, especially the fwrite functionality. Unfortunately, the development version (1.9.7) is continually listed as failing for Windows. I understand there are many issues to address, but is there any, even tentative, ETA for a successful Windows build of 1.9.7, if not a full 1.9.8 release?

Thank you,

Avi

From cyrille.laurent.sage at gmail.com Tue Aug 2 15:32:27 2016
From: cyrille.laurent.sage at gmail.com (cyrille laurent sage)
Date: Tue, 2 Aug 2016 09:32:27 -0400
Subject: [datatable-help] datatable-help Digest, Vol 78, Issue 1
In-Reply-To:
References:
Message-ID: <010c01d1ecc2$50687a90$f1396fb0$@gmail.com>

OK zibiou, I am sorry, it wasn't you. My sincerest apologies, big kisses. Right, now I am going to kill that bastard of a bloody MD.
Papy nrv

From mel at mbacou.com Thu Aug 11 09:34:05 2016
From: mel at mbacou.com (Bacou, Melanie)
Date: Thu, 11 Aug 2016 03:34:05 -0400
Subject: [datatable-help] Help summarizing monthly data sequences
Message-ID: <6c903b16-07b4-d51c-2a8b-b16fda577e58@mbacou.com>

Hi,

I hope this is an acceptable data.table problem (also posted to StackOverflow http://stackoverflow.com/questions/38890034/using-data-table-to-summarize-monthly-sequences-count-specific-events).

I have a 3-column table with:
- `id` geographic location IDs (303,453 locations)
- `month` month over the 25 years 1990-2014
- `spei` a climatic index that varies between -7 and 7.

I need to count the occurrence of droughts at each location over the entire 1990-2014 period. A drought event is defined as "a period in which the SPEI is continuously negative and the SPEI reaches a value of -1.0 or less. Drought starts when the SPEI first falls below zero and ends with the first positive SPEI value following a value of -1.0 or less".

I know this should be feasible using shift() and rolling joins but would very much welcome some pointers!

# Sample table structure
# (1990-01 to 2014-12 is 25 years, i.e. 300 months per location)
dt <- data.table(
  id = rep(1:303453, each=25*12),
  month = rep(seq(as.Date("1990-01-01"), as.Date("2014-12-31"), "month"), 303453),
  spei = runif(303453*25*12, -7, 7))

Hoping some of you are more used to working with time series.

Many thanks,
--Mel.

--
Melanie BACOU
International Food Policy Research Institute
Snr.
Program Manager, Spatial Data and Analytics
Work +1(202)862-5699
E-mail m.bacou at cgiar.org
Visit www.harvestchoice.org

From zach.okeeffe at gmail.com Sat Aug 13 20:07:25 2016
From: zach.okeeffe at gmail.com (Zachary O'Keeffe)
Date: Sat, 13 Aug 2016 14:07:25 -0400
Subject: [datatable-help] using set to create a large date vector converted to factor
Message-ID:

Hello,

I'm a huge fan of data.table and use it almost exclusively. I've not used this mailing list before, but I signed up to report an error I encountered because the error message asked me to.

I have a fairly large data.table (480,000 rows) with over 6,000 unique dates. I can create a factor version of the date variable using the := syntax, but not with set(), which I generally try to use as per the recommendation in the package documentation. Factoring a date column with set() does work for smaller data.tables though. See below.

Note that in addition to using as.factor(BigDT[["Date"]]) I also tried BigDT[,Date], and I tried creating the factor vector outside of set(). Creating the vector works, but passing it to set() fails in the same way.

Best,
Zach

> TestDT<-data.table(x=1:10,Date=seq(as.Date("2012-01-01"),as.Date("2012-01-10"),by="1 day"))
> set(TestDT,NULL,"DateFactor",as.factor(TestDT[["Date"]]))
> set(BigDT,NULL,"DateFactor",as.factor(BigDT[["Date"]]))
Error in set(BigDT, NULL, "DateFactor", as.factor(BigDT[["Date"]])) :
  Internal error, please report (including result of sessionInfo()) to datatable-help: oldtncol (0) < oldncol (41) but tl of class is marked.
> BigDT[,DateFactor:=as.factor(Date)]
> nrow(BigDT)
[1] 480743
> nlevels(BigDT[["DateFactor"]])
[1] 6119
> sessionInfo()
R version 3.2.1 (2015-06-18)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux stretch/sid

locale:
 [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8 LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8 LC_PAPER=en_US.UTF-8 LC_NAME=C
 [9] LC_ADDRESS=C LC_TELEPHONE=C LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C

attached base packages:
[1] stats graphics grDevices utils datasets methods base

other attached packages:
[1] data.table_1.9.6

loaded via a namespace (and not attached):
[1] tools_3.2.1 chron_2.3-47

From zach.okeeffe at gmail.com Mon Aug 29 04:32:52 2016
From: zach.okeeffe at gmail.com (Zachary O'Keeffe)
Date: Sun, 28 Aug 2016 22:32:52 -0400
Subject: [datatable-help] allocation error
Message-ID:

Hello,

While creating many transformed versions of variables in and outside of loops, I've been seeing this error lately:

Error in set(MapYrCtyAllGradeMiAreaPopSt, NULL, "StYearHwyMiGradeAllLogPerStPop", :
  Internal logical error. DT passed to assign has not been allocated enough column slots. l=100, tl=100, adding 1

I found out how to resolve it from this Stack Overflow post:
https://stackoverflow.com/questions/29615181/r-warning-when-creating-a-long-list-of-dummies
which directed me to the alloc.col help page.

I mainly wanted to write to suggest making the error a bit more informative/helpful, e.g. directing the user to look at the alloc.col/truelength help page. Also, I was wondering if you could explain why I ran into the error using set(), but not :=?

Thanks,
Zach

From aragorn168b at gmail.com Tue Aug 30 19:39:29 2016
From: aragorn168b at gmail.com (Arunkumar Srinivasan)
Date: Tue, 30 Aug 2016 19:39:29 +0200
Subject: [datatable-help] allocation error
In-Reply-To:
References:
Message-ID:

Sure, makes sense. Could you please file an issue here: https://github.com/Rdatatable/data.table/issues

Thanks!

--
Arun

On 29 August 2016 at 04:33:43, Zachary O'Keeffe (zach.okeeffe at gmail.com) wrote:

> Hello,
>
> While creating many transformed versions of variables in and outside of loops, I've been seeing this error lately:
>
> Error in set(MapYrCtyAllGradeMiAreaPopSt, NULL, "StYearHwyMiGradeAllLogPerStPop", :
>   Internal logical error. DT passed to assign has not been allocated enough column slots. l=100, tl=100, adding 1
>
> I found out how to resolve it from this Stack Overflow post:
> https://stackoverflow.com/questions/29615181/r-warning-when-creating-a-long-list-of-dummies
> which directed me to the alloc.col help page.
>
> I mainly wanted to write to suggest making the error a bit more informative/helpful, e.g. directing the user to look at the alloc.col/truelength help page. Also, I was wondering if you could explain why I ran into the error using set(), but not :=?
>
> Thanks,
> Zach
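
For the "allocation error" thread, the workaround the poster found (the alloc.col help page) can be sketched as follows. This is a minimal illustration and not a statement from the maintainers: the table `DT`, the column names `v1` to `v200` and the count `spare` are all made up for the example. It reserves column pointer slots once with alloc.col() and then adds columns in a loop with set(). As to why only set() hit the error, my understanding (not confirmed anywhere in this thread) is that `:=` inside `DT[...]` can grow the allocation on its own, whereas set() works directly on the object it is handed.

```r
library(data.table)

DT <- data.table(x = 1:3)          # hypothetical small table
spare <- 200L                      # hypothetical number of columns we plan to add

# Reserve `spare` free column pointer slots up front (done by reference).
alloc.col(DT, spare)
cat("columns used:", length(DT), " slots allocated:", truelength(DT), "\n")

# Add the columns by reference; each scalar value is recycled to nrow(DT).
for (k in seq_len(spare)) {
  set(DT, NULL, paste0("v", k), k)
}

cat("columns used:", length(DT), " slots allocated:", truelength(DT), "\n")
```

The design point is simply to pay for the reallocation once, before the loop, rather than relying on whatever default headroom the table was created with. I believe newer data.table releases also offer the same function under the name setalloccol(), but that is worth checking against the installed version.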
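
For the drought-counting question posted on 11 Aug ("Help summarizing monthly data sequences"), one possible approach is sketched below. It is only a sketch under stated assumptions, not the answer given on the list or on StackOverflow: it uses rleid() rather than the shift() or rolling joins the poster had in mind, it assumes `spei` has no missing values, it treats a value of exactly 0 as ending a negative run, and it counts a qualifying negative run that is still open at the end of the series as a drought. The function name `count_droughts` and the toy data are hypothetical.

```r
library(data.table)

# Count drought events per location: maximal runs of consecutive negative
# SPEI values whose minimum reaches -1.0 or less.
count_droughts <- function(spei_dt) {
  d <- spei_dt[order(id, month)]                  # sorted copy; input is untouched
  runs <- d[, .(negative = spei[1L] < 0,          # is this a below-zero run?
                min_spei = min(spei)),            # lowest value reached in the run
            by = .(id, run = rleid(id, spei < 0))]
  runs[, .(droughts = sum(negative & min_spei <= -1)), by = id]
}

# Toy data: location 1 has two qualifying runs, location 2 has none.
toy <- data.table(
  id    = rep(1:2, each = 9),
  month = rep(seq(as.Date("1990-01-01"), by = "month", length.out = 9), 2),
  spei  = c(0.5, -0.2, -1.3, -0.4, 0.8, -0.5, 0.3, -2.0, 1.0,
            0.1,  0.2, -0.9,  0.4, 0.5,  0.6, 0.7,  0.8, 0.9)
)
res <- count_droughts(toy)
print(res)
```

In the toy data, location 1 has the runs (-0.2, -1.3, -0.4) and (-2.0), which both reach -1.0 or less, plus the run (-0.5), which does not, so it counts 2 droughts. Location 2 only dips to -0.9 and counts 0.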