[datatable-help] Order of DT after non-keyed by
Joseph Voelkel
jgvcqa at rit.edu
Wed Apr 11 03:39:20 CEST 2012
Thanks, Matthew, for not only the answer but also the historical context and keyby.
Joe
-----Original Message-----
From: Matthew Dowle [mailto:mdowlenoreply at virginmedia.com] On Behalf Of Matthew Dowle
Sent: Tuesday, April 10, 2012 8:22 PM
To: Joseph Voelkel
Cc: datatable-help at r-forge.wu-wien.ac.at
Subject: Re: [datatable-help] Order of DT after non-keyed by
Yes, correct behaviour. That's an ad hoc by which preserves the group order (the order of first appearance of each group). From ?data.table, the second part of this sentence :
"The order of the rows within each group is preserved, as is the order of the groups."
That was new in 1.6.3 :
o Ad hoc grouping now returns results in the same order each
group first appears in the table, rather than sorting the
groups. Thanks to Steve Lianoglou for highlighting. The order
of the rows within each group always has and always will be
preserved. For larger datasets a 'keyed by' is still faster;
e.g., by=key(DT).
To reorder the ad hoc by result, change 'by=' to 'keyby=' (in v1.8.0).
Not be confused with keyed by!
Matthew
On Tue, 2012-04-10 at 19:51 -0400, Joseph Voelkel wrote:
> Here is a simple example of a simple question:
>
>
>
> dt<-data.table(a=rep(1:5,1:5),b=1,c=rep(1:3,5))
>
> dt
>
> dt[,seq_along(b),by=a] # expected behavior (note: dt is already in
> order of a)
>
> setkey(dt,c) # to sort by c
>
> dt
>
> dt[,seq_along(b),by=a] # expected behavior? Appears to be in order of
> unique(dt$a)
>
>
>
> Thanks,
>
>
>
> Joe Voelkel
>
>
> _______________________________________________
> datatable-help mailing list
> datatable-help at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable
> -help
More information about the datatable-help
mailing list