[datatable-help] Order of DT after non-keyed by

Matthew Dowle mdowle at mdowle.plus.com
Wed Apr 11 02:22:07 CEST 2012


Yes, that's the correct behaviour. It's an ad hoc by, which preserves the
group order (the order in which each group first appears). See the second
part of this sentence from ?data.table:

"The order of the rows within each group is preserved, as is the order
of the groups."

That was new in 1.6.3 :

o   Ad hoc grouping now returns results in the same order each 
    group first appears in the table, rather than sorting the
    groups. Thanks to Steve Lianoglou for highlighting. The order
    of the rows within each group always has and always will be 
    preserved. For larger datasets a 'keyed by' is still faster;
    e.g., by=key(DT).

To get the ad hoc by result sorted by group instead, change 'by=' to
'keyby=' (new in v1.8.0). Not to be confused with a keyed by!
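
A minimal sketch of the difference, reusing the dt from the question below.
It assumes a data.table version that has keyby (v1.8.0 or later); the names
adhoc and keyed are only for illustration, and length(b) stands in for
whatever j you actually need.

```r
library(data.table)

dt <- data.table(a = rep(1:5, 1:5), b = 1, c = rep(1:3, 5))
setkey(dt, c)                          # dt is now sorted by c, so a is out of order

# Ad hoc by: groups come back in order of first appearance in dt.
adhoc <- dt[, length(b), by = a]
adhoc$a                                # same as unique(dt$a)

# keyby: same grouping, but the result is sorted by a and keyed on a.
keyed <- dt[, length(b), keyby = a]
keyed$a                                # sorted: 1 2 3 4 5
key(keyed)                             # "a"
```

A keyed by is a different thing again: it is a by on the columns dt is
already keyed on, e.g. dt[, length(b), by = key(dt)] groups by c here.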

Matthew


On Tue, 2012-04-10 at 19:51 -0400, Joseph Voelkel wrote:
> Here is a simple example of a simple question:
> 
>  
> 
> library(data.table)
> 
> dt<-data.table(a=rep(1:5,1:5),b=1,c=rep(1:3,5))
> dt
> dt[,seq_along(b),by=a] # expected behavior (note: dt is already in order of a)
> 
> setkey(dt,c) # to sort by c
> dt
> dt[,seq_along(b),by=a] # expected behavior? Appears to be in order of unique(dt$a)
> 
>  
> 
> Thanks,
> 
>  
> 
> Joe Voelkel
> 
> 
> _______________________________________________
> datatable-help mailing list
> datatable-help at lists.r-forge.r-project.org
> https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help



