[Rcpp-devel] efficient ingestion of "sparse csv"

Vincent Carey stvjc at channing.harvard.edu
Tue May 11 04:56:10 CEST 2021


This problem has been discussed in various places but I don't
see a clear solution.  Certain applications are generating
large comma-delimited files with mostly zero entries.  The aim
is to ingest efficiently, converting to sparse representation
a record at a time.  Presumably a triplet format would be the
initial internal representation, with an aim to convert at
the end to Matrix dgCmatrix format.  Has anyone tackled this
in Rcpp or RcppArmadillo?

-- 
The information in this e-mail is intended only for the person to whom it 
is
addressed. If you believe this e-mail was sent to you in error and the 
e-mail
contains patient information, please contact the Partners Compliance 
HelpLine at
http://www.partners.org/complianceline 
<http://www.partners.org/complianceline> . If the e-mail was sent to you in 
error
but does not contain patient information, please contact the sender 
and properly
dispose of the e-mail.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/rcpp-devel/attachments/20210510/27cab85b/attachment.html>


More information about the Rcpp-devel mailing list