[Rprotobuf-commits] r904 - papers/jss
noreply at r-forge.r-project.org
noreply at r-forge.r-project.org
Tue Nov 25 00:25:50 CET 2014
Author: murray
Date: 2014-11-25 00:25:50 +0100 (Tue, 25 Nov 2014)
New Revision: 904
Modified:
papers/jss/article.Rnw
papers/jss/article.bib
Log:
Improve section 7 and update the conclusions to note we've been using
this package at Google for 5 years, not 3 years. Done with section 7
updates. 6 and 8 need work.
Modified: papers/jss/article.Rnw
===================================================================
--- papers/jss/article.Rnw 2014-11-24 23:06:05 UTC (rev 903)
+++ papers/jss/article.Rnw 2014-11-24 23:25:50 UTC (rev 904)
@@ -1078,7 +1078,10 @@
RPC protocols, and for storing structured information in a variety of
persistent storage systems since 2000 \citep{dean2009designs}. The
\pkg{RProtoBuf} package has been in widespread use by hundreds of
-analysts at Google since 2010.
+statisticians and software engineers at Google since 2010. This
+section describes a simplified example of a common design pattern of
+collecting a large structured data set in one language for later
+analysis in \proglang{R}.
Many large data sets in fields such as particle physics and information
processing are stored in binned or histogram form in order to reduce
@@ -1196,8 +1199,9 @@
@
\end{center}
-One of the authors has used this design pattern for several
-large-scale studies of distributed storage systems
+One of the authors has used this design pattern with large-scale \proglang{C++}
+MapReduces over very large data sets to write out histogram protocol
+buffers for several large-scale studies of distributed storage systems
\citep{sciencecloud,janus}.
\section{Application: Data interchange in web services}
@@ -1372,7 +1376,7 @@
and extends the \proglang{R} system with the ability to create, read,
write, parse, and manipulate Protocol
Buffer messages. \pkg{RProtoBuf} has been used extensively inside Google
-for the past three years by statisticians, analysts, and software engineers.
+for the past five years by statisticians, analysts, and software engineers.
At the time of this writing there are over 300 active
users of \pkg{RProtoBuf} using it to read data from and otherwise interact
with distributed systems written in \proglang{C++}, \proglang{Java}, \proglang{Python}, and
Modified: papers/jss/article.bib
===================================================================
--- papers/jss/article.bib 2014-11-24 23:06:05 UTC (rev 903)
+++ papers/jss/article.bib 2014-11-24 23:25:50 UTC (rev 904)
@@ -1,3 +1,9 @@
+ at article{dean2009designs,
+ title={Designs, lessons and advice from building large distributed systems},
+ author={Dean, Jeff},
+ journal={Keynote from LADIS},
+ year={2009}
+}
@article{eddelbuettel2011rcpp,
title = {Rcpp: Seamless R and C++ Integration},
author = {Dirk Eddelbuettel and Romain Fran{\c{c}}ois},
More information about the Rprotobuf-commits
mailing list