[Subgroup-commits] r63 - in pkg/rsubgroup: . R inst/java man

noreply at r-forge.r-project.org
Wed Jul 1 21:51:32 CEST 2015


Author: atzmueller
Date: 2015-07-01 21:51:32 +0200 (Wed, 01 Jul 2015)
New Revision: 63

Modified:
   pkg/rsubgroup/DESCRIPTION
   pkg/rsubgroup/NEWS
   pkg/rsubgroup/R/subgroup.R
   pkg/rsubgroup/inst/java/subgroup.jar
   pkg/rsubgroup/man/SDTaskConfig-class.Rd
   pkg/rsubgroup/man/subgroup-package.Rd
Log:
  * Implement and enable a new quality function: adjusted residuals
  (cf. Agresti 2007), selected with qf="ares".
  * For a binary target variable, the resulting patterns now also store, in the
  parameters field, the chi-squared value comparing subgroup and population
  w.r.t. the target.
  * ToDataFrame shows the chi-squared value for a binary target.
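
The two statistics named in the log are closely related for a binary target, and a small base-R sketch may help when checking values by hand. It uses only stats::chisq.test and does not call rsubgroup or the VIKAMINE jar. The counts are hypothetical, and it is an assumption that the package's chi2 and "ares" values correspond to the uncorrected textbook definitions used here; the commit does not say whether a continuity correction is applied.

```r
# Hypothetical counts, chosen only for illustration (not taken from the package).
N  <- 1000  # population size
n  <- 200   # subgroup size
tp <- 100   # target-positive instances inside the subgroup
P  <- 300   # target-positive instances in the whole population

# 2x2 contingency table: subgroup vs. complement, target positive vs. negative.
tab <- matrix(c(tp, P - tp, n - tp, (N - n) - (P - tp)), nrow = 2,
              dimnames = list(group  = c("subgroup", "complement"),
                              target = c("pos", "neg")))

# Pearson chi-squared test without continuity correction (assumption, see above).
test <- chisq.test(tab, correct = FALSE)
chi2 <- unname(test$statistic)

# chisq.test reports adjusted (standardized) residuals in $stdres.
ares <- test$stdres["subgroup", "pos"]

# The same residual written out in subgroup-discovery terms (p, p0, n, N).
p  <- tp / n
p0 <- P / N
ares_manual <- (n * p - n * p0) / sqrt(n * p0 * (1 - p0) * (1 - n / N))

print(round(chi2, 3))         # 47.619
print(round(ares, 3))         # 6.901
print(round(ares_manual, 3))  # 6.901
```

For a 2x2 table the squared adjusted residual equals the Pearson chi-squared statistic, so the two values rank subgroups identically in magnitude; the residual additionally carries the sign of the deviation from the population share.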

Modified: pkg/rsubgroup/DESCRIPTION
===================================================================
--- pkg/rsubgroup/DESCRIPTION	2015-06-19 07:12:54 UTC (rev 62)
+++ pkg/rsubgroup/DESCRIPTION	2015-07-01 19:51:32 UTC (rev 63)
@@ -2,7 +2,7 @@
 Type: Package
 Title: Subgroup Discovery and Analytics
 Version: 0.7
-Date: 2015-06-19
+Date: 2015-07-01
 Author: Martin Atzmueller
 Maintainer: Martin Atzmueller <martin at atzmueller.net>
 Description: A collection of efficient and effective tools and

Modified: pkg/rsubgroup/NEWS
===================================================================
--- pkg/rsubgroup/NEWS	2015-06-19 07:12:54 UTC (rev 62)
+++ pkg/rsubgroup/NEWS	2015-07-01 19:51:32 UTC (rev 63)
@@ -3,7 +3,7 @@
 For a detailed list of changes, see the svn commit history at
 svn://r-forge.r-project.org/svnroot/subgroup/
 
-Changes in version 0.7 (2015-06-xx)
+Changes in version 0.7 (2015-07-xx)
 ===================================
 * Improvements
   * document setting Java heap space before loading the rsubgroup library.
@@ -17,7 +17,13 @@
   function, a match of a pattern and a data instance can be checked now.
   * In SDTaskConfig, postfilter can be a single filter or a vector of filters,
   that are then applied in order on the results. This allows e.g., the combination
-  of minimal improvement filtering with weighted covering post-processing. 
+  of minimal improvement filtering with weighted covering post-processing.
+  * Implement/enable new quality function (Adjusted residuals, cf. Agresti 2007)
+  ==> qf="ares"
+  * For a binary target variable, the resulting patterns now also store the
+  chi-squared value comparing subgroup and population w.r.t. the target in the
+  parameters field.
+  * ToDataFrame shows the chi-squared value for a binary target.
 
 * Bug fixes:
   * fix providing attributes=NULL (i.e., automatically include all attributes)

Modified: pkg/rsubgroup/R/subgroup.R
===================================================================
--- pkg/rsubgroup/R/subgroup.R	2015-06-19 07:12:54 UTC (rev 62)
+++ pkg/rsubgroup/R/subgroup.R	2015-07-01 19:51:32 UTC (rev 63)
@@ -138,7 +138,8 @@
     size <- J(J(sg, "getStatistics"), "getSubgroupSize")
     p <- J(J(sg, "getStatistics"), "getP")
     p0 <- J(J(sg, "getStatistics"), "getP0")
-    return(list(p = p, p0 = p0, size = size))
+    chi2 <- J("org.vikamine.kernel.subgroup.SGUtils")$calculateChi2OfSubgroup(J(sg, "getStatistics"))
+    return(list(p = p, p0 = p0, chi2=chi2, size = size))
   } else if (J(target, "isNumeric")) {
     size <- J(J(sg, "getStatistics"), "getSubgroupSize")
     mean <- J(J(sg, "getStatistics"), "getSGMean")
@@ -263,6 +264,9 @@
   sizes <- list()
   length(sizes) <- length(patterns)
   ps <- list()
+  length(ps) <- length(patterns)
+  chi2 <- list()
+  length(chi2) <- length(patterns)
   
   i <- 1
   for (pattern in patterns) {
@@ -274,6 +278,7 @@
       isNumeric <- TRUE
     } else {
       ps[i] <- .FormatDoubleSignificantDigits(pattern@parameters$p, ndigits)
+      chi2[i] <- .FormatDoubleSignificantDigits(pattern@parameters$chi2, ndigits)
       isNumeric <- FALSE
     }
     i <- i + 1
@@ -289,6 +294,7 @@
         quality=as.vector(qualities, "numeric"),
         p=as.vector(ps, "numeric"), 
         size=as.vector(sizes, "numeric"),
+        chi2=as.vector(chi2, "numeric"),
         description=as.vector(descriptions, "character"))
   }
   return(dataframe)

Modified: pkg/rsubgroup/inst/java/subgroup.jar
===================================================================
(Binary files differ)

Modified: pkg/rsubgroup/man/SDTaskConfig-class.Rd
===================================================================
--- pkg/rsubgroup/man/SDTaskConfig-class.Rd	2015-06-19 07:12:54 UTC (rev 62)
+++ pkg/rsubgroup/man/SDTaskConfig-class.Rd	2015-07-01 19:51:32 UTC (rev 63)
@@ -15,11 +15,12 @@
 \section{Slots}{
   \describe{
     \item{\code{qf}:}{A quality function; one of:
-	Binomial-Test \code{bin},
-	Chi-Square-Test \code{chi2},
+    Adjusted Residuals \code{ares},
+	Binomial Test \code{bin},
+	Chi-Square Test \code{chi2},
+	Gain \code{gain},
 	Lift \code{lift},
 	Piatetsky-Shapiro \code{ps},
-	Gain \code{gain},
 	Relative Gain \code{relgain},
 	Weighted Relative Accuracy \code{wracc}.
 	The default is \code{qf = "ps"}.

Modified: pkg/rsubgroup/man/subgroup-package.Rd
===================================================================
--- pkg/rsubgroup/man/subgroup-package.Rd	2015-06-19 07:12:54 UTC (rev 62)
+++ pkg/rsubgroup/man/subgroup-package.Rd	2015-07-01 19:51:32 UTC (rev 63)
@@ -29,7 +29,7 @@
 Package: \tab rsubgroup\cr
 Type: \tab Package\cr
 Version: \tab 0.7\cr
-Date: \tab 2015-06-xx\cr
+Date: \tab 2015-07-xx\cr
 License: \tab GPL (>= 3)\cr
 LazyLoad: \tab yes\cr
 }


