[Subgroup-commits] r69 - in pkg/rsubgroup: . man
noreply at r-forge.r-project.org
noreply at r-forge.r-project.org
Mon Apr 20 09:35:52 CEST 2020
Author: atzmueller
Date: 2020-04-20 09:35:52 +0200 (Mon, 20 Apr 2020)
New Revision: 69
Modified:
pkg/rsubgroup/DESCRIPTION
pkg/rsubgroup/NEWS
pkg/rsubgroup/man/CreateSDTask.Rd
pkg/rsubgroup/man/DiscoverSubgroups.Rd
pkg/rsubgroup/man/SDTaskConfig-class.Rd
Log:
Modified: pkg/rsubgroup/DESCRIPTION
===================================================================
--- pkg/rsubgroup/DESCRIPTION 2020-03-15 21:42:07 UTC (rev 68)
+++ pkg/rsubgroup/DESCRIPTION 2020-04-20 07:35:52 UTC (rev 69)
@@ -1,8 +1,8 @@
Package: rsubgroup
Type: Package
Title: Subgroup Discovery and Analytics
-Version: 0.9
-Date: 2020-03-04
+Version: 1.0
+Date: 2020-04-20
Author: Martin Atzmueller
Maintainer: Martin Atzmueller <martin at atzmueller.net>
Description: A collection of efficient and effective tools and
Modified: pkg/rsubgroup/NEWS
===================================================================
--- pkg/rsubgroup/NEWS 2020-03-15 21:42:07 UTC (rev 68)
+++ pkg/rsubgroup/NEWS 2020-04-20 07:35:52 UTC (rev 69)
@@ -3,6 +3,12 @@
For a detailed list of changes, see the svn commit history at
svn://r-forge.r-project.org/svnroot/subgroup/
+Changes in version 1.0 (2020-04-20)
+===================================
+* internal enhancements in subgroup.jar, i.e., the VIKAMINE kernel library,
+ e.g. according to better error messages relating to the R connection.
+* Improved documentation and examples.
+
Changes in version 0.9 (2020-03-04)
===================================
* internal enhancements in subgroup.jar, i.e., the VIKAMINE kernel library
Modified: pkg/rsubgroup/man/CreateSDTask.Rd
===================================================================
--- pkg/rsubgroup/man/CreateSDTask.Rd 2020-03-15 21:42:07 UTC (rev 68)
+++ pkg/rsubgroup/man/CreateSDTask.Rd 2020-04-20 07:35:52 UTC (rev 69)
@@ -9,7 +9,10 @@
}
\arguments{
\item{source}{a data.frame or the a character string giving
-the filename of an ARFF file to use.}
+the filename of an ARFF file to use. Providing a file name
+directly provides the data to the subgroup discovery
+algorithms on the Java side, which is more memory efficient
+than converting the data frame to the Java representation.}
\item{target}{the target variable (constructed by as.target)
to consider for subgroup discovery.}
\item{config}{an instance of SDTaskConfig providing various
Modified: pkg/rsubgroup/man/DiscoverSubgroups.Rd
===================================================================
--- pkg/rsubgroup/man/DiscoverSubgroups.Rd 2020-03-15 21:42:07 UTC (rev 68)
+++ pkg/rsubgroup/man/DiscoverSubgroups.Rd 2020-04-20 07:35:52 UTC (rev 69)
@@ -10,7 +10,10 @@
}
\arguments{
\item{source}{a data.frame or the a character string giving
-the filename of an ARFF file to use.}
+the filename of an ARFF file to use. Providing a file name
+directly provides the data to the subgroup discovery
+algorithms on the Java side, which is more memory efficient
+than converting the data frame to the Java representation.}
\item{target}{the target variable (constructed by as.target)
to consider for subgroup discovery.}
\item{config}{an instance of SDTaskConfig providing various
Modified: pkg/rsubgroup/man/SDTaskConfig-class.Rd
===================================================================
--- pkg/rsubgroup/man/SDTaskConfig-class.Rd 2020-03-15 21:42:07 UTC (rev 68)
+++ pkg/rsubgroup/man/SDTaskConfig-class.Rd 2020-04-20 07:35:52 UTC (rev 69)
@@ -36,17 +36,20 @@
The default is \code{qf = "ps"}.
}
\item{\code{k}:}{The maximum number (top-k) of patterns
- to discover. The default is \code{k = 20}}
+ to discover, i.e., the best k rules according to the selected
+ quality function. The default is \code{k = 20}}
\item{\code{minqual}:}{The minimal quality (default \code{minqual = 0}).}
- \item{\code{minsize}:}{The minimal size of a subgroup
+ \item{\code{minsize}:}{The minimal size of a subgroup (as an integer)
(minimal coverage of database records, default \code{minsize = 0}).}
- \item{\code{mintp}:}{The minimal true positive (tp) threshold
+ \item{\code{mintp}:}{The minimal true positive (tp) threshold, an integer
(minimal (absolute) number of true positives in a subgroup, relevant for
binary target concepts only), defaults to \code{mintp = 0}}.
- \item{\code{maxlen}:}{The maximal description length of
- a pattern, i.e., the maximal number of conjunctions (default \code{maxlen = 7}).}
+ \item{\code{maxlen}:}{The maximal length of a description of
+ a pattern, i.e., the maximal number of conjunctions. This impacts both
+ understandability and efficiency. Simpler rules are easier to understand,
+ and a small \code{maxlen} will restrict the search space (default \code{maxlen = 7}).}
\item{\code{nodefaults}:}{Ignore default values, i.e.,
- do not include the respective first value of each
+ do not include the respective first value (with index 0) of each
attribute (default \code{nodefaults=FALSE}, i.e., include all values).}
\item{\code{relfilter}:}{Controls, whether irrelevant
patterns are filtered during pattern mining; negatively
More information about the Subgroup-commits
mailing list