[Traminer-users] Antw: Re: Antw: selecting the number of clusters
Rimantas Vosylis
rvosylis at live.com
Thu Jun 11 11:03:07 CEST 2015
Dear Gilbert,indeed I found the WeightedCluster package last night and it did help me bigtime in my analysis. In fact I rerun everything with PAM clustering and got better results. So thanks for this reference - it I would not have found it myself that would be a big help!
Rimantas
From: Gilbert.Ritschard at unige.ch
To: traminer-users at lists.r-forge.r-project.org
Date: Thu, 11 Jun 2015 07:06:19 +0000
Subject: Re: [Traminer-users] Antw: Re: Antw: selecting the number of clusters
Did you look at the possibilities offered by the WeightedCluster package of Matthias Studer? The package comes with a vignette that nicely documents the proposed
tools.
Gilbert
From: traminer-users-bounces at lists.r-forge.r-project.org
[mailto:traminer-users-bounces at lists.r-forge.r-project.org] On Behalf Of
Rimantas Vosylis
Sent: Wednesday, June 10, 2015 17:40
To: 'Users questions'
Subject: Re: [Traminer-users] Antw: Re: Antw: selecting the number of clusters
Dear Gerhard,
Indeed it is possible to input the distance martix, but it is also mandatory to specify the data
L I tried to input the sequence object as data but it does not work
L
I will look through the literature You suggested!
Rimantas
From:
traminer-users-bounces at lists.r-forge.r-project.org [mailto:traminer-users-bounces at lists.r-forge.r-project.org]
On Behalf Of Gerhard Wührer
Sent: Wednesday, June 10, 2015 1:55 PM
To: traminer-users at lists.r-forge.r-project.org
Subject: [Traminer-users] Antw: selecting the number of clusters
Hello,
please try the R - package 'nbclust' do decide how many clusters are feasable. In addition to that statistical measures,
inspect the different cluster solutions by content and how meaningful interpretations are. You can also do some kind of x-square test where you align the clusters with variables not used in the cluster analysis. At least you have some kind of face validity.
Best regards - Gerhard A. Wührer
o. Univ.-Prof. Dkfm. Dr. Gerhard A. Wührer
Institut für Handel, Absatz und Marketing
Johannes Kepler Universität Linz
Altenberger Str. 69
4040 Linz/Austria
tel.: 004373224689401
fax.:004373224689404
mail: gerhard.wuehrer at jku.at
URL:
www.marketing.jku.at
>>> Rimantas Vosylis <rvosylis at live.com> 10.06.2015
12:30 >>>
Dear Traminer users,
I am trying to build a typology of sequences by using cluster analysis with OM and Ward algorith.
I have a problem of choosing the number of clusters. I use several empirical indexes, but they don‘t help me a lot. I use Calinski
and harabasz (CH) index, but it has a peak at two cluster solution and the goes down. I also use average shilloute width but it gives me the similar results as CH index. I also run pseudo ANOVA to see which cluster solution explains most variance, but it tells
me the opposite – the more the clusters the higher the pseudo R2 gets. When I look at the various plots (e.g. seqdplot) I see that the most meaningful solutions (I have several types of sequences) lie somewhere between 4-6 clusters.
Could You perhaps suggest which indexes worked best for You and matched Your expectations / theoretical knowledge and that I could
use in my analysis?
Thank You in advance!!
Sincerely,
Rimantas Vosylis
PhD student, lecturer
Insitute of Psychology
Faculty of Social Technologies
Mykolas Romeris University
e-mail:
rimantasv at mruni.eu
e-mail2:
rvosylis at live.com
_______________________________________________
Traminer-users mailing list
Traminer-users at lists.r-forge.r-project.org
https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/traminer-users
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/traminer-users/attachments/20150611/70224066/attachment.html>
More information about the Traminer-users
mailing list