<html><body><div style="font-family: Arial, Helvetica, sans-serif; font-size: 12pt; color: #000000"><div>I'll try two answers : </div><div><br></div><div>1/ your question is not a simple technical decision, it's also a research choice and we can't answer without knowing your dataset and your research objectives. For example, you have 30 time steps (1 per year) and If you work with calendar-time: for the 30 years old you have 30 values, and for the 25 years old 25 values. You could assign null values during the first 5 years for 25 yo individuals. Another option would be to align each individual at his birthday year (time as process). On both cases, if you compute a distance in your dataset, sure the cohort will impact the results, but you can't erase the differences between 30 and 25 yo individuals, they do exist.<br></div><div><br></div><div>2/ if you want to minimize the importance of the cohort, the easiest way is to suppress the time as quantity and consider only the succession of states. Convert your sequences into distinct states sequences (seqdss) and compute your distances with this DSS object.<br></div><div><br></div><div>Hope it helps.<br></div><div><br></div><div>Hadrien<br></div><div><br></div><hr id="zwchr"><div style="color:#000;font-weight:normal;font-style:normal;text-decoration:none;font-family:Helvetica,Arial,sans-serif;font-size:12pt;" data-mce-style="color: #000; font-weight: normal; font-style: normal; text-decoration: none; font-family: Helvetica,Arial,sans-serif; font-size: 12pt;"><b>De: </b>"Rimantas Vosylis" <rvosylis@live.com><br><b>À: </b>traminer-users@lists.r-forge.r-project.org<br><b>Envoyé: </b>Lundi 16 Février 2015 13:41:30<br><b>Objet: </b>[Traminer-users] linking short sequences with clusters based on long sequences<br><div><br></div><style><!--
@font-face
{font-family:Wingdings;
panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;
mso-fareast-language:EN-US;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:#0563C1;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:#954F72;
text-decoration:underline;}
p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph
{mso-style-priority:34;
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;
mso-fareast-language:EN-US;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri",sans-serif;
mso-fareast-language:EN-US;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:3.0cm 1.0cm 2.0cm 3.0cm;}
div.WordSection1
{page:WordSection1;}
@list l0
{mso-list-id:1037894516;
mso-list-type:hybrid;
mso-list-template-ids:195356550 69664783 69664793 69664795 69664783 69664793 69664795 69664783 69664793 69664795;}
@list l0:level1
{mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level2
{mso-level-number-format:alpha-lower;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level3
{mso-level-number-format:roman-lower;
mso-level-tab-stop:none;
mso-level-number-position:right;
text-indent:-9.0pt;}
@list l0:level4
{mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level5
{mso-level-number-format:alpha-lower;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level6
{mso-level-number-format:roman-lower;
mso-level-tab-stop:none;
mso-level-number-position:right;
text-indent:-9.0pt;}
@list l0:level7
{mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level8
{mso-level-number-format:alpha-lower;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level9
{mso-level-number-format:roman-lower;
mso-level-tab-stop:none;
mso-level-number-position:right;
text-indent:-9.0pt;}
ol
{margin-bottom:0cm;}
ul
{margin-bottom:0cm;}
--></style><div class="WordSection1"><p class="MsoNormal">Dear Traminer users and experts,</p><p class="MsoNormal"> </p><p class="MsoNormal">I wrote this question few weeks ago but no one answered. I will make it brief this time, so maybe I will get some response <span style="font-family:Wingdings" data-mce-style="font-family: Wingdings;">J</span></p><p class="MsoNormal"> </p><p class="MsoNormal">I am interested in transitions to adulthood. I have two groups one is called 30-year-olds and another one - 25-year-olds. For both of these groups I have a sequence of life situation statuses. For 30-year-olds the sequence is longer than for 25-year-olds.</p><p class="MsoNormal"> </p><p class="MsoNormal">I want find the typology these sequences (transitions to adulthood) and I also want to assign sequences of 25-year-olds and 30-year-olds to these types (trajectories).</p><p class="MsoNormal">So the main issue for me is how can I assign the 25-year-olds that have shorter sequences to the clusters that were found based on analyses that also would include 30-year-old group.</p><p class="MsoNormal">I came up with several strategies, but I am not sure which on is better, or maybe there is something else I can do but I don’t know.</p><p class="MsoNormal"> </p><p class="MsoListParagraph" style="text-indent:-18.0pt;mso-list:l0 level1 lfo1" data-mce-style="text-indent: -18.0pt; mso-list: l0 level1 lfo1;"><span style="mso-list:Ignore" data-mce-style="mso-list: Ignore;">1.<span style="font:7.0pt "Times New Roman"" data-mce-style="font: 7.0pt 'Times New Roman';"> </span></span>The first strategy is that I simply run optimal matching calculations for the full dataset (including the ones that have long sequences and shorter ones) and those that have shorter ones’ are already assigned to some cluster.</p><p class="MsoNormal">Q1. My first question to You is: does this seem like a valid strategy to assign 25-year-olds to the clusters that are actually created using also 30-year-olds?</p><p class="MsoNormal"> </p><p class="MsoNormal">2. The second strategy is that I first analyze only 30-year-olds, then I extract ideal types representing each cluster, then I include these ideal types into dataset of only 25-year-olds and I rerun Optimal matching analysis. Then based on the shortest distance from each ideal type sequence to each participants’ sequence I assign them to those clusters. Something similar was discussed by Martin, P., Schoon, I., Ross, A., Beyond Transitions: Applying Optimal Matching to Life Course Research</p><p class="MsoNormal"> </p><p class="MsoNormal">Q2. Does this seem like a more valid strategy than the first one?</p><p class="MsoNormal"> </p><p class="MsoNormal">Q3. Perhaps You could provide another option on how to do such assigning?</p><p class="MsoNormal"> </p><p class="MsoNormal"><b>I would really appreciate any help on any of these questions. </b></p><p class="MsoNormal"> </p><p class="MsoNormal">Rimantas</p><p class="MsoNormal"> </p></div><br>_______________________________________________<br>Traminer-users mailing list<br>Traminer-users@lists.r-forge.r-project.org<br>https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/traminer-users</div><div><br></div></div></body></html>