{"id":267,"date":"2017-04-18T03:36:23","date_gmt":"2017-04-18T03:36:23","guid":{"rendered":"http:\/\/sites.warnercnr.colostate.edu\/gwhite\/?page_id=267"},"modified":"2017-04-18T03:36:23","modified_gmt":"2017-04-18T03:36:23","slug":"data-cloning","status":"publish","type":"page","link":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/data-cloning\/","title":{"rendered":"Data Cloning"},"content":{"rendered":"<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/index.html\">Contents<\/a> &#8211; <a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/idx.htm\">Index<\/a><\/span><\/p>\n<hr \/>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: medium\"><b>Data Cloning<\/b><\/span><\/p>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">Cloning the data is a numerical approach to identifying parameters that are not estimable (<\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/pertinentliterature.htm\">Lele et al. 2007<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">, <\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/pertinentliterature.htm\">Lele et al. 2010<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">).\u00a0 The reason for a parameter not being estimable is because it may be confounded with 1 or more other parameters.\u00a0 An example is the last phi and p combination in a Cormack-Jolly-Seber model, where only the product of phi and p can be estimated, but not the unique values of each.<\/span><\/p>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">The data are cloned by including multiple copies of the <\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/encounter_histories_file.htm\">encounter histories<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">, i.e., duplicating the encounter histories.\u00a0 In MARK, all that needs to be done is to multiply the encounter history frequencies of each group by the number of clones desired.\u00a0 Consider the example of cloning the data 100 times.\u00a0 An encounter history for an analysis with 2 groups and no <\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/individual_covariates.htm\">individual covariates<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\"> that looks like this:<\/span><\/p>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">\u00a0 11001010010 3 2;<\/span><\/p>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">could be cloned 100 times by entering the following encounter history:<\/span><\/p>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">\u00a0 11001010010 300 200;<\/span><\/p>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">By cloning the data, the sample size is increased without changing the parameter estimates.\u00a0 So, if the original estimates are compared to the cloned estimates, the values of the estimates will remain the same for parameters that are not confounded and are otherwise properly estimated.\u00a0 However, because the sample size has been increased, the standard errors of the cloned estimates will be smaller than the original standard errors.\u00a0 The expected result for parameters that are estimable is SE(original) = SE(cloned)*sqrt(number of clones).\u00a0 As an example, if the data are cloned 100 times, then the standard errors of the cloned data will be 1\/10 of the original standard errors.<\/span><\/p>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">MARK has the option to clone the data in the <\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/results_browser.htm\">Results Browser<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\"> under the Output | Specific Model Output | Data Cloning menu choice.\u00a0 You should highlight the model you want to use for estimation before making this menu choice.\u00a0 When this menu choice is selected, you are asked to enter the number of copies (clones) of the data for the analysis, with the default value being 100.\u00a0 Once the value has been entered and the OK button pushed, MARK will generate new estimates with the cloned data to compare with the estimates from the model highlighted in the <\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/results_browser.htm\">Results Browser<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">. The original estimates and the new estimates from the cloned data are presented in an Excel spreadsheet so that you can compare the estimates and their standard errors.<\/span><\/p>\n<p><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\">The confidence intervals can also be compared, and the use of <\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/profile_likelihood_confidence_intervals.htm\">profile likelihood confidence intervals<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\"> is suggested for examining parameter estimates at boundaries.\u00a0 That is, a parameter at a boundary, e.g., a survival estimate equal to 1, will generally have a zero (or at least unrealistically small) standard error. Cloning the data does not change this small standard error.\u00a0 However, if you have computed <\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/profile_likelihood_confidence_intervals.htm\">profile likelihod confidence intervals<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\"> for this parameter, the <\/span><span style=\"font-family: Arial, helvetica, sans-serif;font-size: small\"><a href=\"http:\/\/warnercnr.colostate.edu\/~gwhite\/mark\/markhelp\/profile_likelihood_confidence_intervals.htm\">profile likelihood confidence intervals<\/a><\/span><span style=\"color: #0000ff;font-family: Arial, helvetica, sans-serif;font-size: small\"> for the cloned data will be considerably shorter (assuming you clone a 100 copies) than the original data.\u00a0 So, data cloning is also useful for verifying that a parameter estimated at the boundary is also estimable.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Contents &#8211; Index Data Cloning Cloning the data is a numerical approach to identifying parameters that are not estimable (Lele et al. 2007, Lele et al. 2010).\u00a0 The reason for a parameter not being estimable is because it may be &hellip;<\/p>\n<p class=\"read-more\"> <a class=\"more-link\" href=\"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/data-cloning\/\"> <span class=\"screen-reader-text\">Data Cloning<\/span> Read More &raquo;<\/a><\/p>\n","protected":false},"author":117,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-267","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/wp-json\/wp\/v2\/pages\/267","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/wp-json\/wp\/v2\/users\/117"}],"replies":[{"embeddable":true,"href":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/wp-json\/wp\/v2\/comments?post=267"}],"version-history":[{"count":1,"href":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/wp-json\/wp\/v2\/pages\/267\/revisions"}],"predecessor-version":[{"id":268,"href":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/wp-json\/wp\/v2\/pages\/267\/revisions\/268"}],"wp:attachment":[{"href":"https:\/\/sites.warnercnr.colostate.edu\/gwhite\/wp-json\/wp\/v2\/media?parent=267"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}