\documentclass[12pt]{report} \usepackage{amsbsy} % for \boldsymbol and \pmb \usepackage{amssymb} % for \blacksquare \usepackage{amsmath} \usepackage[colorlinks=true, pdfstartview=FitV, linkcolor=blue, citecolor=blue, urlcolor=blue]{hyperref} % For links \oddsidemargin=0in % Good for US Letter paper \evensidemargin=0in \textwidth=6.3in \topmargin=-0.5in \headheight=0.2in \headsep=0.5in \textheight=8.5in \begin{document} \begin{center} {\textbf{Rules for Parameter Identifiability}}\footnote{This handout was prepared by \href{http://www.utstat.toronto.edu/~brunner}{Jerry Brunner}, Department of Statistical Sciences, University of Toronto. It is licensed under a \href{http://creativecommons.org/licenses/by-sa/3.0/deed.en_US} {Creative Commons Attribution - ShareAlike 3.0 Unported License}. Use any part of it as you like and share the result freely. The \LaTeX~source code is available from the course website: \href{http://www.utstat.toronto.edu/~brunner/oldclass/2101f19} {\small\texttt{http://www.utstat.toronto.edu/$^\sim$brunner/oldclass/2101f19}}} \\ % These would be better with some consolidation of standardized and unstandardized. \end{center} \vspace{2mm} \noindent \textbf{Note:} Here, identifiable means identifiable from the covariance matrix. Models are re-parameterized to exclude expected values and intercepts. All the rules listed here assume that errors are independent of exogenous variables that are not errors. \begin{enumerate} \item \emph{Parameter Count Rule:} If a model has more parameters than covariance structure equations, the parameter vector can be identifiable on at most a set of volume zero in the parameter space. This applies to all models. \item \emph{Measurement model} (Factor analysis) In these rules, latent variables that are not error terms are described as ``factors." \begin{enumerate} \item \emph{Double Measurement Rule}: Parameters of the double measurement model are identifiable. All factor loadings equal one. Correlated measurement errors are allowed within sets of measurements, but not between sets. \item \emph{Three-Variable Rule for Standardized Factors}: The parameters of a measurement model will be identifiable if \begin{itemize} \item Errors are independent of one another. \item Each observed variable is caused by only one factor. \item The variance of each factor equals one. \item There are at least 3 variables with non-zero loadings per factor. \item The sign of one non-zero loading is known for each factor. \end{itemize} Factors may be correlated. \item \emph{Three-Variable Rule for Unstandardized Factors}: The parameters of a measurement model will be identifiable if \begin{itemize} \item Errors are independent of one another. \item Each observed variable is caused by only one factor. \item For each factor, at least one factor loading equals one. \item There are at least 2 additional variables with non-zero loadings per factor. \end{itemize} Factors may be correlated. \newpage \item \emph{Two-Variable Rule for Standardized Factors}: A factor with just two variables may be added to a measurement model whose parameters are identifiable, and the parameters of the combined model will be identifiable provided \begin{itemize} \item The errors for the two additional variables are independent of one another and of those already in the model. \item The two new variables are caused only by the new factor. \item The variance of the new factor equals one. \item Both new factor loadings are non-zero. \item The sign of one new loading is known. \item The new factor has a non-zero covariance with at least one factor already in the model. \end{itemize} \item \emph{Two-Variable Rule for Unstandardized Factors}: A factor with just two variables may be added to a measurement model whose parameters are identifiable, and the parameters of the combined model will be identifiable provided \begin{itemize} \item The errors for the two additional variables are independent of one another and of those already in the model. \item The two new variables are caused only by the new factor. \item At least one new factor loading equals one. \item The other new factor loading is non-zero. \item The new factor has a non-zero covariance with at least one factor already in the model. \end{itemize} \item \emph{Four-variable Two-factor Rule}: The parameters of a measurement model with two factors and four observed variables will be identifiable provided \begin{itemize} \item All errors are independent of one another. \item Each observed variable is influenced by only one factor. \item Two observed variables are influenced by one factor, and two are influenced by the other. \item All factor loadings are non-zero. \item For each factor, either the variance of the factor equals one and the sign of one new loading is known, or at least one factor loading equals one. \item The covariance of the two factors does not equal zero. \end{itemize} \item \emph{Combination Rule}: Suppose that the parameters of two measurement models are identifiable by any of the rules above. The two models may be combined into a single model provided that the error terms of the first model are independent of the error terms in the second model. The additional parameters of the combined model are the covariances between the two sets of factors, and these are all identifiable. \newpage \item \emph{Cross-over Rule}: Suppose that \begin{itemize} \item The parameters of a measurement models are identifiable, and \item For each factor there is at least one observable variable that is caused only by that factor (with a non-zero factor loading). \end{itemize} Then any number of new observable variables may be added to the model and the result is a model whose parameters are all identifiable, provided that \begin{itemize} \item The error terms associated with the new variables are independent of the error terms in the existing model. \end{itemize} Each new variable may be caused by any or all of the factors, potentially resulting in a cross-over pattern in the path diagram. The error terms associated with the new set of variables may be correlated with one another. Note that no new factors are added, just new observable variables. \item \emph{Error-Free Rule}: A vector of observable variables may be added to the factors of a measurement model whose parameters are identifiable. Suppose that \begin{itemize} \item The new observable variables are independent of the errors in the measurement model, and \item For each factor in the measurement model there is at least one observable variable that is caused only by that factor (with a non-zero factor loading). \end{itemize} Then the parameters of a new measurement model, where some of the variables are assumed to be measured without error, are identifiable. The practical consequence is that variables assumed to be measured without error may be included in the latent component of a structural equation model, provided that the measurement model for the other variables has identifiable parameters. \end{enumerate} % End of measurement model rules \item \emph{Latent variable model}: $\mathbf{Y}_i = \boldsymbol{\beta} \mathbf{Y}_i + \boldsymbol{\Gamma} \mathbf{X}_i + \boldsymbol{\epsilon}_i$~ Here, identifiability means that the parameters involved are functions of $cov(\mathbf{F}_i)=\boldsymbol{\Phi}$. \begin{enumerate} \item \emph{Regression Rule:} If no endogenous variables cause other endogenous variables, the model parameters are identifiable. \item \emph{Acyclic Rule}: Parameters of the Latent Variable Model are identifiable if the model is acyclic (no feedback loops through straight arrows) and the following conditions hold. \begin{itemize} \item Organize the variables that are not error terms into sets. Set 0 consists of all the exogenous variables. \item For $j=1,\ldots ,k$, each endogenous variable in set $j$ is influenced by at least one variable in set $j-1$, and also possibly by variables in earlier sets. \item Error terms for the variables in a set may have non-zero covariances. All other covariances between error terms are zero. These conditions are satisfied if $\boldsymbol{\Psi}$ is diagonal. \end{itemize} \end{enumerate} % End of latent variable model rules \newpage \item \emph{Two-Step Rule}: This applies to models with both a measurement component and a latent variable component, including the full two-stage structural equation model. \begin{itemize} \item[1:] Consider the latent variable model as a model for observed variables. Check identifiability (usually using the Regression Rule and the Acyclic Rule). \item[2:] Consider the measurement model as a factor analysis model, ignoring the structure of $cov(\mathbf{F}_i)$. Check identifiability. \end{itemize} If both identification checks are successful, the parameters of the combined model are identifiable. \end{enumerate} \end{document}