# BLUPF90

## I got weird messages and nonsense estimates in AI REML. What is it?

### Symptom

You may see the message "G not positive definite" or "corrected Covariance Matrix" during a REML iteration, and the (co)variance estimates look strange (zero or huge values). These are signs of divergence: the estimation is close to failing. Once this happens, the estimates are most likely meaningless, and you should not use them as estimated values.

### Mechanism

AI REML is efficient and reliable when the model is simple and there is enough data. However, AI REML is a purely numerical method, and its estimates are not guaranteed to stay within the parameter space; sometimes they become invalid, e.g., zero or negative variance components (this is what "not positive definite" means). The airemlf90 program tries to adjust the covariance estimates (hence the message "corrected Covariance Matrix"), but the correction is not perfect. In the end, the estimation may fail.
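To make "not positive definite" concrete, here is a minimal sketch in plain Python that tests a symmetric (co)variance matrix by attempting a Cholesky factorization. It is an illustration only, not the check that airemlf90 performs internally; the function name `is_positive_definite`, the tolerance, and the two example matrices are all assumptions made for this example.

```python
import math

def is_positive_definite(matrix, tol=1e-12):
    """Return True if a symmetric matrix admits a Cholesky factorization."""
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            partial = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                pivot = matrix[i][i] - partial
                if pivot <= tol:
                    # A zero or negative pivot means the matrix is outside
                    # the parameter space of valid covariance matrices.
                    return False
                lower[i][j] = math.sqrt(pivot)
            else:
                lower[i][j] = (matrix[i][j] - partial) / lower[j][j]
    return True

# Hypothetical 2x2 genetic (co)variance matrices, for illustration only.
g_valid = [[1.0, 0.5], [0.5, 2.0]]
g_invalid = [[1.0, 1.5], [1.5, 2.0]]  # implied correlation 1.5 / sqrt(2) > 1

print(is_positive_definite(g_valid))
print(is_positive_definite(g_invalid))
```

The second matrix has positive variances on the diagonal, yet the covariance is too large for them, so it fails the test. This shows that an estimate can leave the parameter space even when no variance is zero or negative.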

### Source of issues

Divergence can have several causes.

• The model is too complicated (too many variance components) for the amount of data (phenotypes)
1. Many traits: 5 traits with 1000 phenotypes would not work.
2. Complicated model: Many random-regression coefficients would not work.
3. Questionable model: Some variance components may not be estimable because of the data structure.
4. Unbalanced model: The computation becomes unstable if some effects apply only to a particular trait.
5. Inadequate model: The estimation fails if some effects are confounded or meaningless.
6. Unbalanced data: The estimation may fail if there are many missing observations.
• Mistakes in the data, pedigree, or parameter files
1. Incorrect model description
2. Duplicated animals in the pedigree
3. Wrong file format
• Just an accident
1. Dependence on initial values
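The "many traits" case in the list above can be put into numbers with a short sketch. It assumes, as an illustration only, that every random effect and the residual each have a full unstructured covariance matrix among all traits; the function name `n_covariance_parameters` is hypothetical, and no particular threshold of records per parameter is implied.

```python
def n_covariance_parameters(n_traits, n_random_effects):
    """Count unique (co)variances: t(t+1)/2 per effect, residual included."""
    per_effect = n_traits * (n_traits + 1) // 2
    return per_effect * (n_random_effects + 1)  # +1 for the residual matrix

# Example from the list above: 5 traits, one random (animal) effect.
n_params = n_covariance_parameters(n_traits=5, n_random_effects=1)
print(n_params)          # number of (co)variance parameters to estimate
print(1000 / n_params)   # phenotypes available per parameter
```

Under these assumptions a 5-trait animal model has 30 (co)variance parameters, so 1000 phenotypes leave roughly 33 records per parameter. A two-trait model of the same form has only 6.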

### Remedy

The following recommendations help to avoid divergence and to obtain stable estimates.

• Simplify the model
1. Start from the simplest model.
2. Split a big multiple-trait analysis into small two-trait analyses.
3. Reduce the number of random regressions.
• Look into the data structure
1. Remove meaningless or highly confounded random effects.
2. Remove traits with too many missing observations.
• Check the files
2. Use OPTION EM-REML 10, which runs the EM algorithm for the first 10 rounds to obtain initial values much closer to the final estimates.
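For the "Check the files" recommendation, one of the mistakes listed earlier (duplicated animals in the pedigree) is easy to screen for before running REML. The sketch below is a hedged example rather than part of the BLUPF90 programs. It assumes a whitespace-separated pedigree with the animal ID in the first column; the function name `duplicated_animals` and the sample records are hypothetical.

```python
from collections import Counter

def duplicated_animals(pedigree_lines):
    """Return animal IDs that appear more than once in the first column."""
    ids = [line.split()[0] for line in pedigree_lines if line.strip()]
    counts = Counter(ids)
    return sorted(animal for animal, n in counts.items() if n > 1)

# Hypothetical pedigree records: animal, sire, dam (0 = unknown parent).
pedigree = [
    "A1 0 0",
    "A2 0 0",
    "A3 A1 A2",
    "A3 A1 0",
]
print(duplicated_animals(pedigree))
```

The function accepts any iterable of lines, so an open file object can be passed in place of the list. An empty result means that no animal ID is repeated in the first column.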