tinytheorems - about groups - rings - modules - models atom/rss

Exploration 1: The canonical model of a theory

.

The goal of this exploration is to show what it takes for a theory TT to have a model.

Each section will prove that TT has a model for increasingly wider classes of TT, starting with TT only containing atomic sentences, and ending with TT containing first-order sentences.

In this section, we construct the canonical model for a set of atomic sentences TT.

For now, we will only discuss theories TT where TT is a set of atomic sentences (atomic formulas with no variables xˉ\bar{x}). We’ll prove that there is always a model for TT, the canonical model of TT.

Outline:

Let’s start. We’d like to extend our theory TT to give it some properties that make it easier to find a model for it. Here are two such properties:

Intuitively, (congruence) says if two terms are equal, then you should be able to swap them in any sentence in TT and the result will still be in TT.

If a set of atomic sentences TT has both (reflexivity) and (congruence), then it is ==-closed (in LL). So we’ll go ahead and extend our set of atomic sentences to its ==-closure, i.e. its minimal ==-closed extension.

Now let’s use these properties to find a model for TT.

Building the canonical model

First of all, what is the underlying set? By definition of \models, we know that to model equalities in TT, we must interpret both sides of the equality to be the same. An easy way to do this is to have the elements of AA be equivalence classes where t1=t2Tt_1=t_2\in T puts t1t_1 and t2t_2 in the same equivalence class.

We prove t1=t2Tt_1=t_2\in T is an equivalence relation on tit_i. By (reflexivity) we already have reflexivity. By (congruence) if t1=t2t_1=t_2 is in TT then we can swap the terms to get t2=t1t_2=t_1 also in TT, so we have symmetry, and by a similar argument we have transitivity. So t1=t2Tt_1=t_2\in T is a valid equivalence relation.

Therefore, the underlying set for our model shall be equivalence classes of closed terms, where t1t_1 and t2t_2 are equivalent if t1=t2Tt_1=t_2\in T.

Since terms are now equivalence classes, our constant symbols must also be equivalence classes, and our function symbols and relation symbols all must operate on equivalence classes. We do this by lifting the relations RR in LL to take equivalence classes instead of terms.

Let’s show that this definition preserves the truth of relations in TT. This requires showing that in LL, whenever t1t_1 and t2t_2 are terms in the same equivalence class (of equality in T), R(t1)R(t_1) is the same as R(t2)R(t_2). But this is true because of (congruence). More generally, we must show R(sˉ)=R(tˉ)R(\bar{s})=R(\bar{t}) for all length-nn sequences of closed terms sˉ\bar{s} and tˉ\bar{t} where corresponding sis_i and tit_i are in the same equivalence class. But this is true by applying (congruence) nn times. Thus, lifting the relation symbols from LL to apply to equivalence classes is well-defined, and gives us a set of relation symbols for AA.

What about the constants and function symbols of AA? The constant elements cc and function symbols FF of LL can be lifted from LL into AA by having them return equivalence classes of terms rather than single terms. We again apply (congruence) recursively to show F(tˉ1)F(\bar{t}_1) and F(tˉ2)F(\bar{t}_2) are in the same equivalence class, and so this is well-defined.

Summary

In summary, by defining this LL-structure AA on the ==-closure of TT, we’ve built the canonical model of TT that, by construction, models our given set of atomic sentences TT.

In this section, we prove that every model of TT is isomorphic to the canonical model, assuming TT is composed of atomic sentences.

First we define homomorphisms between LL-structures. The idea of a homomorphism is that it set every symbol in an LL-structure AA to a corresponding symbol in another LL-structure BB (symbols may overlap).

An isomorphism is an invertible homomorphism. This necessarily means each unique symbol in AA maps to a corresponding unique symbol in BB, i.e. it’s a renaming.

We’d like to prove that the canonical model is unique – that any two canonical models MM, NN are equal. But models generally aren’t unique since you can always swap around the names of its elements to get an equivalent model. So we prove instead that the canonical model is unique up to isomorphism – any two canonical models are renamings of each other.

Suppose that MM is the canonical model for TT and NN is some other arbitrary model of TT. Define the homomorphism f:MNf:M\to N that maps equivalence classes [t]M[t]\in M to the element tNNt^N\in N.

Therefore ff is an isomorphism. Since we’ve constructed an isomorphism from the canonical model MM to an arbitrary model NN of TT, this means every model of TT is isomorphic to the canonical model, i.e. the canonical model is unique up to isomorphism.

In this section, we show certain first-order theories are also modelled by the canonical model.

The language LL we work with is often a subset of LωL_{\infty\omega} (“infinitary logic”), an extension of first-order logic. The first symbol \infty means that the maximum size of conjunctions \land and disjunctions \lor in formulas is infinite. The second symbol ω\omega means that the maximum number of quantifiers can be large but finite.

By taking all sentences in LωL_{\infty\omega} true for a structure AA, we obtain its first-order theory Th(A)Th(A). So every structure has a first-order theory. Conversely, does every first-order theory have a model, like we showed before for atomic sentence theories? The answer turns out to be no, not every first-order theory has a model.

But by extending some atomic sentence theory TT with certain first-order sentences, it’s possible to still have the same canonical model be a model of the extended TT, as long as the resulting first-order theory maintains the following consistency properties:

  1. Consistency: ϕT\phi\in T implies ¬ϕchar "338T\lnot\phi\notin T for atomic sentences ϕ\phi
  2. Congruence: ϕ(s)T\phi(s)\in T iff ϕ(t)T\phi(t)\in T for atomic formulas ϕ\phi and closed terms s,ts,t of LωL_{\infty\omega}, whenever s=tTs=t\in T
  3. Reflexivity: (t=t)T(t=t)\in T for every closed term tt of LωL_{\infty\omega}
  4. Double negation: ¬¬ϕT\lnot\lnot\phi\in T implies ϕT\phi\in T
  5. Closure under \land: ΦT\bigwedge\Phi\in T implies ΦT\Phi\subseteq T
  6. Its inverse: ¬ΦT\lnot\bigwedge\Phi\in T implies ψΦ.¬ψT\exists\psi\in\Phi\ldotp\lnot\psi\in T
  7. Closure under \lor: ΦT\bigvee\Phi\in T implies ψΦ.ψT\exists\psi\in\Phi\ldotp\psi\in T
  8. Its inverse: ¬ΦT\lnot\bigvee\Phi\in T implies ψΦ.¬ψT\forall\psi\in\Phi\ldotp\lnot\psi\in T
  9. Closure under \forall: x.ϕ(x)T\forall x\ldotp\phi(x)\in T implies ϕ(t)T\phi(t)\in T for every closed term tt of LωL_{\infty\omega}
  10. Its inverse: ¬x.ϕ(x)T\lnot\forall x\ldotp\phi(x)\in T implies ¬ϕ(t)T\lnot\phi(t)\in T for some closed term tt of LωL_{\infty\omega}
  11. Closure under \exists: x.ϕ(x)T\exists x\ldotp\phi(x)\in T implies ϕ(t)T\phi(t)\in T for some closed term tt of LωL_{\infty\omega}
  12. Its inverse: ¬x.ϕ(x)T\lnot\exists x\ldotp\phi(x)\in T implies ¬ϕ(t)T\lnot\phi(t)\in T for every closed term tt of LωL_{\infty\omega}

The motivation for these properties: #1 and #4 encode negation, #2 and #3 encode ==-closure, #5/#6 and #7/#8 encode \lor and \land, #9/#10 and #11/#12 encode \forall and \exists. These properties are known as consistency properties of TT, because they ensure that TT is consistent even under the various properties of our logical connectives. Such theories TT satisfying consistency properties are called Hintikka sets (for LωL_{\infty\omega}).

The 8th property (¬ΦT\lnot\bigvee\Phi\in T implies ψΦ.¬ψT\forall\psi\in\Phi\ldotp\lnot\psi\in T) is more commonly known as the Henkin property, which is special for reasons we’ll get to soon.

But why do we care about this specific assortment of properties? And why are these properties enough to encode everything we need about their respective operations?

Hopefully that clears up the motivation for Hintikka sets; now we can say:

(Theorem 2.3.3) If TT is a Hintikka set for LL, the canonical model for (atomic sentences in TT) models TT.

Theorem 2.3.3 implies that any theory TT that can be extended to a Hintikka set has a model (the canonical model). “Just extend it to a Hintikka set” is a lot to ask since there’s so many properties to fulfill. Luckily, all the properties above are true if the following smaller set of properties are true:

(Theorem 2.3.4) If

…then TT is a Hintikka set for LL.

If a theory TT can be extended to get all three properties, then the extension is a Hintikka set, and therefore TT has a model. Proof that these three properties imply a Hintikka set:

Two of the consistency properties follow immediately:

To derive the remaining consistency properties quickly, we first prove three short lemmas:

Since the majority of our consistency properties are in the form ϕTψT\phi\in T\to\psi\in T where {ϕ}ψ\{\phi\}\tt\psi, we can use the (finite strong completeness corollary) to immediately prove them.

We’ve proved 10 out of 12 properties; the remaining ones are closure under \lor and the inverse of closure under \land. They both require showing some ψΦ\psi\in\Phi exists in TT without specifying which ψ\psi. Since we can’t derive (\tt) this unknown ψ\psi, we can’t use (finite strong completeness) here. Instead, we’ll prove by contradiction – not having the property lets us construct an inconsistent finite subset of TT, violating (finite consistency).

Theorem 2.3.4 tells you a set of minimum requirements to make an extension of TT into a Hintikka set: every finite subset must have a model, it must be a complete theory, and it must be closed under \exists (Henkin property). If you can extend TT with these properties, you have a model for TT. We will use this in the next section.

In this section, we show the absolute minimum requirement for a first-order theory to have a canonical model. (Compactness theorem)

Previously, we found that not all first-order theories have a model, but certain extensions of atomic sentences as first-order theories do have a model (the canonical model) if they can be extended to get a Hintikka set: specifically, if (1) every finite subset of TT has a model, and TT is (2) complete and (3) closed under \exists, then it has a model.

The following theorem shows that (2) and (3) are unnecessary; (1) alone is enough to find a model for first-order TT, again by taking the canonical model of a larger (cleverly-crafted) theory. This fundamental result of model theory is known as the compactness theorem for first-order logic, and is the more general way to find when a given first-order theory has a model.

The compactness theorem: If every finite subset of TT has a model, then TT has a model. (where TT is a theory in a first-order language LL)

Proof.

Then TκT_\kappa (the theory obtained after considering every sentence ϕi\phi_i) will achieve the above three requirements of being a Hintikka set by ensuring

  1. every finite subset of each TiT_i has a model. Then TκT_\kappa satisfies the first requirement. We can ensure this if, when we try to add each ϕi\phi_i, we only actually add ϕi\phi_i if every finite subset of the resulting Ti+1T_{i+1} has a model. Otherwise, Ti+1=TiT_{i+1}=T_i.
  2. either ϕi\phi_i or ϕj\phi_j is in TκT_\kappa (where ϕj=¬ϕi\phi_j=\lnot\phi_i).
  1. every x.ψ(x)\exists x\ldotp\psi(x) in TκT_\kappa implies ψ(t)\psi(t) in TκT_\kappa for some tt in L+L^+

Finally, we’ve constructed some TκT_\kappa that is a Hintikka set of L+L^+ by virtue of satsifying the three conditions from the previous section. Therefore it has a model (the canonical model of its atomic sentences). Then TκT_\kappa includes TT by its construction (a chain on top of TT), so we can take the LL-reduct of the model of TκT_\kappa to get a model of the original theory TT in LL.

There are many many proofs of the compactness theorem, and this is just one of them, called the Henkin construction. For more proofs of the compactness theorem, see: https://mathoverflow.net/a/45501

The converse of the compactness theorem is quite trivial. If a theory TT has a model MM, then MM models every finite subset of TT since they are subsets of TT.

TT is satisfiable” is another way to say “TT has a model”. So the compactness theorem can also be stated “TT is satisfiable if and only if TT is finitely satisfiable.”

In the end we’ve found the minimum requirement for TT to have a canonical model – we just need to prove that every finite subset of TT has a model, which is often done by construction.

< Back to category Exploration 1: The canonical model of a theory (permalink)
Intro to model theory Exploration 2: Types