In this post, we attempt to better understand the questions posed in this post and its follow-up by formalizing them generally in the language of universal algebra.
Since finite and infinite seem to feel different, let’s restrict this formulation to infinite sets.
Problem. For which universal algebraic signatures and sets of
-axioms
is it the case that for any infinite set
there is a collection of operations of type
on
satisfying
?
Clearly, if is empty then this is always possible. So this seems to be conditioned on the axioms in
. But actually, why would it? I think this would be doable for any
.
Upon further thought, this seems to be particularly amenable to Godel’s Completeness Theorem. Assume the axioms in are all first-order, except no relations, only operations. (This can still involve existence, negation/inequations, conditional implication, etc.) Then, Godel’s Completeness Theorem states that, if
is a first-order sentence (in the language of
) such that any model of
satisfies
, then there is a proof of
from
. Can we choose
in a clever way?
Actually, how would existence of proof help us? We want existence of model. So maybe we’d like to go the opposite direction (which would then be obvious though, no need for Godel’s theorem.) We cleverly produce as a consequence of
, then we are given that
must be satisfied in any model of
. If we can show that there is a model of
, then can we show that there is a model of
? What we’d further like to show is: given
(which we assume to be consistent), there is a model of
of any infinite cardinality.
So this is the question, then. Given a set of consistent axioms , there is a model of
of any infinite cardinality. This seems to boil down to some fundamental mathematical logic arguments. In fact, in some sense, it would be intuitively suggestive for the consistency of a set of axioms to be equivalent to the existence of a model for them; clearly, if they are inconsistent, then there is no model for them. Is this true?
If a set of axioms is consistent, then must there be a model
for them? Must there be a model
for them of any infinite cardinality?
I suspect we can do something similar to the free object construction to construct a model for of any cardinality. Let’s review how we’d do this with a motivating example.
Let our axiom be cancellation: one operation and . How do we construct a model
of this axiom for any cardinality?
Well, we can basically consider “words” of variables, with words being equivalent if and only if they can be proven to be equal under the axioms. The set of equivalence classes of these words is then our free object, if I understand correctly.
So let’s be general again, with a set of axioms (that are consistent.) let
be a set of variables, of any infinite cardinality. We define a set
such that, for any
, we have
Then, we define an equivalence relation on by: two words are equal if and only if they can be proven equal under the axioms
. Is this guaranteed an equivalence relation?
- Every word is automatically
-provably equal to itself, assuming
is consistent.
- If a word
is
-provably equal to a word
, then by basic property of equality (nothing else needed from
) this also yields that
is
-provably equal to
.
- Transitivity follows similarly to point 2, based on the fact that proofs from
can involve basic logical steps like transitivity of equality.
Thus, we can form the set of equivalence classes, which we call . This set is what we want; can we guarantee that it is in bijection with
? If we can, then we’re done.
As an aside, it seems like we haven’t really used the consistency of here in a fundamental way — what happens if
is inconsistent? If
is inconsistent, then anything can be proven from it (Principle of Explosion.) Thus, then any two words can be proven equal, but also any two words can be proven unequal … so none of this logic works then. So we do need
consistent, in a fundamental way, to produce
.
Can we now show that is in bijection with
? Would this depend somehow on the axioms in
? (Would reduction by axioms show that
would have to be smaller?)
Actually, let’s look at Free object – Wikipedia. I implicitly assumed associativity, but my idea was valid in comparison to how the free object is typically constructed. The main thing we’d want to prove then is: for any infinite set and any set of axioms
(what the article refers to as “defining relations”), the free object for
over
is in bijection with
. If we can show that, then we’re done.
Let’s have a trivial example: what if consisted of the one axiom
? Then
would just be
. So it’s very possible to have
such that
is small.
OK, can we at least develop general equivalent conditions on for
? And is this equivalent to
being imposable on any infinite set?
Let’s try some more examples. Let have the two axioms
and
. Then,
would consist of arbitrary-length words with no two same variables next to each other. Since each element of
is a word on its own, we have that
. If we can show
, we’re done. But we actually already know that this
is satisfiable for any cardinality, since these are part of the Boolean ring axioms and we know that any infinite set can be turned into a Boolean ring.
Do we know that: if the free object for over
doesn’t biject with
, then there exist infinite sets that can’t be turned into models of
? Can we prove this?
So assume that there exists infinite such that the free object for
over
doesn’t biject with
. We conjecture that there exists an infinite set that can’t be turned into a model of
. To prove this, assume for the sake of contradiction that every infinite set can be turned into a model of
. In particular,
can be turned into a model of
. Since
is a model of
, we have …
Let’s try some special cases to see if this conjecture holds up. For the axiom above, can any infinite set be turned into a model of it? No, it can’t, because it implies that the max number of elements is 1.
OK, how about the axiom ? Let’s denote the free object over a set
by
; we can include the
as a subscript to be explicit if needed, as in
. So with this axiom, what does
look like? Well, everything disappears except the first character in the word, so this really is just
. Certainly, then,
. Can we impose this axiom on any infinite set? (This is actually clear, what we’re trying to build evidence for is the converse, but just a quick sanity check.) Let
be an infinite set, and define the operation
by
. Then, we always have
with this operation. So this is trivially satisfiable.
From these examples, I have an even stronger conjecture: given a set of axioms and a set
,
can be turned into a model of
if and only if
. This would pinpoint exact cardinalities of sets that can be turned into models of
, not just whether or not
can be imposed on everything infinite.
We can establish one direction clearly: assume we have and
, and
, then we know that
is a model of
by construction and so we can use the bijection to turn
into a model of
.
Let’s actually try the other direction: assume we have and
, and
can be turned into a model of
. We also have that
is a model of
, by construction. Can we show that
?
Would this work for finite sets ? Let’s try a toy example:
and
is the set of group axioms. Clearly
can be turned into a model of
(via
), but what is
? If we can show it contains more than two elements then we’d disprove this direction for finite
. But of course it does: since no relations are imposed on
we have
at least as elements of the free object. Thus, this direction can’t be true in general for finite
. But we could still believe in this direction for infinite
, since bijection is different for finite versus infinite sets.
Let’s try to prove this direction for infinite .
We split into cases. In the first case, assume that does not contain the statement
as a provable implication (whether as an axiom in
or as a consequence.) Then, from how first-order logic is constructed, we know that each element of
would be a separate “character” in
, so we have
. (It remains to continue this part.)
In the second case, assume that does contain as a provable implication
. Then, any model of
must have at most one element, so by the assumption that
can be turned into a model of
we must have
. But we assumed
was infinite, contradiction, so this case is invalid. (Incidentally, in this case the direction is trivially true for any
including finite: we also have
, so if
is empty, then
is empty (by definition of free object), and if
has one element then
has one element, so
.)
—
Let’s go back to the infinite field conjecture. To show that, we just need to show that for infinite
and
the set of field axioms. (This doesn’t depend on whether the converse direction turns out to be true above.) So let
be any infinite set, and consider
with
the set of field axioms. What is
?
At the very least, must contain each element of
as a “separate character,” which would each be a one-character word. Thus,
. If we can show that
, we’d be done. So we want to exhibit an injection
.
Since this doesn’t work for finite we’d need to use the infinitude of
somewhere. But that just seems to get us back in the same place as before with considering infinite cardinalities …
It remains to continue this.
Written February 22, 2023.
—
EDIT (03/27/2023): I’ve just read about the Löwenheim-Skolem Theorem. This basically achieved everything I was hoping was possible. This completely resolves my question about fields, as well as many other such analogous questions, in what I consider to be the best way possible. “Incredible” doesn’t begin to describe this theorem.
Sidenote: I’ve just heard that there is such a thing as a “real-closed ring,” which may not be a field. I’m confused about this. Aren’t the field axioms first-order? Meaning that any real-closed ring would have to satisfy them, and thus be a field? I went through the field axioms just to check that they are first-order; it is well-known that fields aren’t an equational class, but certainly all the axioms seem at least first-order to me? What am I missing here?
Actually, I got it. Fields are in fact a first-order theory. The Wikipedia article on real-closed rings states that the name is given by an extension of a certain characterization of real-closed fields; elementary equivalence to the real numbers is only one such characterization, and that one isn’t extendable beyond fields.
