In this post, we continue the discussion from the previous in the series.
We make some additions. First, it is clear that if an equational identity is true for a group , then it must be true for a subgroup
of
— by definition, the identity must be of the form
, and if this holds with any combination of the
taken from
then it certainly holds with any combination of the
taken from a subset of
, including a subgroup. This is generally true: for any algebra
, if an equation
is true of
, then
is true of any subset
of
.
This actually shows that: the class of all groups is the equational wide closure of the class of all symmetry groups. In other words, we can alternatively define a group as follows:
Definition. For any set , let
denote the set of permutations of
with the operations of composition, identity, and inverse. Let
be the set of all equations that hold for all such
over all sets
. Then, a group can be defined as any model of
.
In particular, the three equations consisting of the associative property, existence of identity, and existence of inverse are enough to generate all of — in the language used in the Wikipedia article on abstract Boolean algebras, these three equations are a basis. This inspires similar questions: given our kinds of characterizations of structures that result from the concepts of narrow and wide closures, we can ask about finite or even numerically “small” bases for the relevant sets of equations or first-order statements.
Note that our logic doesn’t show that a group can be defined as a model of generated from the
that only include the operation of composition, excluding identity and inverse, since we haven’t seen that a group could be defined with purely equations involving composition (multiplication.) Actually, is that really true though? Let’s step through our logic a bit more slowly. Let’s ignore the associativity, existence of identity, and existence of inverse properties completely. (By the way, whatever we do here can shed insight on our question on dependence in general on which operations are included in the signature.)
What does Cayley’s Theorem say? Let’s say our first-order language is just multiplication, . Let
be the class of subgroups of symmetry groups, as models of
(so including just multiplication.) Then, Cayley’s Theorem says that any group
must be isomorphic to a member of
— with isomorphism defined with respect to just multiplication. However, we know that
for a group implies
and
. So this implication is what will make the translation into a different signature possible. For example, for rings this implication doesn’t hold (it is possible for
and
but
. As an example, take
with
.) Going back to groups, the isomorphism for just multiplication implies that any first-order statement in
will be true of one group if and only if it is true of the other (when both are isomorphic.) But the fact that a just-multiplication isomorphism also preserves identity and inverse then implies that any first-order statement in
will be true of one group if and only if it is true of the other. The set of first-order statements in the second statement is a superset of that in the first statement, so this is a great improvement!
So then let now stand for the class of all symmetry groups, as models of
(all three operations.) Cayley’s Theorem implies isomorphism with respect to this expanded language even though it just states isomorphism with respect to the smaller language
, and the isomorphism for the expanded language shows that any equation in the expanded language true for all symmetry groups is also true for all groups. Furthermore, some equations in the expanded language true for all symmetry groups include the three properties of associativity, existence of identity, and existence of inverse, so we have that the class of groups is the class of all structures that satisfy every equation in the expanded language true for all symmetry groups.
This last part is what can break down with an analogous statement for just multiplication. Clearly, we have that isomorphism with respect to implies isomorphism with respect to just
. Thus, any equation in just
true for all symmetry groups is also true for all groups. This implies that the class of all groups is a subclass of the wide closure of the class of all symmetry groups with respect to
. But for it to be equal, our logic would need an equational axiomatization of a group with just multiplication. If such an axiomatization didn’t exist, then it’s possible that there could be structures that are not groups but that satisfy all equations in
true for all symmetry groups — intuitively, this set of equations could be smaller, so that the class of structures satisfying them could be larger.
So this really boils down to equivalent axiomatization, while taking care to respect different kinds of statements (different logics.) Let’s state this more precisely in general:
Statement. Let be a logic that is a subset of first-order logic. (So
could be first-order logic itself, or equational logic, or something else.) Let
be a first-order language (so that
in particular applies to
.) Let
be a class of first-order models of
. Then, let
be the
-wide closure of
with respect to
. If
can be equivalently axiomatized in first-order logic by a different first-order language
, but not in the logic of
, then the
-wide closure of
with respect to
may be different from
(at least, our reasoning doesn’t show that they’re the same.)
Let’s further try a counterexample to hammer home this point. Let’s use rings. But first actually, let’s update our notation: we see that narrow and wide closures are dependent on both the first-order language as well as the logic. So especially in the presence of possible confusion, we should introduce terminology that adequately clarifies both. (I’m not going to do anything now with logics that aren’t subsets of first-order logic.) Let’s say that given a first-order language and a logic
, we can speak of the “
-language
-logic narrow/wide closure.” Thus, with this terminology, we can say definitively that: the class of all groups is the
-language equational-logic wide closure of the class of all symmetry groups. (Intuitively, this views groups as their underlying sets, where we then impose the language on them, as opposed to groups containing data about their language themselves.)
So, for our counterexample. Let be the class of all rings. We want to exhibit a subclass
such that
is the equational-logic
-language wide closure of
, but not the equational-logic
-language wide closure of
. Actually, this is just the same as what we said for groups: our reasoning would break down if we couldn’t find an equational axiomatization for rings without the multiplicative identity operation.
We’ve stated that “our reasoning can’t conclude X”; can we go further and say that without such an equivalent axiomatization, “X can’t be true”? In other words, can we show that: if there doesn’t exist an equational axiomatization for groups in just multiplication, then the equational wide closure in just multiplication must be strictly larger than the equational wide closure in the expanded language? Well, if these two wide closures were equal, then every equation true of all members of the “representative”
would be true of all groups, in just multiplication. But then the set of such equations
is literally the equational axiomatization of groups in just multiplication! So we’ve shown the contrapositive, which yields our desired result. We can generalize this reasoning:
Result. If is a class of models of
and also a class of models of
, then the following are equivalent:
- The
-logic wide closures of
in the languages
and
are equal;
- The
-logic wide closure of
in the language
can be equivalently axiomatized in the logic
and language
, and vice versa.
So determining whether the class of groups is the wide closure of the class of symmetry groups in just multiplication is the same as determining whether groups are an equational class under just multiplication. Can we axiomatize groups via just equations in multiplication?
From Working On It (unm.edu) (which I referenced in the first post of the series), it looks like we can’t just do multiplication for equational axioms, we need inverse too. Of course, this is single equational axioms, but if the axiomatization doesn’t exist for just multiplication then we’d have an answer to our question.
To be fair, we could also ask about finite equational axiomatizations; in the reasoning above, the set of all equations true of the representative could very well be infinite.
Sidenote: the fact that we have the terminology of narrow and wide closures has allowed us to sunset the “representative” term, which is great; I like “narrow closure” and “wide closure” a lot more. Or maybe better terms could be “weaker closure” or “stronger closure.” (Although the appearance of “weak” and “strong” seem rampant throughout math, and therefore prone to ambiguity.)
Actually, is this result fully true? Did we miss something around the isomorphism implication that was necessary earlier?
Let’s step through our reasoning slowly to be sure. Let’s say that we have a logic , and we have a class
of models of a language
. Let
be the
-logic
-language wide closure of
(we explicitly say in the language of
, although the fact that we define
to consist of models of
makes this redundant here.) Assume that we can also express all of
as models of another language
.
First, let’s show (2) implies (1). Let’s say that we can find a set of
-logic statements in
that axiomatize
, or in other words that
is the class of all models of
that satisfy
. Let’s also say that there exists an alternative axiomatization of
in the language of
, or more formally, there exists a set
of
-logic statements in
such that
is the class of all models of
that satisfy
. (When we say “is,” we mean equality of underlying universal sets.) Then, let
be the
-language (
-logic) wide closure of
. We want to show that
. It remains to continue this.
We continue this discussion in a follow-up post.
