There are multiple definitions of logical independence out there in the literature. In this article, we tease out these definitions and investigate their relationships with each other.
The definitions we use are the following:
Definition 1. Let be a language,
be an axiomatic system in
, and
a statement in
. Assume
is consistent. We say that
is independent of
if both
and
are consistent.
Definition 2. Let be a language,
be an axiomatic system in
, and
a statement in
. Assume
is consistent. We say that
is independent from
if there exists no sequence of logic from
that can prove or refute
.
Wikipedia lists some definitions as ignoring the possibility of refutation; we consider that to be against the common informal meaning of independence and so we discard such definitions.
If is independent of
by Def 1, then
is consistent, so we cannot prove
in
. But also
is consistent, so we cannot prove
in
. Thus,
is independent of
by Def 2. Thus, Def 1 implies Def 2.
Conversely, if is not independent of
by Def 1, then either
or
is inconsistent. Assume for the first case that
is inconsistent. But since
is consistent, this implies that
is (provably) false in
. (In symbols,
where
means false, so the contrapositive yields
, and in
this yields
.) Similarly, for the second case,
being inconsistent implies that
is (provably) true in
. Either way,
is not independent by Def 2. Thus, the negation of Def 1 implies the negation of Def 2, so Def 2 implies Def 1.
Hence, these two meanings are equivalent. Note that this works regardless of proof systems: whatever proof system we choose for the second definition is also taken as the proof system for the first definition. The proof above uses proof by contradiction (which in turn is contrapositive logic) and other propositional logic principles, so any proof system that includes such principles would allow a proof of Def 1 from Def 2 within it. We would not consider a “proof system” to be a valid proof system if it didn’t include propositional logic principles at least — these are obvious and basic principles for human and objectively valid logical reasoning, and they are necessary to begin writing meta-proofs to study proof theory in the first place. We consider such logical principles to always be there regardless of proof system and to be a prerequisite to any mathematical or logical discussion.
Thus, it’s fine to use either definition, as Wikipedia does (see the Wikipedia article on logical independence, Independence (mathematical logic) – Wikipedia.)
Logical Independence of Theories
OK, now let’s tackle another question. There seem to be multiple meanings of a theory itself being independent, in terms of its axioms being independent of each other. What are these meanings?
Definition. An axiomatic system is independent of 1st kind if every axiom in
is independent of all the others: for every
,
is independent of
.
Personally, I don’t know if I would consider that to be “fully” independent; at least as a definition, I would define:
Definition. An axiomatic system is independent of 2nd kind if every axiom is independent from every other subset of axioms. There are multiple possible formulations I can do of this:
- For every subset
and every statement
,
is independent of
- For every
and every subset
,
is independent of
Are these equivalent (just for the second definition)? Is choosing and
equivalent to choosing
and
?
Yes, it is. If , then
can’t exist, so that’s a null condition and can be thrown away. Otherwise,
is a proper subset, and choosing
guarantees that both
and
. Conversely, if we choose
and
, then this also means
and
. Thus, these two formulations of the concept of “independent of 2nd kind” are equivalent.
Are these two definitions (1st kind and 2nd kind) equivalent? And how does this relate to Wikipedia’s assertion that though GCH is independent of ZFC, in fact ZF + GCH implies AC? (Here, GCH stands for generalized continuum hypothesis, and the other acronyms stand for their standard meanings.)
Well, would be independent in either of these definitions? The fact that
implies that AC is not independent of the theory minus AC (which is
); thus, this wouldn’t be independent of the 1st kind, let alone the 2nd. (The 2nd kind is clearly at least stronger than the 1st kind.) Thus, what this really says is: if
is independent of
, that doesn’t mean necessarily that
is an independent theory. There’s more being asserted for a theory to be independent.
I highly suspect the 1st and 2nd kind definitions to be equivalent, which case we can just talk about a theory being independent regardless and it doesn’t matter. We need to show that the 1st kind implies the 2nd kind, so let’s do that.
First, note that if is independent of
then
is independent of any subset of
, since if
weren’t independent of some subset of
then it would be provable or refutable from that subset, and therefore provable or refutable from
overall. Now, assume independence of the 1st kind, and take
and a subset
. Since
is independent of
by assumption of 1st kind independence,
is independent of
. We are done, this proves that both definitions are equivalent.
First written March 31, 2022. When I first wrote this, I thought that the term “undecidable” was also commonly used as a synonym for “independent,” so I viewed the standard terminology as potentially ambiguous. Specifically, I used the term “undecidable” for the second definition of independence (for an individual statement, not a theory.) I later concluded that that was really my “head-canon definition” based on my intuitive understanding of the term “undecidable”; the term “undecidable” in standard usage is only ever applied to the existence of an algorithm to a decision problem, and thus the usage of the term in proof theory is just based on the “most natural” decision problem for proof theory, of whether or not we can generate a proof. Hence, there is no ambiguity, and undecidability is a separate concept from independence.
