Sunday, April 18, 2010

My Muddled Musings about L.J. Cohen's Conjunction Paradox

Is there a paradox -- a meta-paradox? -- about the assumption that the product rule applies to burdens of proof and persuasion on the (multiple) elements of a legal claim? This assumption is the basis of L.J. Cohen's "conjunction paradox." See L.J, Cohen, The Probable and the Provable 58-62 (1977).
I wrote this -- substantially this -- to a good friend today:
Assume that legal liability depends on elements a, b, c & d. Assume that these four elements (or, in any event, 3 of these 4 elements) can be classified either as elements of a claim [assume a civil case] or as elements of an affirmative defense. (In theory affirmative defenses could be done away with altogether -- by making all elements elements of claims.)
Assume that claimant has the burden of persuasion (assume this is defined in terms of probabilities) of slightly more than p(.5) on each element of each claim and that respondent has the burden of persuasion of slightly more than p(.5) on the negation of each element of each affirmative defense.

If the product rule (not modified for dependencies) applies to legal claims (which are assumed to consist of elements) and also to affirmative defenses (which are also assumed to consist of elements) -- and why should not the product rule apply to affirmative defenses if it applies to claims?--, does it follow that if elements a, b & c rather than just elements a & b are elements of the claim, that the claimant's burden of persuasion for the entire claim (with three elements) is roughly .125 rather than .25 (with two elements) and does it also follow that the respondent's burden (on the entire affirmative defense) rises to roughly .5 (from .25)? Can this (logically) be?

Look at the problem this way. Assume the claim has two elements a & b. Assume the affirmative defense has elements c & d. If the claimant's burden of persuasion in this situation is _roughly_ .5 on each of the four elements, does it not follow that if an affirmative defense is "in play" (because properly raised by the pleadings or otherwise), the claimant's burden on all four elements -- the conjunction of those four elements -- is roughly .0625 whereas the respondent's burden on the affirmative defense (the conjunction of not-c and not-d) is roughly .25? Is there a paradox here, a paradox about the assumption that the product rule applies here?

Put it this (equivalent?) way: If the conjunction rule applies and all four elements are elements of the claim (and there is no affirmative defense), the respondent wins if it establishes that the probability of the negation of any one element is roughly (slightly more than) .5. However, if two of the four elements are part of an affirmative defense (and if, therefore, they are not elements of the claim), respondent wins by relying only on a showing of "high" probabilities (a probability slightly greater than .5) of the negation of each of the two elements -- i.e., by showing that the probability of the negation of each element of two elements is slightly more than .5. If so, how can it be (on one horn of the conjunction paradox) that claimant's responsibility is to show (at least) that p(c & d) is roughly .25 whereas respondent must show that the probability of p(~c & ~d) is roughly .25? Doesn't the rule against self-contradiction, or the inverse relationship between the probability of some thing X and the probability of some thing ~X and the convention that probability of some thing X and the probability of the complement of X must sum to one (1) mean that the probability of the subset (c & d) and the probability of the complementary subset (~c & ~d) must equal one? If so, is it logically incoherent, self-contradictory, to assume that the product rule applies to the problem of the burden of persuasion on a claim? (My basic argument remains unchanged if you assume that the burden of persuasion on a claim requires that probability of the entire claim be slightly more than .5 -- and that this means [the other horn of the conjunction paradox] that the probabilities of some elements of a claim must be more than .5, often [depending on the number of elements] much, much more than .5. But I think the counter-riddle I have tried to describe also arises here, though in a somewhat different guise. But don't ask me to explain!)

N.B. The conjunction paradox does not go away just because alternative factual scenarios (which have facts that instantiate or satisfy an element of a legal claim) can establish a legal claim: The paradox remains. (I once worked this out to my satisfaction. But, again, please don't ask me to do so here.) But my counter-paradox (meta-paradox?) also remains, I think.

But perhaps my argument (which definitely falls short of a proof!) -- perhaps my argument about the existence of meta-paradox has a logical flaw? It's entirely possible that it does!

&&&

Postscript: As the comments to this post show, my musings (see above) did not lead anywhere. What makes the conjunction paradox so agonizingly difficult is that one has the sense -- or, in any event, I do -- that the proper mathematical interpretation of the burden of persuasion on a claim (or an affirmative defense) is that the jury must be instructed (or the trial court must understand) that it can find for the claimant (or for the respondent on the basis of an affirmative defense) only if it finds (or the judge concludes) that the claimant (or respondent) has established the probability of the claim as a whole (or, in an appropriate situation) the affirmative defense) meets or exceeds the probability required by the applicable burden of persuasion -- and not just that each element of the claim (or affirmative defense) meets or exceeds the requisite probability. The difficulty with this interpretation is twofold: (i) it seems to be inconsistent with the way that the law usually expresses the burden of persuasion and (ii) if each element must be shown to the probability required by the legal burden of persuasion, and if the claim or affirmative defense has more than one element, the application of the product rule says (even when there is some dependence, but less than complete dependence, between the probabilities of the two elements) that the overall claim or affirmative defense must be established to a probability that exceeds the legally-stated burden of persuasion for the claim as a whole -- and, moreover, unless there is a very high degree of dependence among the probabilities of the elements of a claim or affirmative defense, if there are many elements in a claim or affirmative defense, the requisite level of probability for the claim or affirmative defense as a whole is much higher than the legally-stated burden of persuasion (much higher than, e.g., .5 in a civil case, if a probability of slightly more than .5 is the level of probability that must be shown for each element in the usual civil case).

If that's the case -- if, for example, in the usual civil case the trier of fact should uphold a claim or affirmative defense with numerous elements only if it finds that the probability of the claim or affirmative defense very substantially exceeds .5 -- does it follow that the jury or trial court should almost always find that the claim or an affirmative defense has not been sufficiently established? No, I think that does not follow at all -- because if the claimant or respondent has oodles of powerful evidence to support each element, the jury or the trial court should, of course, find that the claim as a whole [or the affirmative defense] has been sufficiently established, and it probably will do so. The high probability seemingly required for the claim [or affirmative defense] as a whole speaks only [if it does so] to the question of whether an investigation will uncover evidence that sufficiently establishes a claim or an affirmative defense. Moreover, whether an investigation will uncover sufficient evidence of numerous elements of a claim or an affirmative defense depends in part on the characteristics of the part of the world that is being investigated -- and this means it may or may not be very difficult to uncover evidence that will establish all of the various elements to a very high probability. Yes? No?
&&&

The dynamic evidence page
It's here: the law of evidence on Spindle Law. See also this post and this post.

10 comments:

Don Mathias said...

Peter - Some comments, with relevant passage in your posting indicated. (Please excuse my spelling of "defence", but that's how we do it here):

...
... and does it also follow that the respondent's burden (on the entire affirmative defense) rises to roughly .5 (from .25)? Can this (logically) be?

Claimant C has to show that it is more likely that all the elements existed than that they did not. You have only done half the work: the probability that they all existed is 0.125, but what is the probability that they did not? Let’s use different figures to make the point clearer. Suppose P(a) = P(b) = P(c) = 0.8; then the conjunction is 0.512. Since P(-a) = P(-b) = P(-c) = 0.2, the probability that two but not three of them existed is 0.8 x 0.8 x 0.2 = 0.128. C has proved his case because the ratio 0.512 to 0.128 is greater than 1.

... Is there a paradox here, a paradox about the assumption that the product rule applies here?

Again, let’s use figures that make the contrast clearer. P(a) = P(b) = P(c) = P(d) = 0.8; and so P(-c) = P(-d) = 0.2. C’s task is to show that P(a and b and c and d) is greater than P(-c and –d). That is, he has to show that 0.409 is greater than 0.04, which indeed it is: he wins again.

... the negation of any one element is roughly (slightly more than) .5.

Yep. Using my figures, but if P(d) is now 0.4: P(a and b and c and d) = 0.204, and P(a and b and c and –d) = 0.307; the ratio is less than 1 and C fails to prove his case.

... by showing that the probability of the negation of each element of two elements is slightly more than .5.

Nope. Using my figures again, but varied for c and d, so that if P(c) = P(d) = 0.4 and P(-c) = P(-d) = 0.6. If C only has to prove a and b (the affirmative defence being, as you say, not elements of the claim), where each have 0.8 probability, then C proves 0.64 and respondent proves 0.36. The ratio is greater than 1 and C wins.

If so, how can it be (on one horn of the conjunction paradox) that claimant's responsibility is to show (at least) that p(c & d) is roughly .25 whereas respondent must show that the probability of p(~c & ~d) is roughly .25?

When the fight is just about whether the affirmative defence is proved, there is nothing unusual in C having to rebut the defendant’s proof once the affirmative facts are more likely than not.

... the probability of the complementary subset (~c & ~d) must equal one?

Nope. Using my figures: P(a and b) = 0.8 x 0.8 = 0.64, and P(-a and –b) = 0.2 x 0.2 = 0.04: the probability of (a and b) plus the probability of (-a and –b) do not equal 1.

...

Unknown said...

Why is it that only law teachers who live south of the equator -- well, almost only -- are good at formal logic?

I'll have to work through Don's comment to see where my intuitions have led me astray and to see if I can or do disagree with anything he says.

Stay tuned. Perhaps I'll post a rely in a couple of days. (First come class preparation and doing my laundry.)

Unknown said...

Let me stick with p(approximately .5) -- the calculations are easier.

In the first iteration of the problem that I stated I tried to compare what the burdens of persuasion are under the product rule if one element material to liability is reclassified as an element of a claim rather than as an element of an affirmative defense. I assumed there are four elements in all, in the initial state two elements are part of a claim, and two elements are part of an affirmative defense. I then assumed that one of the two elements of the affirmative defense becomes an element of the claim. Claimant's "aggregate burden of persuasion" then becomes (at a minimum) (slightly more than) p(.125) rather than P(.25) [because claimant must show that each of its three elements has a probability of lightly more than .5]. We're agreed on that, yes? Are we also agreed that respondent's "aggregate burden of persuasion" on its affirmative defense becomes p(roughly .5) -- because now, to gain victory, respondent must only show that the probability of just one element is (slightly more than) .5.

But it now strikes me that the numbers I originally had in my head do not capture how burdens with respect to claims and affirmative defenses work.

As I was thinking about Don's comment, I at first thought that one possible criticism of my first formulation of the problem is that my first formulation ignores the fact that claimant's victory requires that the improbability of each of the elements classified as parts of an affirmative defense must nonetheless be very close to .5. Viewing the situation this way the "aggregate burden of persuasion" of the claimant is roughly .5 to the fourth power. But this conclusion is INCORRECT because, to win, claimant must, yes, establish (by showing a probability of at least .5) the elements that are part of its claim but it must invalidate only _one_ of the two elements of respondent's affirmative defense. Hence, if two elements are part of the claim and two elements are part of the affirmative defense, -- you can figure out the rest yourselves!

If two elements are part of the claim and two are part of the affirmative defense, respondent wins (i) if it shows that the improbability of either one of the claimant's elements is .5 or more OR (ii) if it shows that the probability of the 3d element (part of the affirmative defense) is more than .5 AND that the probability of the 4th (also part of the affirmative defense) is more than .5.

I think Don and I may agree on this much -- but I'm not entirely sure about that.

Having said all of this (the stuff above), now it's not apparent even to me why any of this should suggest that it is _incoherent_ to apply the product rule to burdens of persuasion on claims. So I'll move on (later) to the other iterations of the problem I (tried to) pose.

Unknown said...

The second iteration of the problem I originally posed was incorrect for a reason stated in my most recent comment here: Claimant does not have to establish that each of the two elements of the respondent's affirmative defense has a probability of less than .5. To counter the respondent's affirmative defense claimant must establish only that ONE of the two elements of the respondent's affirmative defense is less than .5.

So the second iteration of the problem I attempted to pose was also truly muddled, yes?

Hmmm, where does this leave me?

Unknown said...
This comment has been removed by the author.
Unknown said...

In the last iteration of my attempt to state a problem I think (I haven't worked out the details, and I think I won't) -- I think I made the basic mistake that the elements required for C's victory and the elements required for R's victory exhaust the possible combinations of the elements and their negations. That's not the case, of course. For example, assume all four elements are part of the C's claim. R wins if it shows that the probability of the negation of any one of those elements is .5. More broadly, there is no inconsistency in saying that P(a & b & c & d) and P(~a & ~ b & ~c & ~d) do not have to equal one: the latter probability is clearly not the complement of the first.

Darn! But I still think that thinking about affirmative defenses may show that there is a logical quirk in L.J. Cohen's conjunction paradox. It's back to the drawing board.

Don Mathias said...

There was an interesting exchange (you will probably recall) between Sir Richard Eggleston and Johnathan Cohen in the English journal [1980] Crim LR, especially at 678 and 747. The Pascalian (and Bayesean) vs the Baconian.

These debates tend to span the centuries.

Unknown said...

Yes, that exchange is a classic.

Unknown said...

Postscript: As the comments to this post show, my musings (see above) did not lead anywhere.

What makes the conjunction paradox so agonizingly difficult is that one has the sense -- or, in any event, I do -- that the proper mathematical interpretation of the burden of persuasion on a claim (or an affirmative defense) is that the jury must be instructed (or the trial court must understand) that it can find for the claimant (or for the respondent on the basis of an affirmative defense) only if it finds (or the judge concludes) that the claimant (or respondent) has established the probability of the claim as a whole (or, in an appropriate situation) the affirmative defense) meets or exceeds the probability required by the applicable burden of persuasion -- and not just that each element of the claim (or affirmative defense) meets or exceeds the requisite probability. The difficulty with this interpretation is twofold: (i) it seems to be inconsistent with the way that the law usually expresses the burden of persuasion and (ii) if each element must be shown to the probability required by the legal burden of persuasion, and if the claim or affirmative defense has more than one elements, the application of the product rule says (even when there is some dependence between the probabilities of the two elements) that the overall claim or affirmative defense must be established to a probability that exceeds the stated burden of persuasion for the claim as a whole -- and, moreover, unless there is a very high degree of dependence among the probabilities of the elements of a claim or affirmative defense, if there are many elements in a claim or affirmative defense, the requisite level of probability for the claim or affirmative defense as a whole is much higher than the probability "officially" seemingly required by legally-stated burden of persuasion(much higher than, e.g., .5 in a civil case, if a probability of slightly more than .5 is the level of probability that must be shown for each element in the usual civil case).

If that's the case -- if, for example, in the usual civil case the trier of fact should uphold a claim or affirmative defense with numerous elements only if it finds that the probability of the claim or affirmative defense very substantially exceeds .5 -- does it follow that the jury or trial court should find that the claim or defense has not been sufficiently established? No, I think that does not follow at all -- because if the claimant or respondent has oodles of powerful evidence to support each element, the jury or the trial court should, of course, find that the claim as a whole [or the affirmative defense] has been sufficiently established and it probably will do so. The high probability seemingly required for the claim [or affirmative defense] as a whole speaks only [if it does so] that an investigation will uncover evidence that sufficiently establishes a claim or an affirmative defense. Moreover, whether an investigation will uncover sufficient evidence of numerous elements of a claim or an affirmative defense depends in part on the characteristics of the part of the world that is being investigated -- and this means it may or may not be very difficult to uncover evidence that will establish all of the various elements to a very high probability. Yes? No?

Unknown said...

Footnote: If you believe (as I do) that all probabilities about factual hypotheses are conditional probabilities [e.g., P(element 1| evidence x)], then the product rule, if applied at all, should be applied to such conditional probabilities (and not to freestanding, or unconditional, probabilities [e.g. P(element 1)].