NaClhv: The two envelopes problem and its solution

A job I was looking at had a requirement that read: "Inability to stop thinking about the two envelopes problem unless you’ve truly come to peace with an explanation you can communicate to us". So I thought I'd post my explanation for the problem.

The setup to the problem goes like this:

You have two indistinguishable envelopes in front of you. They both contain money, but one envelope has twice as much money as the other.

You get to choose one of the envelopes to keep. Since the envelopes are indistinguishable, you have a 1/2 chance of having chosen the one with more money.

But now, after you've picked an envelope but before your choice becomes finalized, you are given the opportunity to switch to the other envelope. Should you make the switch?

Now, one sensible and easy reply is to say that you shouldn't bother. The envelopes are indistinguishable and you have no idea which one contains more money. Your chances of getting the bigger payout remain 50-50 regardless of your choice.

But now, a wild statistician appears, and makes the following argument:

"Let's say, for the sake of argument, that the envelope you have now contains $20. Then the other envelope might contain $40, or $10. Since these two possibilities are equally likely, your expectation value after switching would be half of their sum (0.5*$40 + 0.5*$10), or $25. That's 25% more than the $20 you have now.

But if we think about this more, the initial choice of $20 actually doesn't matter. You can make the same argument for any possible value of money in your envelope. You'll always gain 25% more on average by switching. So, even without knowing the amount of money in your envelope now, you should switch."

Impressed by the wild statistician's use of numbers and such, and figuring that even if he's wrong you would at worst break even, you decide to make the switch. But then, as you're about to finalize your decision and take the new envelope home, the statistician repeats exactly the same argument, word for word. "Let's say, for the sake of argument..." He's now urging you to switch BACK to your original envelope. After all, the two envelopes are indistinguishable. If there is a rational reason to switch the first time, the same reason must equally apply for switching the second time. But at this point, it becomes obvious that if you continued to listen to the wild statistician, you would do nothing but switch the two envelopes for all eternity.

That can't possibly be the right choice. Now, here is the real two envelopes problem: something must be wrong with the wild statistician's argument - but what exactly is the nature of his error?

The solution to the problem goes as follows:

If we start by assuming there's $20 in your envelope, it is NOT equally likely that the other envelope contains $40 or $10. This is where the wild statistician goes wrong. In general, given a value x in your current envelope, it is NOT equally likely for the other envelope to contain 2x or x/2.

Before we get more mathematical, let's examine the problem intuitively, by grounding it in a solid example. Say that you're on a television game show, and you're playing this two envelopes game. You know that American TV game shows typically give prizes from hundreds to tens of thousands of dollars. Now, if the host of the show lets you know that your envelope contains $50, should you switch? I certainly would. I know that, given the typical payout of TV shows, the two envelopes were more likely set up to contain $100 and $50 rather than $50 and $25. The two probabilities are NOT EQUAL.

On the other hand, imagine that you're a high school statistics student, and your teacher is playing this two envelopes game with you for a class lesson. Your envelope contains the same $50 as in the previous example. Should you make the switch? No way. You seriously think your teacher put $100 in the other envelope to give to a high school student, for a single lesson? If your teacher has 5 statistics classes, he stands to lose up to $500 on that one lesson - likely far exceeding his pay for the day. It is much more likely that your teacher chose $50 and $25 for the values rather than $100 and $50. Again, the two probabilities are NOT EQUAL.

Now, if the two probabilities were equal, then the wild statistician would be right, and you should switch. And you should continue to do so as long as the probabilities remained equal. But the problem described by that situation is not the two envelopes problem. It's actually a 50-50 bet where if you win, you double your money, but if you lose, you only lose half your money (compare that to most casino games, where you lose your entire bet). If you find a game like that, you should continue playing it for a very long time.
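
To make that side bet concrete, here is a minimal sketch in Python. The function name and the use of exact fractions are choices made for illustration only; they are not part of the original problem. It computes the expected holdings after one double-or-halve bet for a given chance of doubling.

```python
from fractions import Fraction


def expected_after_bet(stake, p_double=Fraction(1, 2)):
    """Expected holdings after one bet that doubles the stake with
    probability p_double and halves it otherwise."""
    stake = Fraction(stake)
    return p_double * 2 * stake + (1 - p_double) * stake / 2


# The wild statistician's case: $20 stake, assumed 50-50 odds.
print(expected_after_bet(20))

# The same bet when doubling is only half as likely as halving.
print(expected_after_bet(20, Fraction(1, 3)))
```

With a genuine 50-50 chance, the $20 stake is worth $25 on average after the bet, which is exactly the wild statistician's arithmetic. When the chance of doubling drops to 1/3, the bet is merely break-even, so everything hinges on whether 50-50 is the right probability.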

But for the two envelopes problem, the chances of either doubling or halving your money are generally not equal. This will be true for ANY reasonable probability distribution of possible values of money in the envelopes. "Reasonable" here means that the probability distribution must sum to one, and that it must have a finite expectation value. Consider any of the following probability distributions (or any other reasonable distribution you wish to think up) for the money in the envelopes:

The orange line is the probability distribution for the smaller amount of money in one of the envelopes. The green line is the probability distribution for double that value, in the other envelope - it's been stretched horizontally by 2 to represent the doubling, and compressed vertically by 0.5 to keep the probability normalized. You see that the two probabilities are equal (where the lines cross) only for very rare, special amounts of money. In general, if you see a small amount of money in your envelope, you're more likely to have the "smaller" of the two envelopes, and if you see lots of money, you're more likely to have the "greater" of the two. You should be able to understand this intuitively, in conjunction with the game show / statistics teacher examples given above.
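
As a rough numerical illustration of the stretch-and-compress picture, the sketch below assumes, purely hypothetically, that the smaller amount follows an exponential density with a mean of $100. Any other reasonable density could be substituted. The names `MEAN_SMALLER`, `f`, `g`, and `p_smaller_given` are invented for this example.

```python
import math

MEAN_SMALLER = 100.0  # hypothetical mean of the smaller amount, in dollars


def f(x):
    """Assumed density of the amount in the "smaller" envelope."""
    return math.exp(-x / MEAN_SMALLER) / MEAN_SMALLER


def g(x):
    """Density of the "greater" amount: f stretched by 2, compressed by 0.5."""
    return 0.5 * f(0.5 * x)


def p_smaller_given(x):
    """Probability that you hold the smaller envelope, given that it contains x."""
    return f(x) / (f(x) + g(x))


for x in (10, 50, 139, 400, 1000):
    print(x, round(p_smaller_given(x), 3))
```

Under this assumed density, the two curves cross at 2 * 100 * ln(2), or about $139. Below that amount you are more likely to be holding the smaller envelope, and above it the greater one.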

Whether you should switch or not depends on the expectation value of the money in the envelopes. If the amount in the "smaller" envelope is A, then the amount in the "greater" envelope would be 2A, and the expectation value for choosing them with 50-50 chance would simply be 3A/2. Since the envelopes are indistinguishable, this is in fact the expectation value of choosing either one, so it doesn't matter which one you choose. This is nothing more than the original, simple argument presented at the very beginning.
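
The 3A/2 claim can be checked with a quick simulation. This is only a sketch under stated assumptions: the smaller amount is drawn uniformly between $1 and $100, and the prior, the seed, and the function name `simulate` are all hypothetical choices made for illustration.

```python
import random


def simulate(trials=200_000, seed=0):
    """Average payout of always keeping versus always switching."""
    rng = random.Random(seed)
    keep_total = 0.0
    switch_total = 0.0
    for _ in range(trials):
        smaller = rng.uniform(1.0, 100.0)  # hypothetical prior for A
        envelopes = [smaller, 2.0 * smaller]
        pick = rng.randrange(2)  # indistinguishable envelopes: 50-50 pick
        keep_total += envelopes[pick]
        switch_total += envelopes[1 - pick]
    return keep_total / trials, switch_total / trials


keep_mean, switch_mean = simulate()
print(keep_mean, switch_mean)
```

Both averages should land near 1.5 times the mean of the smaller amount (about $75.75 under this prior), with the small gap between them being sampling noise.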

However, what if the wild statistician insists on putting the problem in terms of expected gain conditioned on the different possible values of the money in your current envelope? This is how his original flawed argument was framed. It's an overly complicated way of thinking about the problem, but shouldn't we also be able to come to the correct solution this way?

We can. (Beware, calculus ahead) Let:

x = amount of money in your current envelope,
f(x) = probability distribution of the money in the "lesser" envelope, and
g(x) = probability distribution of the money in the "greater" envelope.

Then f(x) can be completely general, but g(x) = 0.5 f(0.5x) due to the stretch/compression transformations. Also, the overall distribution for the amount in your current envelope, given that you chose one of the two envelopes with equal chance, is:

p(x) = 0.5( f(x) + g(x) ).

Then, the expectation value for switching is given by the following integral:

Expectation value for switching = ∫ E(x) p(x) dx

Where E(x) is the expectation value of switching when the money in your current envelope is x. This is given by:

E(x) = x * p("smaller" envelope|x) - 0.5x * p("greater" envelope|x)

That is to say, upon switching, you'll gain x if you currently have the "smaller" envelope, but lose 0.5x if you currently have the "greater" envelope. Furthermore, the p("smaller" envelope|x) and p("greater" envelope|x) values can easily be calculated by the definition of conditional probability as follows,

p("smaller" envelope|x) = 0.5 f(x) / p(x),
p("greater" envelope|x) = 0.5 g(x) / p(x)

noting that the numerator corresponds to getting a specific envelope AND a specific x value.

Putting this all together, we get:

Expectation value for switching = ∫ E(x) p(x) dx =

∫ (x * 0.5 f(x)/p(x) - 0.5x * 0.5 g(x)/p(x)) p(x) dx = 0.5 ∫ (x * f(x) - 0.5x * g(x)) dx =
0.5 ( ∫ x f(x) dx - ∫ 0.5x g(x) dx )

However,

∫ 0.5x g(x) dx = ∫ 0.5x 0.5 f(0.5x) dx = ∫ 0.5x f(0.5x) 0.5dx = ∫ u f(u) du = ∫ x f(x) dx

Where we used a u-substitution and took advantage of the fact that the integral goes from 0 to infinity in the last two steps. Therefore:

Expectation value for switching = ∫ E(x) p(x) dx = 0.5 ( ∫ x f(x) dx - ∫ x f(x) dx ) = 0.5 * 0 = 0

So there is no expected gain or loss from switching, which is the same conclusion we reached at the very beginning.
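
The cancellation can also be checked numerically. The sketch below assumes a specific f(x), an exponential density with mean 1, which is a hypothetical choice and not something the derivation depends on. It approximates the two integrals with a simple midpoint rule; the helper `integrate` and its cutoff of 200 are likewise illustrative.

```python
import math


def f(x):
    """Assumed density of the "lesser" amount (exponential, mean 1)."""
    return math.exp(-x)


def g(x):
    """Density of the "greater" amount, g(x) = 0.5 f(0.5x)."""
    return 0.5 * f(0.5 * x)


def integrate(func, upper=200.0, steps=200_000):
    """Midpoint-rule approximation of the integral of func from 0 to upper."""
    h = upper / steps
    return h * sum(func((i + 0.5) * h) for i in range(steps))


lesser_term = integrate(lambda x: x * f(x))          # ∫ x f(x) dx
greater_term = integrate(lambda x: 0.5 * x * g(x))   # ∫ 0.5x g(x) dx
expected_gain = 0.5 * (lesser_term - greater_term)
print(lesser_term, greater_term, expected_gain)
```

The two terms agree up to numerical error, so the expected gain from switching comes out as zero within that error, matching the u-substitution argument.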

61 comments :

JeffJo, 12/1/15, 11:32 AM
Consider two possible solutions:

1) Assume that your envelope contains $A, and that the other is equally likely to contain either $2A, or $A/2. So the expected value of the other is ($2A)/2+($A/2)/2 = $5A/4.

2) Assume that the two envelopes combined contain $T, and it is equally likely that yours contains $T/3 and the other contains $2T/3, or yours contains $2T/3 and the other contains $T/3. The expected value of yours is ($T/3)/2+($2T/3)/2 = $T/2, and expected value of the other is ($2T/3)/2+($T/3)/2 = $T/2.

The math in these two solutions is identical, yet they get different answers. So at least one (to be completely unbiased) must be wrong.

Since the only differences are in the assumptions made to set up the math, one must make a bad assumption. The assumptions about the values ($A and either $2A or $A/2, and split between $T/3 and $2T/3) are trivially correct. The only other assumption is "equally likely." It must be wrong in at least one of the solutions.

Your reasons for claiming it is wrong in the first solution sound reasonable, but because they are based on subjectivity, they are not provable. There is an argument that is provable. Money comes in integer multiples of some base unit; call it $1 here. If A=1, it must be the smaller value; that is, the other envelope contains $2 with 100% probability. Even if you could arrange distributions where $2A and $A/2 are equally likely at other values, this counterexample proves the assumption to be incorrect in general. So solution #1 is wrong.

But the assumption of equal likelihood is trivially correct in solution #2.

JeffJo, 12/8/15, 5:25 AM
Please, actually read and understand the post you're commenting on.

You had simply asserted that "If we start by assuming there's $20 in your envelope, it is NOT equally likely that the other envelope contains $40 or $10." And you never proved it (keep reading). You supported this assertion with subjective claims which, because of the subjectivity, a casual reader might doubt.

My "motivating scenario" was only to prove that it is true for that casual reader. That was all I was trying to do; avoid the mere assertion of this truth, and demonstrate it instead. Not to address what I suspected to be, and turns out to be, an incorrect (again, keep reading) analysis that I admit I had only skimmed, but was willing to let pass.

Yes, incorrect. It may surprise you to know that there are distributions where, if you assume a value x in your envelope, the expected value in the other envelope is greater. That's why I suspected an error. Look it up on Wikipedia. The reason it is not a paradox is because the expected value of x, before you assume its value, is infinite. So the question "Should I switch?" asked before you know x, is comparing two infinite values and cannot be answered. Asked after, it theoretically allows the envelope to contain more money than what exists.
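
The comment does not say which distribution it has in mind. A commonly cited one (assumed here, not taken from the comment) puts $2^n and $2^(n+1) in the envelopes with probability 2^n / 3^(n+1) for n = 0, 1, 2, ... The sketch below, with invented function names, computes the conditional expectation of the other envelope exactly and shows that the unconditional mean keeps growing as more terms are added.

```python
from fractions import Fraction


def pair_probability(n):
    """P(the envelopes hold 2**n and 2**(n+1)), for n = 0, 1, 2, ..."""
    return Fraction(2**n, 3 ** (n + 1))


def expected_other_given(n):
    """Expected content of the other envelope when yours holds 2**n."""
    x = Fraction(2**n)
    if n == 0:
        return 2 * x  # $1 can only be the smaller amount
    as_smaller = pair_probability(n)      # pair (2**n, 2**(n+1))
    as_greater = pair_probability(n - 1)  # pair (2**(n-1), 2**n)
    p_smaller = as_smaller / (as_smaller + as_greater)
    return p_smaller * 2 * x + (1 - p_smaller) * x / 2


def partial_mean_of_smaller(terms):
    """Partial sum of the expected smaller amount over the first `terms` pairs."""
    return sum(2**n * pair_probability(n) for n in range(terms))


for n in range(4):
    print(2**n, expected_other_given(n))
print(float(partial_mean_of_smaller(10)), float(partial_mean_of_smaller(40)))
```

For every observed amount above $1, the other envelope is worth 11/10 of yours on average, yet the partial sums of the mean grow without bound, which is why the unconditional comparison is not well defined.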

The incorrect part is the "distribution" functions g(x) = 0.5 f(0.5x). Not true with discrete distributions, which really are what you should use. Since g describes the "greater" value, and f the "lesser," g(x)=f(0.5x). With continuous, it is true that the probability DENSITY functions satisfy your equations. But then your analysis is just the continuous version of the discrete analysis in my solution #2.

JeffJo, 12/15/15, 4:47 AM
‘Do you see where I said, "Before we get more mathematical, let's examine the problem intuitively, by grounding it in a solid example"?’

Yes. Did you see where I pointed out that this “solid example” was subjective, meaning that it is not as “solid” as you seem to think? If I’m guilty of anything, it is only that I didn’t make it clear in that reply that I wasn’t talking about your more explicit solution that used calculus. I did try to clarify that in my second reply – did you notice that I said “You are right, that it fails in the continuous case” and identified what I was talking about as “the subjective assessment of {$25,$50} vs. {$50,$100} on a game show” ?

Did you notice that THE MAIN POINT of my original comment was to provide a solid example in place of the one you gave? And how I have repeatedly pointed out that it was enough to satisfy the original thesis, of communicating to a job interviewer why there is no paradox?

Have you noticed that, since the problem uses currency as a metric, this problem can only use a discrete probability problem? And so that “solid example” I provided is actually mathematical proof that the two probability distributions cannot be equal? Have you noticed that extending the problem to a continuous distribution IS JUST AS “UNREASONABLE” as a discrete one where the expectation is infinite, since the gain can be infinitesimal?

Have you noticed that I pointed out that you can’t treat “the lesser amount” and “the greater amount” as independent random variables? Whether or not the result is the same, AND I FREELY ADMIT that I haven’t examined your analysis in depth (because there is no need to), you do not make this distinction in your math.

Finally, have you noticed that in my original reply I provided an explanation where it does not matter if the distribution is discrete, or continuous; finite, or infinite; reasonable, or unreasonable (which is why I have no desire to study your analysis)? How, if the total amount has the distribution t(x) (discrete or continuous), then the second random variable you need is just the choice? And that it is discrete, it is independent of anything else, and it has the equiprobable values {Low,High}? And so the expected gain by switching is (2x/3-x/3)*P(Low)+(x/3-2x/3)*P(High)=0?

Which result – yours with f(x) and g(x) and a long explanation, or mine with t(x) and a short one, do you think a job interviewer would be more impressed by?

JeffJo, 12/17/15, 7:06 AM
In your original reply, you said: “Your explanation … trivialized the problem.” You are right about one thing only here: my approach is more trivial than yours. But that could mean that you have over-complicated it, while mine is sufficient. If this is the case – as I keep asserting it is – then there is no need to go through your exercises. But I will go through a simplified version of them. On my terms.

You then added: “Essentially, your reasoning boils down to ‘solution 1 could be right, or solution 2 could be right. Since they disagree, and since solution 2 is right, solution 1 must be wrong.’” I also submit to you that this is all you have done, just with more complexity. AND IT REDUCES TO MINE.

I mentioned several times that you treat f(x) and g(x) as independent distributions, when they are not. You still haven’t addressed that (you did point out how to transform one to the other, but not why you can integrate over both with the same variable of integration). What you keep skipping over is that this is just a complication of my second approach, which you have not addressed. And it is the exercise I will present, although I have already said as much: The entire second part of your analysis can be replaced with:

Let T be a random variable representing the total amount in the two envelopes. Let f(t) be its distribution function – its range, reasonableness, and whether it is continuous or discrete-via-Dirac-delta, are completely irrelevant. Let C be a random variable representing your choice – it is discrete, and has the range c={Low,High}. The only assumptions needed are that C is independent of T and Pr(c=Low)=Pr(c=High)=1/2. I hope you will agree with that. Then, if you switch, you gain t/3 if c=Low, and gain –t/3 if c=High. The overly-complex form of the integral you use for the expected gain, but in my system, is:

esp(gain) = integral(t,[gain(c=Low)*Pr(c=Low)+gain(c=High)*Pr(c=High)]*f(t) ).

= integral( t, [(t/3)/2+(-t/3)/2]*f(t) )
= 0.

There is no need to actually integrate this. There is no need to make any assumptions about what the distribution of t is. There is no need to make any assumptions about its range. Or the expected value of t. Or whether it is reasonable, as opposed to a “wild statistician’s” imagination. There is no need to address how to handle possible dependence when you separate the one random variable into two. In fact, there is no need to go through any of the points you keep bringing up.
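
A minimal sketch of that point, with invented names and exact fractions chosen only for illustration: for any fixed total t, averaging the gain over the two equally likely choices already gives zero, before any distribution for t enters.

```python
from fractions import Fraction

HALF = Fraction(1, 2)  # Pr(c=Low) = Pr(c=High) = 1/2


def gain_if_switch(total, choice):
    """Gain from switching: +t/3 if you hold Low, -t/3 if you hold High."""
    total = Fraction(total)
    return total / 3 if choice == "Low" else -total / 3


def gain_averaged_over_choice(total):
    """The bracketed term of the integrand, for a fixed total t."""
    return sum(HALF * gain_if_switch(total, c) for c in ("Low", "High"))


for t in (3, 30, 999, Fraction(7, 2)):
    print(t, gain_averaged_over_choice(t))
```

Because the bracketed term is zero for every t, multiplying it by any f(t) and integrating (or summing) still gives zero.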

The expectation is zero. Trivially. Why are you arguing against this?

JeffJo, 12/19/15, 6:59 AM
"Have you still not understood that the point isn't to know that the wild statistician is wrong, but to figure out exactly what's wrong with his argument?" And have you not figured out that I did just that in my first post? That it isn't necessary to beat that horse until there is nothing left that is recognizable as equine, if all you want to know is that it is dead?

There are two approaches you mention: (A) Knowing THAT the "wild statistician" is wrong, and (B) knowing EXACTLY what the error was that made him wrong. But what you try to address is different, (C) how to make his error "exactly correct" and maybe provide a corrected version of his solution. That is not what you claim is the point.

There is a fourth approach, (D) Providing an alternate correct solution; but this does not address this issue you claim is the point. Without (B), all it does is suggest that there are two solutions that seem to be right. This is the paradox we are trying to dispel. With (B), it shows that the error is significant. (B) alone doesn't prove the answer is wrong. Accomplishing (D) with it does.

The "wild statistician" said "Since these two possibilities are equally likely..." This obviously is a statement that he justified to himself only superficially. Showing that it must be wrong, AS I DID even if it was only a corner case of any distribution, is sufficient for approach (B). Yes, I said (B), not (A) as you claim. It is the only possible error, and it is wrong, so (B) is satisfied. Every other part of his solution is 100%, undeniably correct. As is my solution #2 (aimed at approach (D)). So I addressed the point you claim needs to be addressed.

There is no need to address (C). First, we don't have enough information to do it, which is why you limited yourself to a few "reasonable" distributions. But that does not prove that an "unreasonable" one (btw, summing to one is not part of being "reasonable," it is the definition of "distribution") can't have the property "these two possibilities are equally likely."

What you call a "qualitative, intuitive answer" is still just a subjective opinion. So even though your answer is right, you failed to find what is "exactly correct" about it. And didn't even prove what was "exactly wrong" with the wild statistician's approach. You pointed out where you thought the error was, and found a "qualitative, intuitive" example of how it could be wrong, but you didn't prove it. I did, very simply. Any way you could put money into these envelopes has a minimum value, and so there is at least one point where the wild statistician must be wrong.

Do I also need to keep pointing out that the rest of your post is aimed at approach (D), and does not (again, IMHO) do it as well as my much simpler approach? And that it makes a claim that is just as superficial as the wild statistician's, when it uses the same variable (x) in two inter-dependent probability distribution functions?

This is why I never studied your approach (D) in depth, and have no intention to. Not because I believe the result is wrong - I don't - but because (1) there is an error in it, even if it is just in the labels you apply, and (2) there is a much simpler way to accomplish the exact same thing.

If you use just one random variable to represent value, there are no issues like the ones I have raised, whether or not they affect the result. And the function you integrate over IS IDENTICALLY ZERO everywhere you integrate it. Which is what I did. I "solved the problem" to use your words. Formally, completely, correctly, and much more simply than your approach. So again I ask, what part of the conclusions in this paragraph do you dispute? How is it inferior, in any way, to yours?

JeffJo, 12/20/15, 8:42 AM
Part 1

"You're at the stage where you ignore my replies and pretend that your claims still have validity." And from where I sit, that is where you have been in this entire non-discussion.

"I could … point out your errors … again." You don’t seem to understand that you have not pointed out a single error, only where you think my approach is inferior to yours. All you have said is things like "you restated the problem," or "you trivialize the problem based on a technicality," and then deferred any actual commentary to “later.”

I have a master’s degree in applied mathematics, so I do not need you to assign me "homework" to prove that I know as much as you think you do. I have pointed out actual errors in your work – and I admit they are mostly superficial – that show I know more. Things like "’Reasonable’ here means that the probability distribution must sum to one" when that is the definition of a distribution; and calling f(x) and g(x) distributions for what you imply are two different random variables ("lesser" and "greater") when they are values derived from the only random variable that you identify, x=the current value. Then you ignore the fact that what you actually did was the same thing I did – one random variable that defines both "lesser" and "greater" but not "current." You just did it in a far too complicated way.

Maybe, as your homework, you should try to explain why a continuous model of a discrete probability distribution is just that – a model. A way of looking at one thing in the paradigm of another. Then explain what makes them different – start with the probability of any specific value being zero in a true continuous distribution. Or explain how a job interviewer in the field of Mathematics might react differently to a response, than one in Computer Science. And then argue for which field this problem is better suited.

Through all of this, it is you who has “demonstrated an obstinate refusal” to address any of my points. You have not discussed any of them, you have summarily dismissed them as unworthy of your consideration. So again, I repeat the salient points:

JeffJo, 12/20/15, 8:43 AM
Part 2:

1) All of the math presented here – by me when I presented the two solutions, by the wild statistician, and by you, is essentially correct. Yours makes superficial errors in how it uses terms, but I never said it ended up being wrong.
2) The only significant error is the assumption that, once you call the value in your envelope "x," it is equally likely that the other envelope will have x/2 or 2*x. (See further explanation below.)
3) This can be trivially demonstrated to be an error, since a discrete distribution must have a minimum value. Realistically, it should have a maximum value also, but that is not provable.
4) Supposing a “wild statistician” who makes the subjective assessment that you are in the middle of this range is irrelevant, because the assumption may be true there. Similarly, finding distributions that you call "reasonable" is irrelevant. For example, if the envelopes could be {$10,$20}, {$20,$40}, … {$320,$640} with equal probability, then when you see anything between $20 and $320 it is indeed equally likely that the other envelope contains half, or twice, as much. So if you allow the distribution to be an open question, it is only at the endpoints that the chances must be unequal.
5) A continuous distribution that is not a model of a discrete one (thru dirac-delta functions) is irrelevant to the problem as stated. Using such a model for a discrete case does not change the fact that it is discrete.
6) The correct way to answer the question “should you switch” is to use one random variable to describe what the values are, and another to describe whether you have the high or the low one. You do this, even though you phrase it another way and refuse to admit it.
7) But you severely over-complicate the issue. The distribution of the "value" random variable is completely irrelevant, as is calculus. Even if you allow the distribution to be continuous. What you did was find that the value of the expectation is zero *after* the integration. What I did was find that the value you sum, or integrate, over is zero everywhere. What you did requires deriving some properties of the distribution. Mine does not.
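
The example in item 4 can be worked out exactly. The sketch below is illustrative only (the names are invented): it enumerates the six equally likely pairs, computes the chance that the other envelope holds double what you see, and then averages the conditional gain from switching over every amount you might see.

```python
from fractions import Fraction

# The six equally likely pairs: (10, 20), (20, 40), ..., (320, 640).
PAIRS = [(10 * 2**k, 20 * 2**k) for k in range(6)]
VALUES = sorted({v for pair in PAIRS for v in pair})


def p_other_is_double(seen):
    """P(other envelope holds 2 * seen | your envelope holds seen)."""
    as_smaller = sum(1 for low, high in PAIRS if low == seen)
    as_greater = sum(1 for low, high in PAIRS if high == seen)
    return Fraction(as_smaller, as_smaller + as_greater)


def p_seen(seen):
    """P(your envelope holds seen), with a 50-50 pick within each pair."""
    hits = sum((low == seen) + (high == seen) for low, high in PAIRS)
    return Fraction(hits, 2 * len(PAIRS))


def expected_gain_given(seen):
    """Expected gain from switching, given the amount you see."""
    p = p_other_is_double(seen)
    return p * seen + (1 - p) * Fraction(-seen, 2)


for v in VALUES:
    print(v, p_other_is_double(v), expected_gain_given(v))

overall_gain = sum(p_seen(v) * expected_gain_given(v) for v in VALUES)
print(overall_gain)
```

For every amount from $20 to $320 the chance is exactly 1/2 and switching gains a quarter of what you hold on average, but $10 always doubles and $640 always halves. The large certain loss at $640 offsets all the smaller gains, so the overall expected gain is zero.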

In item #2, the error is the misapplication of the Principle of Indifference. It’s what says that, if I say I roll an N-sided die, you should assume that each side has a 1/N probability. Even if I don’t say so. More formally, if N possibilities exist that are functionally equivalent based on the information you have, you should assign each the same probability. The trick is defining what “functionally equivalent” means. It isn’t true for the “other” value if you assume a value for x. I would have loved to discuss this with you, but you won’t discuss anything with me.

JeffJo, 12/24/15, 3:11 AM
I really didn't expect you to address any point I raised, but I had to try. Since you like exercises, I'll leave you with some simple ones:

1) Let X and Y be two continuous, independent random variables on 0<=x<=1, 0<=y<=1, with probability density functions f(x) and g(y). Let h(x,y) be a continuous function that is finite when 0<=x<=1, 0<=y<=1. Write an expression for the expected value of h.

2) How many integrations does it use?

3) How many does your "expectation for switching" use?

4) What does that say about the number of random variables you are using?
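
For reference, the standard expressions these exercises point at can be written as follows. This is a sketch using the symbols from exercise 1, plus a discrete variable C with probabilities Pr(C = c) that is assumed to be independent of X; it is not a reconstruction of either commenter's own working.

```latex
% Two independent continuous random variables: two integrations.
E[h(X, Y)] = \int_0^1 \int_0^1 h(x, y)\, f(x)\, g(y)\, dx\, dy

% One continuous and one independent discrete random variable:
% one integration and one summation.
E[h(X, C)] = \int \sum_{c} h(x, c)\, \Pr(C = c)\, f(x)\, dx
```

The count of integrations and summations matches the count of random variables being averaged over.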

JeffJo, 12/24/15, 8:07 AM
As a Christmas present, and trying desperately to demonstrate good will, let me show you the correct way to do what you tried to do (and really what you accomplished, but with sloppy technique). But I’m going to leave one part more generic than you did, to isolate the differences between our approaches.

Let X be a random variable that determines the amounts of money in the game. In probability, we usually use upper case, like X, to represent a random variable. It does not have a specific value, it is an abstract concept representing a quantity that varies randomly. We use the equivalent lower case letter to represent a value of that variable in an instance of the experiment; in this case, x. Whether X is continuous or discrete[1] isn’t particularly relevant to this analysis, even though the properties of the two have significant differences that can be relevant to other analyses.

Then, let the value of the “lesser” envelope be defined to be l(x), and of the “greater” to be g(x).

Similarly, let C be the (discrete) random variable representing your initial choice. I’ll use c=-1 to mean you chose the lesser envelope, and c=1 for the greater.

C’s distribution[2] function is Pr(c=-1) = Pr(c=1) = 1/2. Assume the probability density[3] function for X is f(x).

Using these two random variables, an expression for the change in value when you switch is (c*l(x) – c*g(x)). The expected value needs to “accumulate” all possibilities over these two random variables. The discrete one is a summation and the continuous one is an integration:

E(switch) = ∫ sum[(c*l(x) – c*g(x))*Pr(c)]*f(x)*dx

Probably because you thought l(x) and g(x) represented two different random variables, and not two values derived from a single random variable, you reversed the order of the integration and the summation. That let you isolate the "different" random variables in the integration. While not incorrect, you did far more work than was needed – and required an assumption I don’t see the need for (“we used a u-substitution and took advantage of the fact that the integral goes from 0 to infinity”).

What you got, correcting a typo, was ∫ x f(x) dx - ∫ 0.5x g(x) dx. This essentially means the contribution of the “lesser” envelope to the expectation, minus the contribution of the “greater” envelope.

All that is unnecessary if you do the summation first. It evaluates to:

[(-1)*l(x) – (-1)*g(x)]/2 + [(1)*l(x) – (1)*g(x)]/2
= [-l(x)+g(x) + l(x)-g(x)]/2
= 0.

Making the integration, the variable change, and the entire distribution question moot.

BTW, your “x” was the lesser amount, l(x)=x, and g(x)=2x. Again, these are not two different random variables, they are values derived separately from one. My “x” was the total amount, l(x)=x/3, and g(x)=2x/3. You need to re-derive your answer if these functions are not linear; mine works for any set of functions defining what is in the envelopes, and for any distributions.
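
The claim that the summation over c vanishes for any pair of envelope functions can be spot-checked with a small sketch. The function name and the deliberately nonlinear sample functions below are hypothetical, chosen only to show that linearity is not needed for this step.

```python
import math


def mean_gain_over_choice(lesser_value, greater_value):
    """Average of c*l(x) - c*g(x) over c = -1 (hold lesser) and c = +1
    (hold greater), each with probability 1/2."""
    total = 0.0
    for c in (-1, 1):
        gain = c * lesser_value - c * greater_value
        total += 0.5 * gain
    return total


# The post's choice, the total-based choice, and an arbitrary nonlinear pair.
cases = {
    "l=x, g=2x": (lambda x: x, lambda x: 2 * x),
    "l=x/3, g=2x/3": (lambda x: x / 3, lambda x: 2 * x / 3),
    "l=sqrt(x), g=x^2+1": (lambda x: math.sqrt(x), lambda x: x**2 + 1),
}
for label, (l, g) in cases.items():
    print(label, [mean_gain_over_choice(l(x), g(x)) for x in (1.0, 7.5, 120.0)])
```

Each entry is zero, because the two terms of the sum are exact negatives of each other for any values of l(x) and g(x).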

+++++

[1] And modeled as continuous using delta functions.
[2] One of the differences between continuous and discrete random variables, is that the distribution function for a discrete one is the actual probability of each possible value. Conventionally, we use the notation Pr(x=value).
[3] But with a true continuous random variable, no value has a non-zero probability. Instead, we use what is called a probability density function, typically called f(x). The probability distribution is defined over an interval, not at a value, as Pr(x0<=x<=x1) = ∫ f(x)*dx, for x=x0 to x1.

JeffJo, 12/26/15, 4:32 AM
"Obviously, I simplified the double integral down to a single one, because that part is so simple." If it is so simple, then show me the structure you gave for #1 is reduced to what you use, “∫ (x * f(x)/p(x) - 0.5x * g(x)/p(x)) p(x) dx”

The reason you can't, is because the second random variable you use is accumulated by a summation over the initial choice of lesser/greater.

And I have understood every part of your posts - you are just unwilling to address mine, so you ignore my points. As an example, I point to how you ignored the fact that your f(x) and g(x) are probability density functions, not probability distribution functions like you call them. But they are density functions of a transformation of the same random variable, not different ones. This is a basic point, it is a harder one than the trivial one you keep asking.

It seems obvious now that you have no interest in other people’s thoughts or ideas, even if they conflict only mildly with your own.

JeffJo, 12/27/15, 6:29 AM
“You are, as ever, wrong about most things you say.” I have not been wrong about anything – you just define your own opinion as right, and either ignore the facts I present, or treat them as opinions which must be wrong. From the top:

“I'm afraid you're trivializing the problem.” It is a trivial problem, but people develop “blind spots” about simple (few cases, not necessarily easy) probability problems that conflict with their intuition.

“This is not actually addressing the two envelopes problem, it's just restating it.” (1) I didn’t “state” anything, I summarized two conflicting solutions. Even if I did, it wasn’t a “restatement” because you didn’t state it, you linked to an article that includes many different problems:

1) Before I open an envelope, should I switch?
a. Why do people get different answers?
b. What is correct?
c. Why is the other one incorrect?
2) After I open an envelope, so I know one value, should I switch?
a. Why do people get different answers?
b. What is correct?
c. Why is the other one incorrect?
3) What is the impact of making it a (true) continuous distribution?

Your only statement was “come to peace with an explanation you can communicate to us.” The implication of your analysis was that you wanted to “come to peace” with 1c, arriving at 1b along the way. I did exactly that – it is you who were not “at peace” with how I did so. There was nothing wrong with how I did it, you just didn’t like it. Any interviewer worth hiring me would have recognized it.

Yes, it was a fairly trivial error – but it was an error, and a provable one. A fact you have not acknowledged. You tried to accomplish the same thing, but did so subjectively only. You merely asserted “it is NOT equally likely that the other envelope contains $40 or $10.” At first, you addressed the question for specific values of X (addressing problem 2, not 1). Then with specific distributions. You did say “the chances of either doubling or halving your money are generally not equal,” BUT THIS IS AN ASSERTION, NOT A PROOF THAT THEY ARE NOT EQUAL. I proved that there must be an example where they are not equal, so the assertion must be true. AND I DID IT MUCH MORE SIMPLY THAN YOU DID.

You did point out that if “the amount of money is infinitely subdivisible” my proof doesn’t work. I realize that, and said as much. But in an interview, if this point gets raised, I can show the equivalent argument for a continuous distribution[1]. You just dismissed my correct argument, and implied there could not be an equivalent argument for a continuous distribution. This is what I mean by ignoring my arguments.

And I frankly find your “homework” insulting, since I have shown you over and over that I know quite a lot about probability. More than you do, it seems, so you have no right to test me. In my opinion, you are just using it as an excuse to ignore any point I make.

I skipped your correct answers to my questions, because I wanted to address only why one was wrong. “The answer to your new question this time about how to derive the expression I used from a double integral.” You don’t seem to understand what a double integral is, or what a problem in two random variables is. Even after I explained it once. So here is a lesson. A random variable is a measure of a quantity that can vary unpredictably. If you have two, you need to use a joint density function f(x,y). If the RV’s are independent, you can separate them into f(x,y)=g(x)*h(y). Either way, to get the distribution function you must integrate over BOTH variables.

This isn’t what you did. You did have two random variables, but they are not the ones you claimed. You used one variable for both values, and one to indicate which value you have. So in “p(x) = 0.5( f(x) + g(x) )”, the “0.5” is the probability term for the second variable, and the two terms represent the summation over that variable. Your f(x) and g(x) are just transformations of the only distribution for the first. What you called “separating the double integral” was merely applying the property (A+B)=(A)+(B).
JeffJo, 12/28/15, 6:00 AM
(1 of 2)
1. “Your summaries are boring, useless and irrelevant.” My summaries carefully outline what is right about the two conflicting solutions, and prove the one and only part that is wrong about the incorrect one, thereby satisfying the requirement. Ad hominem attacks aside, they are simpler, and at the same time more rigorous, than yours. And yes, I do understand your argument – I just think mine supersedes it based on its simplicity. If you think there is a point I don’t understand, point it out and I will explain what is right, or wrong, about it.

2. “I'm actually quite proud of you. If you had presented this solution in your first post, I would say that it's a different approach to the problem that at least gives some good insights.” Condescension aside, it is a verbose treatment that shouldn’t be necessary. I included it just as a demonstration for you, since you think continuous distributions require a different approach. It really is necessary only if there is a chance of $0 (discrete case) or an arbitrarily small amount (continuous case) in both envelopes. If there is a minimum value Xmin>0, then my original argument extends to F(2*Xmin)-F(Xmin), since it is greater than zero but F(Xmin)-F(Xmin/2)=0. I just didn’t want you to dismiss this trivial, and equivalent, statement like you did my original one.

Had you responded with this level of acceptance to my original problem – that is, not calling my solution a technicality when (A) it was a correct proof, (B) your objection was a technicality, that (C) is easily and trivially addressed, we would not be in this position now.

“Don't you hate those people that nitpick the minor details, then misunderstand and dismiss the whole thing?” Yep. That’s what riled me.
JeffJo, 12/28/15, 7:36 AM
A re-presentation of my answer to the interviewer:

Hi, I’m Monty Hall, and I’m holding two heavy boxes. A random (but non-zero – that wouldn’t be any fun!) amount of molten gold ($1,000 per ounce) was poured into one box, and exactly twice as much was poured into the second. Then, the two were topped up to exactly one pound with worthless sand. You get to pick one box; but after you finalize your pick, I have three people here with some advice for you.

Contestant: I’ll pick box A.

Wild statistician: Let’s say, for the sake of argument, that Box A has 1 ounce of gold. Then the other box might contain 2 ounces, or half an ounce. Since these two possibilities are equally likely, your expectation value after switching would be half of their sum (0.5*$2,000 + 0.5*$500), or $1,250. That's 25% more than the $1,000 we assumed you have now.

You: But if I switch to Box B, whatever amount we assume is in there, the same argument says Box A has 25% more. And then again, and again. Pretty soon that is more than one pound of gold.

But I see your error – you assumed, without proof, that Box B was equally likely to contain 2 ounces, or half an ounce. That can be true for some amounts of gold in the boxes, but not all. For example, if Box A contains more than 8 ounces, then Box B can’t contain twice as much. Similarly, since “but non-zero” means there is a minimum amount Gmin, then if Box A has between Gmin and 2*Gmin, Box B can’t have half as much. These arguments wouldn’t apply if it was possible there was no gold in either box (that wouldn’t be any fun!) and there was no upper bound (get serious), but I’m sure the argument could be extended to those cases anyway.

JeffJo: Instead, assume both boxes together contain T ounces of gold. And while it may seem reasonable to do so, I don’t need to place any restrictions on T. Box A is either the lesser box with T/3 ounces, or the greater one with 2T/3. Since these two possibilities are equally likely, your current expectation value is half of their sum (0.5*T/3+0.5*2*T/3) = T/6+T/3 = T/2. If you switch, you technically should switch the order of this summation, but addition is commutative so the result is the same.

Contestant: But didn’t you make the same assumption?

JeffJo: No, WS’s assumption was tied to specific values – mine was only about whether you had the lesser, or greater, value. WS's specific error was a mis-application of the Principle of Indifference. It says that options which are "indifferent" are equally likely. "Lesser" and "greater" are indifferent, but "1/2 ounce" and "2 ounces" may not be, as you pointed out.

NaClhv: But you need to consider the greater and lesser values to be different random variables, symbolically derive separate distributions for them, and integrate over their entire ranges to find the expected value of switching!

JeffJo: No, you really do not. The expectation value when you switch is 0.5*(2T/3-T/3) + 0.5*(T/3-2T/3). This is true no matter what T can be, how it is distributed, if it is continuous or discrete, or whether you know the value or are just treating it as a simple variable. And it is zero, no matter what T is.
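The claim that the expected gain is zero whatever T is can be checked exactly for a discrete case. The sketch below is illustrative only: the function name `expected_switch_gain` and the lopsided distribution of totals are assumptions, chosen to show that the shape of the distribution does not matter.

```python
from fractions import Fraction


def expected_switch_gain(total_distribution):
    """Expected gain from switching, given {total T: Pr(T)} and a 50/50 initial pick."""
    half = Fraction(1, 2)
    gain = Fraction(0)
    for total, prob in total_distribution.items():
        lesser = Fraction(total) / 3
        greater = 2 * Fraction(total) / 3
        # Holding the lesser box: switching gains greater - lesser.
        # Holding the greater box: switching gains lesser - greater.
        gain += prob * (half * (greater - lesser) + half * (lesser - greater))
    return gain


# Hypothetical, deliberately uneven distribution of the total (in ounces).
example_totals = {3: Fraction(7, 10), 6: Fraction(2, 10), 12: Fraction(1, 10)}
overall_gain = expected_switch_gain(example_totals)
```

The two terms cancel for each value of T separately, before any weighting by Pr(T), which is why the sum is zero for any distribution passed in.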
JeffJo, 12/29/15, 5:07 AM
“You said, about your second, new solution: "it is a verbose treatment that shouldn’t be necessary. I included it just as a demonstration for you, since you think continuous distributions require a different approach." No, your second solution is really fundamentally, deeply better than your first, deeply flawed approach, for the reasons I enumerated earlier.”

It is a verbose treatment that I included because I thought it would appeal to you, based on your proclivity for continuous distributions. A much better one, based on my original (which is rigorous and sufficient to the task), is (and this can be extended to continuous based on the hint I’ll drop; discrete is easier):

1) In order to have x dollars (continuous case: X s.t. x0<=x<2x0) the pair of values must be (low,high) = (x/2,x) or (x,2x). (Continuous case: F(x0)-F(x0/2) = F(2x0)-F(x0).)
2) The assumption of equal probability, based on x dollars in the chosen envelope, is true if and only if Pr(x/2,x) = Pr(x,2x) for every x in the distribution's range.
3) This is a contradiction for several reasons. First, it implies that if x is in the range, x/2 and 2x must also be, which is COMPLETELY unreasonable because it implies the values can be arbitrarily close to zero and arbitrarily large. If you think you can allow that, you can’t create a non-zero distribution for either the discrete, or continuous, cases. And please note that I said distribution, not density.

“Don't you see that you've achieved a deeper level of insight into the structure of the problem?” No. The only insight necessary is that your Wild Statistician applied the Principle of Indifference without checking to see if its prerequisites applied. Since my very simple demonstration shows that they cannot, his solution is trivially invalid.

“That you've actually (at least in part) addressed the wild statistician's flaw instead of actually arguing for him by fueling the reason to switch with your initial "if you have $1, the other one obviously must contain $2" argument?” ????? And you completely ignore the fact that the point was to find a flaw in his argument, and show that there was not a flaw in the argument that says switching can’t matter.

Yes, in that case you should switch, but that was not the full argument. There is a corresponding problem at the maximum end – it’s just harder to convince nitpickers that there must be a maximum.

“… an argument that works based on the structure of the entire distribution function instead of giving a single (misguided) counterexample.” Again, the example was there to find an error in one argument. You just won’t admit that proving the W.S. is wrong is all that is necessary. So you focus on this technicality, which is easily remedied.

“On all that stuff about dirac delta functions: if you promise that you'll actually address my main point immediately after I demonstrate how to get my answer from a double integral, I'll go ahead and do it.” Why don’t you try addressing my description of what you did first? Then you’ll see that you have no double integral over the two random variables you claimed. And that if you actually show a valid modeling of a discrete distribution, its random variable will be the one that I said you treated as discrete.

“But I think your confusion here is intricately tied to you misunderstanding my solution,…”

I completely understand what you wrote. If there is any misunderstanding, you don’t understand what you wrote. But you are still ignoring the fact that it is way too complicated. Using a better choice of Random Variables, the function you integrate is identically zero. This renders any understanding of what you did that is different than what you wrote irrelevant.
JeffJo, 12/29/15, 6:58 AM
The following, except as noted, applies to any variation of the Two Envelope Problem before you look in an envelope. This includes the discrete and continuous cases (and discrete-modeled-as-continuous), cases with minimum and maximum values, and cases where the values can be arbitrarily large or small.

1) Let [x0,x1) be a range of values with a non-zero probability Q, and x00.
4) The Wild Statistician’s assumption is proven impossible, by contradiction.
5) In the classic Two Envelope problem, currency is placed in the envelopes so the distribution is discrete, and has a minimum. Being discrete isn’t significant, but the minimum is: With it, (3a) is sufficient to demonstrate the contradiction that leads to the conclusion in (4).
6) My argument about my original “Solution 1” is sufficient to prove that WS’s solution isn’t correct.
a. It didn’t address whether his conclusion was right or wrong, just the argument.
b. If you also accept a maximum, the trivial parallel proves his conclusion is wrong.
c. If you don’t accept either the minimum or maximum, the simple extension of the argument in (3) is required to prove him wrong.
d. Being discrete, continuous, or discrete-modeled-as-continuous is irrelevant.
7) NaClhv said “The chances of either doubling or halving your money are generally not equal. This will be true for ANY reasonable probability distribution of possible values of money in the envelops.” NaClhv did not prove this must be true; NaClhv merely showed examples based on assuming various distributions (or values) first, and then showing it was not true for them. So as far as NaClhv showed, WS could be correct.
a. I proved it, well enough for the first response to an interviewer.
b. If the interviewer raises technicalities like NaClhv did, there are easy replies.
8) “However, what if the wild statistician insists on putting the problem in terms of expected gain conditioned on the different possible values of the money in your current envelope?” I addressed this completely.
a. Let T be the total amount.
b. The lesser amount is T/3, and the greater amount is 2T/3.
c. The expected gain when you switch is (T/3)*Pr(c=low)+(-T/3)*Pr(c=High) = 0.
d. This is true for any distribution of T, making any argument about distributions completely irrelevant. Again.
e. This is true for any value of T, before you integrate over it and its distribution. So even if you look in an envelope, and deduce something about T, the expectation is still zero.
9) NaClhv presented a 300+ word argument that showed that the expectation over all values of X, a random variable related to T, was zero.
a. It used incorrect terminology and sloppy math, but is correct in the end.
b. It did NOT show that the expectation for any value of X was zero.
c. Whether or not you accept it as “correct,” it is inferior to the argument in (8) which is shorter, simpler and more general.

Which points do you disagree with, and why?
JeffJo, 12/29/15, 2:15 PM
“You still ignored the one single question that I've been asking you to look at for many, many posts.” I have answered your trivial and insulting question, in a more general form than you asked it. You just missed it- probably because you did not read it. I have also demonstrated far better knowledge than it requires.

“I've pointed out the many mistakes you've made.” You have not pointed out a single one. Saying things like “this trivializes the problem” is an unsupported statement of opinion. If you think otherwise, please quote what I said that is in error, and explicitly point out the error. I even tried to make it easy for you by itemizing the points, but you use your insulting test as a way to avoid identification of the errors you claim exist.

+++++

When you said "f(x) = probability distribution of the money in the "lesser" envelope," you use f(x) as a density function. Not a distribution.

When you said "g(x) = probability distribution of the money in the "greater" envelope," and how you use it later, you are using one random variable but giving it two density functions. What you are trying to do, as I have said repeatedly, is represent two functions of a single random variable. That requires just the one random variable at this point.

The second random variable you use is the choice. It is discrete, with two possible values {low,high} and distribution (0.5, 0.5). You do not represent it as a random variable, tho. What you call p(x) is really the joint distribution-and-density function of your two random variables.

As I keep saying, the result ends up correct, but it is an overly-complex way of accomplishing what you want. There is a much simpler way, but you use excuses to avoid addressing it.

Now, what do you disagree with in my clearly stated, and clearly supported list?
JeffJo, 12/29/15, 2:32 PM
Egad, maybe it wasn't as clear as I thought - because two steps got deleted, and one corrupted somehow. They were there, but I didn't keep a copy.

1) Let x0<x1<2x0, and [x0,x1) be a range of values with a non-zero probability Q= Pr(x0<=x<x1).
2) WS's assumption is that Pr(x0/2<=x<x1/2) = Q = Pr(2x0<=x<2x1). This can be extended endlessly to Pr(x0*2^n<=x<x1*2^n) = Q for any integer n, positive or negative.
a. If the range of possible x's has a minimum, this requires non-zero probabilities below that minimum.
b. If the range of possible x's has a maximum, this requires non-zero probabilities above that maximum.
c. If the range goes from 0 to infinity, then Q must be zero.
3) All three cases violate the assumptions.
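Case (c) rests on a counting argument that can be sketched briefly. Because x0 < x1 < 2x0, the bands [x0*2^n, x1*2^n) do not overlap, so their probabilities add; if each band carries the same Q > 0, finitely many of them already exceed a total of 1. The function names and the value Q = 1/100 below are hypothetical, chosen only for illustration.

```python
import math
from fractions import Fraction


def bands_are_disjoint(x0, x1):
    """Bands [x0*2**n, x1*2**n) never overlap when x0 < x1 < 2*x0."""
    return x0 < x1 < 2 * x0


def bands_needed_to_exceed_one(q):
    """Smallest number of bands, each with probability q > 0, whose total exceeds 1."""
    return math.floor(1 / Fraction(q)) + 1


q = Fraction(1, 100)               # hypothetical probability of one band
k = bands_needed_to_exceed_one(q)  # number of bands that breaks the total
total_probability = k * q          # already greater than 1
```

However small Q is, as long as it is positive, some finite number of doubling bands pushes the total probability above 1, so on an unbounded range Q has to be zero.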
JeffJo, 12/30/15, 7:03 AM
"I strongly urge you to use this opportunity to take a new look at my argument and really understand it." And I urge you to do the same.

Specifically, if one random variable represents the value in both envelopes - as your "x" does, in an overly complicated way that you don't describe well - and the other represents the choice of low vs. high - as yours does, but you don't describe as a random variable at all - then integration is irrelevant. This is true whether you use a discrete distribution, as the original problem suggests; or a truly continuous one, without infinite values for the density; or a dirac-delta based continuous model of a distribution that includes discrete elements.

WHY? BECAUSE THE EXPRESSION YOU WOULD INTEGRATE IS IDENTICALLY ZERO FOR EVERY VALUE OF X.

This isn't hard to understand. But you do have to try.
JeffJo, 12/30/15, 10:26 AM
1) Suppose X is a uniformly-distributed continuous random variable defined on the range x0<=x<x1. Let f(x) be its probability density function. What is the significance of the expression ∫ A(x)*f(x)*dx, integrated from x=x0 to x=x1?

2) Even if you can't figure out what f(x) is in question 1, do you even care that f(x)=1/(x1-x0) when you evaluate ∫ [A(x)-B(x)]*f(x)*dx if A(x)=B(x)? Do you need to know anything at all about f(x)?

From here on, say T is a random variable representing the total amount in the two envelopes. Then, S(t)=t/3 is a function representing the amount in the smaller envelope, and G(t)=2t/3 is a function representing the amount in the greater envelope. Further, A(t)=G(t)-S(t)=+t/3 is a function representing the gain when you switch from the smaller envelope, and B(t)=S(t)-G(t)=-t/3 is a function representing the gain when you switch from the greater envelope. Finally, let C be a discrete random variable representing which envelope you initially chose, so c={smaller, greater} with distribution {1/2,1/2}.

3A) Why do I use both upper case and lower case letters?

3B) What is different about how you handle C and T, since C must be discrete, and T could, in theory, be continuous or discrete?

3C) What is the significance of questions #1 and #2 to the expected value of a switch?
JeffJo, 12/30/15, 4:24 PM
“I take it from your posts that you are, in fact, okay with my changes?” Haven’t looked at them because, for the most part, they are irrelevant (see my last post) and probably still have careless errors, which you can’t seem to understand and don’t seem to care that you don’t. Like:

“x = amount of money in your current envelope,
f(x) = probability distribution of the money in the "lesser" envelope, and
g(x) = probability distribution of the money in the "greater" envelope.”

It is not just “my notation” that you get wrong, it is the definition of the terms. A probability distribution function is an actual probability. The probability for a single value x of a true continuous random variable (as opposed to a discrete one you model with dirac delta functions) is zero. The probability distribution is F(x)= ∫ f(t) dt from t=-inf to x.

“They do not use your notation, but all that should be perfectly clear.” And I have said they end up being correct, despite these careless errors.

“I've also explained earlier that I've collapsed the full integral down to a single one, …” but not how. Like almost all of what you say, you present a de facto conclusion without any support. This one would be pretty difficult to support, since that isn’t what you did. The second integration is really a summation, since your second random variable is the discrete one representing the choice. The two you call different ones are FUNCTIONS of a single random variable, and you integrate over it exactly ONCE. There are two terms in it that you add, but you have a single integral.

“Note here, between 2 and 3, that all this business with t is YOUR argument, not mine.” Right. And it is precisely why we don’t need yours, since the integration becomes irrelevant.

“It is the argument that I dismissed as being trivial” but never addressed that it is trivial because the problem, when viewed correctly, is that trivial.

“… the argument that we already knew from the very beginning, because it was essentially already a part of the problem at the start.” Where?
I’ll look at your “changes” tomorrow. Maybe. But it’s hard to see why I should, when you won’t look at mine except to call it “trivial,” and use lame excuses like "you haven't provided an answer that is found on the first page of any probability textbook" when you continue to get things wrong from the second, and all subsequent, pages.
JeffJo, 12/31/15, 7:46 AM
"I mean - "probability distribution function" vs. "probability density function"? Really?" Knowing these terms is more significant than knowing a uniform density function is 1/(x1-x0). I keep trying to get you to ignore such questions by admitting to you that the names aren't important, just like that insulting question isn't important. But you keep using that insulting question, and others, as an excuse to ignore what I say.

"And, since you keep bring it up..." Did you see where they said "However, this use is not standard among probabilists and statisticians" ? Or notice that my point wasn't that your usage was not understood, but that it could be interpreted as not understanding the difference? The kind of point YOU keep raising with your insulting question?

Now, you should look at https://en.wikipedia.org/wiki/Conditional_probability_distribution#Continuous_distributions . The conditional density you keep writing as a function of a single random variable should be written as a function of two. This is why "p("smaller" envelope|x) = 0.5 f(x) / p(x)" is still an incorrect expression, even if the coefficient is now correct (I haven't bothered to check).

Let the smaller amount be the random variable S (values called s), the greater be G (and g), and the value in your envelope be X (and x). I don't know how to do a subscript here, so I'll call the densities fS(s), fG(g), and fX(x). And there is another we need - the joint density fSX(s,x).

These are highly related random variables. But the correct way to do what you are trying is to discover that relationship, which you haven't. The correct way to write the expression for what you called p is fS(s|X=x)=fSX(s,x)/fX(x). IT NEEDS THE JOINT DISTRIBUTION OF TWO DIFFERENT RANDOM VARIABLES. To average it you need a double integral, over ds and dx SEPARATELY. YOU DON'T DO THIS.

The (single) integration that you end up with is 0.5 ( ∫ x f(x) dx - ∫ x f(x) dx ). What you continue to ignore is that there can be many ways to get there. One is the complicated, technically-incorrect-but-may-end-up-right way you used. Another is the integral I presented in question #2 a few posts ago, ∫ [A(t)-B(t)] f(t) dt where A(t)=B(t). Did you not notice a similarity between yours and mine? THEY ARE ESSENTIALLY THE SAME THING, with the exception of the coefficient you changed which turns out to be irrelevant because it gets multiplied by zero.

You keep dismissing mine as trivialization. It isn't - the result is the exact same result you try to get, without the complex machinations you use to transform the easy random variable T=total into X=your envelope, and then misrepresent the other random variables and claim you are using two when you aren't.

You just can't admit that a trivial-seeming analysis is sufficient for a trivial problem. And that the only thing you need to communicate to your interviewer is why the problem is trivial: (1) The answer that says you should switch contains an error. (2) The trivial one you dismiss does not, AND IT IS EASILY SHOWN TO BE CORRECT IF YOU DON'T DISMISS IT BASED ON ITS SIMPLICITY. (3) Its answer is that the expected gain, integrated over only the "choice" random variable, is zero. So actual integration over the "value" random variable is irrelevant, since it must be zero also.
JeffJo, 12/31/15, 7:56 AM
BTW, I have answered your insulting question three times now. Just in a more general form than you requested. You even acknowledged it. I have resisted the temptation to ask you if you realize that 1/(1-0)=1. Should I do so before letting any kind of discussion continue, since without such a statement it appears you don't understand simple arithmetic? This is the exact kind of thing you keep doing.
JeffJo, 1/1/16, 7:09 AM
(1 of 2 or 3 - let's see what I can fit)

“What I need from you is a confirmation and your willingness to follow me, to actually engage with me.” You describe two different things here. I am not going to “follow you,” but I have been trying from the beginning to “engage with you.”

The difference is that I feel I know just as much, or more, about probability in general, and about this problem specifically, than you do. I feel I have demonstrated this knowledge amply. I do not need to, and am not willing to, be led by the nose in this discussion, as you keep trying to do. I want it to be a two-way endeavor. Do you? I ask, because you have not shown any willingness to engage with me. Only to lead me, and I do not need to be led.

Toward those stated goals, I have absolutely no intentions of providing a direct answer to your insulting question which you know I can answer. The one whose only purpose at this point is to make me subservient to you in a one-sided discussion. I thought this would have been extremely obvious by now.

I also have no intention of insulting you, as you have insulted me. But I would have left this discussion long ago if I didn’t want a proper, two-way discussion.
JeffJo, 1/1/16, 7:27 AM
(2 of 3)

Starting points I want to emphasize:

1) You did not make a problem statement. You linked to Wikipedia. Its setup begins by describing what you call the Wild Statistician’s solution, and then simply asserting “However, this violates common sense.” The implication is that it is common sense that the two envelopes are equivalent to you, so which you pick shouldn’t matter.
a. The WS solution is demonstrably correct, except for one unsupported assumption (see (3d), (6), and (7) below).
2) Later – not as part of the “problem setup” as you implied in your very first reply to me – it provides a different, and very long-winded solution that, like yours, ends up being correct. It is almost impossible to follow.
3) I provided simple explanations for both solutions. I did not “just restate” the problem as you claimed; I did restate the solution described in (1) above, but I solved the part described by (2) in a different way.
a. The purpose of restating (1) was to identify the error.
b. Yes, this solution is trivial. That’s part of my original point. It is also demonstrably correct. There is one correctly applied assumption: that when you know nothing about either envelope, you have equal chances of having either.
c. The difference between (1a) and (3b) is that (1a) assumes you know something about one envelope. The value.
4) There is nothing wrong with a solution to (2) being trivial, if the solution is correct in all ways. I’ll point out that all you did was provide a different solution for the same thing.
a. Yours contained some marginally questionable steps, but ended up being right in the end.
b. But it ended up being right because, based on what you described as a trivialization, your integration was essentially ∫ k*Z(x)*f(x) dx, where Z(x) is identically zero for all x in the range of integration. So if you made a mistake in, for example, the constant k, you’d still get the right answer.
c. You have admitted to such a mistake.
5) One big question here, is which solution that parallels (2) was a better explanation: Mine, which is trivial and explained in two undeniably correct sentences; or yours, which contained many errors in nomenclature, one of which resulted in an error that did not produce the wrong result only because of a reason explained by mine.
a. In other words, is the “trivialization” you use as an excuse to ignore any attempt to discuss my solution, actually a better reason to do the same to yours?
naclhv, 2/14/16, 2:08 AM
It has been well over a month since I asked JeffJo for a reply. It saddens me to say that I must consider him to have abandoned this thread.

For the sake of my reader's understanding, and to clean up after JeffJo, I will now post the correct answers to the questions that came up during this discussion.

I had been asking one question to JeffJo, saying that it is the first of a short series of questions that will lead to a clear demonstration of his error, using only the simplest of algebra and probability theory. The following is that series of questions, in its entirety.

Question 1:

Let f(x) be a uniform probability density function over the real numbers in the interval [0,1]. What is the value of this function at x = 0.5?

My answer is that f(x) = 1. Tell me, am I right?

Answer:

Yes, you are right that f(x) = 1.

Question 2:

Now, let g(x) = 0.5 f(0.5x), as I stated in the original article. Given the same f(x) as in the previous question, what is the value of g(x) at x = 0.5?

Answer:

g(x) = 0.5 f(0.5 * 0.5)
= 0.5 * 1 = 0.5

Question 3:

Now using the same g(x) and f(x) as above, what is the value of x * f(x) - 0.5x * g(x) at x = 0.5?

Answer:

x * f(x) - 0.5x * g(x)
= 0.5 * 1 - 0.5 * 0.5 * 0.5
= 0.5 - 0.125 = 0.375

Question 4:

Now, is that expression in the previous question (that is, x * f(x) - 0.5x * g(x)) the expression that I'm integrating in my original article?

Answer: Yes, that is the expression that is being integrated in the original article.

Question 5:

And that expression - the integrand - is 0.375 at x = 0.5?

Answer: Yes, the integrand is 0.375 at x = 0.5

Question 6:

Is 0.375 = 0?

Answer: No, 0.375 is not equal to zero.

Question 7:

So, were you wrong when you repeatedly insisted that my integral reduces to yours because its integrand was zero for all x values?

Answer: Yes, I was wrong.

Now, did I actually expect JeffJo to go through with this exercise? Of course not - you can hardly expect someone who couldn't even say "Yes, f(x) = 1" to ever say "Yes, I was wrong". Even if he somehow managed to answer the first question, he would have had some excuse, distraction, evasion, or lie to avoid saying "yes, I was wrong" - even though he IS wrong, just as surely as 0.375 = 0 is wrong.
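The arithmetic in the question series above can be checked with a short sketch. It assumes f is uniform on [0, 1] as in Question 1 and g(x) = 0.5 f(0.5x) as in Question 2; the helper `midpoint_integral` and the integration range [0, 2] are assumptions added here for illustration.

```python
def f(x):
    """Uniform density of the lesser amount on [0, 1]."""
    return 1.0 if 0.0 <= x <= 1.0 else 0.0


def g(x):
    """Density of the greater amount, g(x) = 0.5 * f(0.5 * x), nonzero on [0, 2]."""
    return 0.5 * f(0.5 * x)


def integrand(x):
    """The expression from Question 3: x*f(x) - 0.5*x*g(x)."""
    return x * f(x) - 0.5 * x * g(x)


def midpoint_integral(func, lo, hi, steps):
    """Midpoint-rule approximation of the integral of func over [lo, hi]."""
    width = (hi - lo) / steps
    return sum(func(lo + (i + 0.5) * width) for i in range(steps)) * width


value_at_half = integrand(0.5)                         # the integrand at one point
total = midpoint_integral(integrand, 0.0, 2.0, 2000)   # the integral over the whole range
```

Under these assumptions the integrand is 0.375 at x = 0.5 while the integral over [0, 2] comes out at zero up to rounding: the positive part on [0, 1] is cancelled by the negative part on (1, 2], so a zero integral does not require an integrand that is zero everywhere.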
Anonymous, 8/24/16, 4:58 AM
It can be taken as an axiom that both envelopes are equally preferable. The probability of there being a given pair of amounts of money is then defined that way. If one envelope has $20, then the other envelope has $10 with a probability of 2/3, or $40 with a probability of 1/3.

So the solution cannot be described in terms of actual amounts of money, because this probability cannot be summed into a finite distribution.
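A small sketch of what these numbers imply, under assumptions that go beyond the comment: amounts are taken to sit on a doubling ladder (2^n, 2^(n+1)), and the 2/3 versus 1/3 split is read as saying each pair is half as likely as the pair one step below it. The function name `unnormalized_weights` is hypothetical.

```python
from fractions import Fraction

held = 20
# With Pr(other = $10) = 2/3 and Pr(other = $40) = 1/3, switching is a fair bet.
expected_other = Fraction(2, 3) * (held // 2) + Fraction(1, 3) * (held * 2)


def unnormalized_weights(levels):
    """Relative weight 2**(-n) of the pair (2**n, 2**(n+1)) for n = -levels..levels."""
    return [Fraction(2) ** (-n) for n in range(-levels, levels + 1)]


# The total weight keeps growing as smaller and smaller pairs are included.
partial_sums = [sum(unnormalized_weights(k)) for k in (5, 10, 20)]
```

The expected value of the other envelope equals the $20 held, and the partial sums grow without bound as the ladder is extended downward, which is one way to read the remark that these probabilities cannot be normalized.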
John, 8/28/20, 2:14 AM
Jeeez, watching the sparring match between JeffJo and the author of this blog post. You guys are just too verbose.

There is a much much simpler solution of this problem; and that is to note that the ONLY assumption being made (call it A) is that it is equally likely that the other envelope contains the lesser or the greater amount. Since the calculation of the expected value from A is correct, yet the answer is wrong, by logic therefore, assumption A must be wrong.

As you can see, almost no math, just logic, is necessary in order to solve the problem.

61 comments :