Probability

alicanb · June 13, 2020, 7:15pm

A couple of things:

You can import Multinomial directly from torch.distributions, i.e., from torch.distributions import Multinomial.

distribution.sample() takes a sample_shape argument. So instead of sampling with NumPy and converting to PyTorch, you can simply write Multinomial(10, fair_probs).sample((3,)) (sample_shape needs to be a tuple).
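For anyone who wants to try this directly, here is a minimal sketch. It assumes fair_probs is a uniform distribution over the six faces of a die (that definition is an assumption here, since it is not shown in this thread).

```python
import torch
from torch.distributions import Multinomial

# Assumed definition: probabilities of a fair six-sided die.
fair_probs = torch.ones(6) / 6

# Three independent experiments of 10 rolls each.
# sample_shape is passed as a tuple, as noted in the post above.
counts = Multinomial(10, fair_probs).sample((3,))

print(counts.shape)       # one row of six face counts per experiment
print(counts.sum(dim=1))  # each row adds up to the 10 rolls of its experiment
```

Each row holds the face counts for one experiment, so no NumPy round trip is needed.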

anirudh · June 14, 2020, 3:12am

Thanks @alicanb. We have addressed your suggestions and updated the section in this commit

Emanuel_Afanador · June 16, 2020, 10:00pm

Hello, I have a question about question 3 (Markov chain). I’m not sure about my answer:

P(A,B,C) = P(C|A,B)P(A,B) = P(C|A,B)P(B|A)P(A)

Since the states A, B, C have the Markov property, P(C|A,B) = P(C|B), so

P(A,B,C) = P(C|B)P(B|A)P(A)

Thanks in advance.

goldpiggy · June 18, 2020, 3:04am

Hi @Emanuel_Afanador, since $B$ only depends on $A$, and $C$ only depends on $B$, we have

$P(A, B, C) = P(C \mid A, B) P(A, B) = P(C \mid A, B) P(B \mid A) P(A) = P(C \mid B) P(B \mid A) P(A)$.
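The Markov step can be sanity-checked numerically. The sketch below builds a small chain A -> B -> C from made-up probabilities (all numbers are hypothetical) and confirms that, in such a chain, P(C | A, B) computed from the joint does not depend on A.

```python
from itertools import product

# Hypothetical two-state chain A -> B -> C; all numbers are made up.
p_a = {0: 0.3, 1: 0.7}                                     # P(A)
p_b_given_a = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}}   # P(B | A)
p_c_given_b = {0: {0: 0.6, 1: 0.4}, 1: {0: 0.5, 1: 0.5}}   # P(C | B)

# Joint distribution built as P(A) P(B | A) P(C | B).
joint = {
    (a, b, c): p_a[a] * p_b_given_a[a][b] * p_c_given_b[b][c]
    for a, b, c in product((0, 1), repeat=3)
}

def cond_c_given_ab(c, a, b):
    """P(C = c | A = a, B = b), computed from the joint by definition."""
    p_ab = sum(joint[(a, b, k)] for k in (0, 1))
    return joint[(a, b, c)] / p_ab

# Largest difference between P(C | A, B) and P(C | B) over all states.
max_gap = max(
    abs(cond_c_given_ab(c, a, b) - p_c_given_b[b][c])
    for a, b, c in product((0, 1), repeat=3)
)
total = sum(joint.values())
print(total, max_gap)
```

The gap is zero up to floating-point error, which is the numerical counterpart of replacing P(C | A, B) with P(C | B).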

JohnG · June 23, 2020, 10:34pm

Has anyone else encountered this problem with the code above? In version 0.7 of Dive into Deep Learning, the code works as shown above, with all the probabilities converging to the expected value of 1/6. With the code in version 0.8.0 of the same book, however, the curves (see the image on the right) do not look right. Both sets of curves were obtained by running the book's code unchanged on the same PC. Could there be a bug in version 0.8.0 of the book? Thanks!

StevenJokes · June 24, 2020, 12:18pm

Maybe it is just a coincidence that almost 90 groups of experiments came up “die = 6”?
It would be clearer if you used counts / 1000 # Relative frequency as the estimate.
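To see what the relative-frequency estimate looks like, here is a rough sketch using only the standard library. The group count and group size are hypothetical choices, not the book's exact settings, and the divisor is the total number of rolls.

```python
import random

random.seed(0)  # fixed seed so the run is repeatable

# Hypothetical experiment sizes (not taken from the book).
num_groups, rolls_per_group = 1000, 10
counts = [0] * 6

for _ in range(num_groups):
    for _ in range(rolls_per_group):
        counts[random.randrange(6)] += 1  # roll one fair die

total_rolls = num_groups * rolls_per_group
estimates = [c / total_rolls for c in counts]  # relative frequency per face
print(estimates)
```

With this many rolls, every estimate should sit close to 1/6. A curve that stays far from 1/6 usually points to a normalization problem rather than bad luck.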

ness001 · June 24, 2020, 1:55pm

In L2/5 Naive Bayes, regarding Nvidia Turing GPUs, why did Alex say that adding more silicon is almost free for Nvidia?

goldpiggy · June 26, 2020, 3:18pm

Hi @ness001, great question! For more details about GPUs, see 13.4. Hardware — Dive into Deep Learning 1.0.3 documentation.

alaa-shubbak · October 23, 2020, 12:19pm

For question 3, can we calculate it like this:
P(A,B,C) = P(A|B,C) * P(B,C), and since B does not depend on C,
P(A,B,C) = P(A|B,C) * P(B) * P(C)
Is this correct? If not, could you please explain why?
Thanks in advance.

goldpiggy · October 26, 2020, 8:49pm

Hey @alaa-shubbak, that’s correct!
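One way to test a step like P(B,C) = P(B) * P(C) is to compare both sides numerically. The sketch below uses a hypothetical chain A -> B -> C with made-up probabilities. For these numbers the two sides differ, which suggests that B and C are not independent in general when C depends on B.

```python
from itertools import product

# Hypothetical chain A -> B -> C; all numbers are made up.
p_a = {0: 0.3, 1: 0.7}                                     # P(A)
p_b_given_a = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}}   # P(B | A)
p_c_given_b = {0: {0: 0.6, 1: 0.4}, 1: {0: 0.1, 1: 0.9}}   # P(C | B)

joint = {
    (a, b, c): p_a[a] * p_b_given_a[a][b] * p_c_given_b[b][c]
    for a, b, c in product((0, 1), repeat=3)
}

# Marginals obtained by summing the joint over the other variables.
p_b1 = sum(p for (a, b, c), p in joint.items() if b == 1)
p_c1 = sum(p for (a, b, c), p in joint.items() if c == 1)
p_b1_c1 = sum(p for (a, b, c), p in joint.items() if b == 1 and c == 1)

print(p_b1_c1, p_b1 * p_c1)  # equal only if B and C are independent
```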

Aaron_L · November 22, 2020, 4:50am

For Q4:
If we run test 1 twice, the two results won’t be independent, since they use the same method on the same patient. In fact, we will very likely get the same result both times.

zhenling · August 18, 2021, 9:29am

For Q3
P(ABC)=P(C|AB)P(AB)=P(C|B)P(B|A)P(A)
Is this right? Is it the simplest answer for Q3?

zgpeace · March 15, 2022, 12:56am

Installation fails:

!pip install d2l==0.17.4

HyunA_Kim · March 15, 2022, 4:40am

Can you try !pip install d2l? That worked for me. Also, where did you get this 0.17.4 version?

zgpeace · March 16, 2022, 1:28am

It works. I use PyTorch in Colab. Thank you so much.

Abhishek_Verma · May 1, 2022, 6:34pm

In section 2.6.2.6
P(D1=1,D2=1) = P(D1=1,D2=1|H=0) * P(H=0) + P(D1=1,D2=1|H=1) * P(H=1)

Is this equivalent to (since D1 and D2 are independent)
P(D1=1,D2=1) = P(D1=1) * P(D2=1) ?
P(D1=1) has been calculated in equation 2.6.3 and P(D2=1) can be calculated similarly.

I am having a hard time proving this. Am I missing something?

Abhishek_Verma · May 2, 2022, 5:39am

“…by assuming the conditional independence”
my bad.
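The difference between conditional and marginal independence is easy to see with numbers. The sketch below uses illustrative values for the prior and the two tests (they are assumptions here, so check them against the section) and compares the joint obtained under conditional independence given H with the product of the marginals.

```python
# Assumed illustrative values; verify against the book's tables.
p_h = {1: 0.0015, 0: 0.9985}        # P(H = h)
p_d1_given_h = {1: 1.0, 0: 0.01}    # P(D1 = 1 | H = h)
p_d2_given_h = {1: 0.98, 0: 0.03}   # P(D2 = 1 | H = h)

# Marginals by the law of total probability.
p_d1 = sum(p_d1_given_h[h] * p_h[h] for h in (0, 1))
p_d2 = sum(p_d2_given_h[h] * p_h[h] for h in (0, 1))

# Joint under conditional independence of D1 and D2 given H.
p_joint = sum(p_d1_given_h[h] * p_d2_given_h[h] * p_h[h] for h in (0, 1))

print(p_joint, p_d1 * p_d2)  # these differ: D1 and D2 are not marginally independent
```

Both tests depend on the same hidden H, so a positive D1 makes a positive D2 more likely even though the tests are independent once H is known.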

Tianrui_Zhang · May 24, 2022, 4:29pm

Maybe there’s a typo in 2.6.7, which should be 0.00176655, and I get 0.8321304237 in 2.6.8. Is that correct?

MrBean · June 29, 2022, 8:40am

For the last question
If we assume the test result is deterministic, then

P(D2=1|D1=1) = 1
P(D2=0|D1=0) = 1

Doing the first experiment twice adds no information. Therefore, P(H=1|D1=1,D2=1) = P(H=1|D1=1). You can derive this with some arithmetic.
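The arithmetic can be sketched as follows, assuming illustrative values for the prior and the first test (these numbers are assumptions, not taken from this thread). With a deterministic repeat, P(D1=1, D2=1 | H) equals P(D1=1 | H), so the posterior does not move.

```python
# Assumed illustrative values for the prior and the first test.
p_h = {1: 0.0015, 0: 0.9985}       # P(H = h)
p_d1_given_h = {1: 1.0, 0: 0.01}   # P(D1 = 1 | H = h)

def posterior_h1(likelihood):
    """P(H = 1 | evidence), given P(evidence | H = h) for h in {0, 1}."""
    evidence = sum(likelihood[h] * p_h[h] for h in (0, 1))
    return likelihood[1] * p_h[1] / evidence

# Deterministic repeat: P(D2 = 1 | D1 = 1, H = h) = 1 for both h.
p_repeat = 1.0
p_both_given_h = {h: p_repeat * p_d1_given_h[h] for h in (0, 1)}

post_one = posterior_h1(p_d1_given_h)    # after one positive test
post_two = posterior_h1(p_both_given_h)  # after the repeated positive test
print(post_one, post_two)
```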

timengler · August 20, 2022, 8:16am

I don’t understand equation 2.6.3. On the right-hand side, why wouldn’t P(A) in the numerator cancel with P(A) in the denominator? And since the other term in the denominator, the sum of P(B|A) over all b in B, equals 1, wouldn’t the equation then simplify to P(A|B) = P(B|A), which is obviously incorrect?
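A small numerical sketch may help. It assumes the equation in question is Bayes' theorem with the denominator expanded by the law of total probability, i.e. a sum over the values a of A with B held fixed. Under that reading, each term in the denominator carries its own P(a), so no single P(A) factors out and cancels. All numbers below are hypothetical.

```python
# Hypothetical two-valued A and one fixed observed value b of B.
p_a = {"a0": 0.8, "a1": 0.2}          # P(A = a)
p_b_given_a = {"a0": 0.1, "a1": 0.7}  # P(B = b | A = a) for the fixed b

# Denominator: sum over the values of A (not over the values of B).
evidence = sum(p_b_given_a[a] * p_a[a] for a in p_a)  # P(B = b)

posterior = {a: p_b_given_a[a] * p_a[a] / evidence for a in p_a}

print(posterior)                  # P(A = a | B = b), sums to 1 over a
print(sum(p_b_given_a.values()))  # sum over a of P(B = b | A = a), need not be 1
```

Summing P(B = b | A = a) over b gives 1 for each fixed a, but the denominator sums over a instead, weighted by P(a), so the posterior does not reduce to the likelihood.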