Formula for rolling dice (non-brute force)

First of all, I'm not sure where this question should be posted. I'm asking whether a statistics problem is NP-Complete and, if it is not, how to solve it programmatically. I'm posting it here because the statistics problem is the central point.

I'm trying to find a better formula for solving a problem. The problem is: if I have 4d6 (4 ordinary 6-sided dice), roll them all at once, remove one die with the lowest number (called "dropping"), and then sum the remaining 3, what is the probability of each possible result? I know the answer is this:

Sum (Frequency): Probability
3   (1):         0.0007716049
4   (4):         0.0030864198
5   (10):        0.0077160494
6   (21):        0.0162037037
7   (38):        0.0293209877
8   (62):        0.0478395062
9   (91):        0.0702160494
10  (122):       0.0941358025
11  (148):       0.1141975309
12  (167):       0.1288580247
13  (172):       0.1327160494
14  (160):       0.1234567901
15  (131):       0.1010802469
16  (94):        0.0725308642
17  (54):        0.0416666667
18  (21):        0.0162037037

The mean is 12.24 and the standard deviation is 2.847.

I found the answer above by brute force, and I don't know how to derive a formula for it, or whether one exists. I suspect this problem is NP-Complete and can therefore only be solved by brute force. It might be possible to take all the probabilities of 3d6 (3 normal 6-sided dice) and then skew each of them upward. That would be faster than brute force, because I have a fast formula for when all the dice are kept.
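For reference, the brute-force enumeration described here can be sketched in a few lines of Python. This is only an illustration of the approach, not the code actually used for the table; the function name and its parameters are made up for the example.

```python
from collections import Counter
from itertools import product


def brute_force_drop_lowest(num_dice=4, sides=6, num_dropped=1):
    """Count how often each sum occurs when the lowest dice are dropped."""
    freq = Counter()
    # Enumerate every ordered roll: sides ** num_dice of them.
    for roll in product(range(1, sides + 1), repeat=num_dice):
        kept = sorted(roll)[num_dropped:]  # discard the lowest values
        freq[sum(kept)] += 1
    return dict(sorted(freq.items()))


freq = brute_force_drop_lowest()
total = sum(freq.values())  # 6 ** 4 = 1296 equally likely rolls
for s, f in freq.items():
    print(f"{s:2d} ({f:3d}): {f / total:.10f}")
```

The loop visits every one of the `sides ** num_dice` rolls, which is why this approach stops being practical after a handful of dice.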

I programmed the keep-all-dice formula in college. I had asked my statistics professor about it, and he found this page, which he then explained to me. There is a big performance difference between that formula and brute force: 50d6 took 20 seconds, but 8d6 drop lowest crashes after 40 seconds (Chrome runs out of memory).

Is this problem NP-Complete? If yes, please provide a proof; if no, please provide a non-brute-force formula to solve it.

Note that I don't know much about NP-Completeness, so I might be thinking of NP, NP-Hard, or something else. A proof of NP-Completeness is useless to me; the only reason I ask for one is to prevent people from guessing. And please bear with me, as it has been a long time since I worked on this: I don't remember statistics as well as I might need to in order to solve it.

Ideally I'm looking for a more generic formula for X dice with Y sides when N of them are dropped, but I'm starting with something much simpler.

Edit:

I would also prefer the formula to output frequencies, but it is acceptable for it to output only probabilities.

For those interested, I have programmed whuber's answer in JavaScript on my GitHub (in this commit only the tests actually use the functions defined).

dice np

— SkySpiral7

This is an interesting question. I think it should be on topic here. Thanks for your consideration.

— gung - Reinstate Monica

Although the setting is interesting, you haven't yet asked an answerable question: the idea of NP-completeness depends on having a class of problems, whereas you have described only one. Exactly how do you want to generalize it? Although you hint that the number of dice might vary, various additional options are possible and could yield different answers: you could vary the number of faces, the values on the faces, the number of dice, and the number of dice dropped, all in various ways with various relationships among them.

— whuber

@whuber She doesn't know any complexity theory, but I think it's clear that she's asking about the family of problems generated by varying the number of dice. I also think I have an efficient algorithm for it.

— Andy Jones

@Andy I see at the end that she's asking for "a more generic formula for X dice with Y sides when N of them are dropped".

— whuber

@whuber Hah! Apparently it's not as clear as I thought. Sorry, my bad.

— Andy Jones

Answers:

Solution

Let there be $n=4$ dice, each giving equal chances to the outcomes $1, 2, \ldots, d=6$. Let $K$ be the minimum of the values when all $n$ dice are thrown independently.

Consider the distribution of the sum of all $n$ values conditional on $K$. Let $X$ be this sum. The generating function for the number of ways a single die can contribute to $X$, given that the minimum is at least $k$, is

$$f_{(n,d,k)}(x) = x^k+x^{k+1} + \cdots + x^d = x^k\frac{1-x^{d-k+1}}{1-x}.\tag{1}$$

Because the dice are independent, the generating function for the number of ways to form values of $X$ where all $n$ dice show values of $k$ or greater is

$$f_{(n,d,k)}(x)^n = x^{kn}\left(\frac{1-x^{d-k+1}}{1-x}\right)^n.\tag{2}$$

This generating function includes terms for the events where $K$ exceeds $k$, so we need to subtract them. Therefore the generating function for the number of ways to form values of $X$, given $K=k$, is

$$f_{(n,d,k)}(x)^n - f_{(n,d,k+1)}(x)^n.\tag{3}$$
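As a quick sanity check (an added illustration, not part of the derivation itself), setting $x=1$ in $(3)$ counts the rolls whose minimum is exactly $k$:

$$f_{(n,d,k)}(1)^n - f_{(n,d,k+1)}(1)^n = (d-k+1)^n - (d-k)^n.$$

For $n=4$ and $d=6$ this gives $671, 369, 175, 65, 15, 1$ for $k=1,\ldots,6$, which sum to $1296 = 6^4$, so every roll is counted exactly once.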

Note that the sum of the $n-1$ highest values is the sum of all values minus the smallest, which equals $X-K$. The generating function for $K=k$ therefore needs to be divided by $x^k$. It becomes a probability generating function upon multiplying by the common chance of any combination of dice, $(1/d)^n$:

$$d^{-n}\sum_{k=1}^d x^{-k}\left(f_{(n,d,k)}(x)^n - f_{(n,d,k+1)}(x)^n\right).\tag{4}$$

Since all polynomial products and powers can be computed in $O(n\log n)$ operations (they are convolutions and therefore can be carried out with the discrete Fast Fourier Transform), the total computational effort is $O(k\,n\log n)$. In particular, it is a polynomial-time algorithm.
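A minimal Python sketch of formula $(4)$ follows, assuming polynomials are stored as integer coefficient lists indexed by exponent. The function names are illustrative, and plain schoolbook convolution is used in place of the FFT so that the counts stay exact; it therefore does not achieve the operation count quoted above.

```python
def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists (index = exponent)."""
    if not a or not b:
        return []
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        if ai:
            for j, bj in enumerate(b):
                out[i + j] += ai * bj
    return out


def poly_pow(p, n):
    """Raise a polynomial to the n-th power by repeated squaring."""
    result = [1]
    while n:
        if n & 1:
            result = poly_mul(result, p)
        p = poly_mul(p, p)
        n >>= 1
    return result


def drop_lowest_frequencies(n=4, d=6):
    """Coefficients of d**n times formula (4): index = sum of the top n-1 dice."""
    def f_pow(k):
        # f_(n,d,k)(x)**n with f = x**k + ... + x**d; the zero polynomial when k > d
        if k > d:
            return []
        return poly_pow([0] * k + [1] * (d - k + 1), n)

    freq = [0] * ((n - 1) * d + 1)
    for k in range(1, d + 1):
        hi, lo = f_pow(k), f_pow(k + 1)
        for e, c in enumerate(hi):
            c -= lo[e] if e < len(lo) else 0  # formula (3): minimum exactly k
            if c:
                freq[e - k] += c  # multiplying by x**(-k) removes the minimum
    return freq


freq = drop_lowest_frequencies(4, 6)
print(freq[3:])  # frequencies of the sums 3..18
```

Dividing each entry by `d ** n` turns the frequencies into the probabilities of formula $(4)$.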

Example

Let's work through the example in the question, with $n=4$ and $d=6$.

Formula $(1)$ for the PGF of $X$ conditional on $K\ge k$ gives

$$\begin{aligned}
f_{(4,6,1)}(x) &= x+x^2+x^3+x^4+x^5+x^6 \\
f_{(4,6,2)}(x) &= x^2+x^3+x^4+x^5+x^6 \\
&\ldots \\
f_{(4,6,5)}(x) &= x^5+x^6 \\
f_{(4,6,6)}(x) &= x^6 \\
f_{(4,6,7)}(x) &= 0.
\end{aligned}$$

Raising them to the $n=4$ power as in formula $(2)$ produces

$$\begin{aligned}
f_{(4,6,1)}(x)^4 &= x^4 + 4x^5 + 10 x^6 + \cdots + 4x^{23} + x^{24} \\
f_{(4,6,2)}(x)^4 &= x^8 + 4x^9 + 10x^{10}+ \cdots + 4x^{23} + x^{24} \\
&\ldots \\
f_{(4,6,5)}(x)^4 &= x^{20} + 4 x^{21} + 6 x^{22} + 4x^{23} + x^{24} \\
f_{(4,6,6)}(x)^4 &= x^{24} \\
f_{(4,6,7)}(x)^4 &= 0
\end{aligned}$$

Their successive differences in formula $(3)$ are

$$\begin{aligned}
f_{(4,6,1)}(x)^4 - f_{(4,6,2)}(x)^4 &= x^4 + 4x^5 + 10 x^6 + \cdots + 12 x^{18} + 4x^{19} \\
f_{(4,6,2)}(x)^4 - f_{(4,6,3)}(x)^4 &= x^8 + 4x^9 + 10x^{10} + \cdots + 4 x^{20} \\
&\ldots \\
f_{(4,6,5)}(x)^4 - f_{(4,6,6)}(x)^4 &= x^{20} + 4 x^{21} + 6 x^{22} + 4x^{23} \\
f_{(4,6,6)}(x)^4 - f_{(4,6,7)}(x)^4 &= x^{24}.
\end{aligned}$$

The resulting sum in formula $(4)$ is

$$6^{-4}\left(x^3 + 4x^4 + 10x^5 + 21x^6 + 38x^7 + 62x^8 + 91x^9 + 122x^{10} + 148x^{11} + 167x^{12} + 172x^{13} + 160x^{14} + 131x^{15} + 94x^{16} + 54x^{17} + 21x^{18}\right).$$

For example, the chance that the top three dice sum to $14$ is the coefficient of $x^{14}$ , equal to

$$6^{-4}\times 160 = 10/81 = 0.123\,456\,790\,123\,456\,\ldots.$$

It is in perfect agreement with the probabilities quoted in the question.

By the way, the mean (as calculated from this result) is $15869/1296 \approx 12.244598765\ldots$ and the standard deviation is $\sqrt{13\,612\,487/1\,679\,616}\approx 2.8468444$ .
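These two values can be reproduced with exact rational arithmetic. The sketch below is only a check of the quoted figures, using the frequencies listed above; the variable names are illustrative.

```python
from fractions import Fraction
from math import sqrt

# Frequencies of the sums 3..18, taken from the coefficients listed above.
counts = [1, 4, 10, 21, 38, 62, 91, 122, 148, 167, 172, 160, 131, 94, 54, 21]
freq = dict(zip(range(3, 19), counts))
total = sum(freq.values())  # 6 ** 4

mean = Fraction(sum(s * f for s, f in freq.items()), total)
second_moment = Fraction(sum(s * s * f for s, f in freq.items()), total)
variance = second_moment - mean ** 2

print(mean, float(mean))
print(variance, sqrt(float(variance)))
```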

A similar (unoptimized) calculation for $n=400$ dice instead of $n=4$ took less than half a second, supporting the contention that this is not a computationally demanding algorithm. Here is a plot of the main part of the distribution:

Since the minimum $K$ is highly likely to equal $1$ and the sum $X$ will be extremely close to having a Normal $(400\times 7/2, 400\times 35/12)$ distribution (whose mean is $1400$ and standard deviation is approximately $34.1565$), the mean must be extremely close to $1400-1=1399$ and the standard deviation extremely close to $34.16$. This nicely describes the plot, indicating it is likely correct. In fact, the exact calculation gives a mean of around $2.13\times 10^{-32}$ less than $1399$ (the mean is $1400 - E[K]$, and $K \ge 1$ always) and a standard deviation around $1.24\times 10^{-31}$ less than $\sqrt{400\times 35/12}$.
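The size of the correction to the mean can be checked without computing the whole distribution. The mean of the top $n-1$ dice is $7n/2 - E[K]$, and $E[K] = \sum_{k=1}^{d} P(K \ge k) = \sum_{k=1}^{d} \left(\frac{d-k+1}{d}\right)^n$. A small sketch using exact fractions (variable names are illustrative):

```python
from fractions import Fraction

n, d = 400, 6

# P(K >= k) = ((d - k + 1) / d) ** n; E[K] is the sum of these tail probabilities.
expected_min = sum(Fraction(d - k + 1, d) ** n for k in range(1, d + 1))

gap = expected_min - 1                        # how far E[K] sits above 1
mean_top = Fraction(7 * n, 2) - expected_min  # exact mean of the top n - 1 dice

print(float(gap))
```

The gap is dominated by the term $(5/6)^{400}$, the chance that no die shows a $1$.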

— whuber

Your answer is fast and is correct so I've marked it as the answer. Also in an edit I said it would also be nice to have frequencies if possible. For that you don't need to edit your answer since I can see that the 6^-4 multiplier is used to convert from frequency to probability.

— SkySpiral7

Edit: @SkySpiral has had trouble getting the below formula to work. I currently don't have time to work out what the issue is, so if you're reading this it's best to proceed under the assumption it's incorrect.

I'm not sure about the general problem with varying numbers of dice, sides, and drops, but I think I can see an efficient algorithm for the drop-1 case. The qualifier is that I'm not completely sure that it's correct, but right now I can't see any flaws.

Let's start by not dropping any dice. Suppose $X_n$ represents the $n$ th die, and suppose $Y_n$ represents the sum of $n$ dice. Then

$$p(Y_n = a) = \sum_k p(Y_{n-1} = a - k)\,p(X_n=k)$$
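This keep-all recurrence is an ordinary convolution, and a minimal Python sketch of it might look like the following (the function name is hypothetical, and fair dice with faces $1$ to `sides` are assumed):

```python
from fractions import Fraction


def sum_distribution(num_dice, sides=6):
    """Return {total: probability} for the sum of num_dice fair dice."""
    die = {k: Fraction(1, sides) for k in range(1, sides + 1)}
    dist = {0: Fraction(1)}  # Y_0: the empty sum is 0 with certainty
    for _ in range(num_dice):
        nxt = {}
        for total, p in dist.items():
            for k, pk in die.items():
                # p(Y_n = total + k) accumulates p(Y_{n-1} = total) * p(X_n = k)
                nxt[total + k] = nxt.get(total + k, 0) + p * pk
        dist = nxt
    return dist


three_dice = sum_distribution(3)
print(three_dice[10])
```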

Now suppose $Z_n$ is the sum of $n$ dice when one die is dropped. Then

$$p(Z_n = a) = p(\text{$n$th die is the smallest})\,p(Y_{n-1} = a) + p(\text{$n$th die is not the smallest})\sum_k p(Z_{n-1} = a - k)\,p(X_n=k)$$

If we define $M_n$ to be the minimum of $n$ dice, then

$$p(Z_n = a) = p(X_n \leq M_{n-1})\,p(Y_{n-1} = a \mid X_n \leq M_{n-1}) + p(X_n > M_{n-1})\sum_k p(Z_{n-1} = a - k)\,p(X_n=k \mid X_n > M_{n-1})$$

and we can calculate $M_n$ using

$$p(M_n = a) = p(X_n \leq M_{n-1})\,p(X_n = a \mid X_n \leq M_{n-1}) + p(X_n > M_{n-1})\,p(M_{n-1} = a \mid X_n > M_{n-1})$$

Anyway, together this all suggests a dynamic programming algorithm based on $Y_n, Z_n$ and $M_n$ . Should be quadratic in $n$ .

edit: A comment has been raised on how to calculate $p(X_n \leq M_{n-1})$ . Since $X_n, M_{n-1}$ can each only take on one of six values, we can just sum over all possibilities:

$$p(X_n \leq M_{n-1}) = \sum_{a,b} p(X_n = a, M_{n-1} = b, a \leq b)$$

Similarly, $p(X_n = k | X_n > M_{n-1})$ can be calculated by applying Bayes' rule and then summing over the possible values of $X_n, M_{n-1}$.
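As an illustration only, and not the recurrences written above: one way to avoid conditioning on events such as $X_n \leq M_{n-1}$ is to carry the joint distribution of the running total and the running minimum through the dynamic program, and subtract the minimum at the end. A minimal Python sketch under that assumption (the function name is hypothetical):

```python
from collections import defaultdict
from fractions import Fraction


def drop_lowest_distribution(num_dice, sides=6):
    """Distribution of the sum of all dice except the single lowest one."""
    # state[(total, minimum)] = probability after the dice rolled so far
    state = {(k, k): Fraction(1, sides) for k in range(1, sides + 1)}
    for _ in range(num_dice - 1):
        nxt = defaultdict(Fraction)
        for (total, minimum), p in state.items():
            for k in range(1, sides + 1):
                nxt[(total + k, min(minimum, k))] += p / sides
        state = nxt
    result = defaultdict(Fraction)
    for (total, minimum), p in state.items():
        result[total - minimum] += p  # drop one copy of the minimum
    return dict(result)


print(drop_lowest_distribution(4)[14])
```

Each step touches at most `num_dice * sides * sides` states, so the work grows roughly quadratically in the number of dice for a fixed number of sides.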

— Andy Jones

+1 This looks correct and you said that it's quadratic. But it's been a few years since I took statistics (I'm primarily a programmer), so I'd like to fully understand this before marking it as the answer. Also, I see you have p(nth die is the smallest): does this include the case where the nth die is tied with the smallest, such as rolling all 3s?

— SkySpiral7

Good catch. If the $n$th die rolled is the same as the current minimum, we can regard that die as the one to be dropped. In which case the distribution is $Y_{n-1}$. I've swapped some $(<)$s for $(\leq)$s to reflect this.

— Andy Jones

Thank you. If I understand this correctly I think your formulas are the answer. However I don't know how to calculate p(X(n) > M(n-1)) (or the negation of it) or p(X(n)=k|X(n) > M(n-1)) so I can't use this answer yet. I'll mark this as the answer but I'd like more information. Can you edit your answer to explain these or should I post it as another question?

— SkySpiral7

Edited my answer.

— Andy Jones

Sorry I know it's been a year and a half but I've finally gotten around to implementing this formula into code. However the p(Z(n)=a) formula appears incorrect. Suppose 2 dice with 2 sides (drop lowest), what are the chances of the result being 1? The chance of X(n) being the smallest or tied is 3/4 and p(Y(n-1)=1) is 1/2 so that Z(n) returns at least 3/8 even though the correct answer is 1/4. The Z formula looks correct to me and I don't know how to fix it. So if it's not too much to ask: what do you think?

— SkySpiral7

I have a reasonably efficient algorithm for this that, on testing, seems to match results of pure brute force while relying less heavily on enumerating all possibilities. It's actually more generalized than the above problem of 4d6, drop 1.

Some notation first: Let $X_NdY$ indicate that you are rolling $X$ dice with $Y$ faces (integer values $1$ to $Y$ ), and considering only the highest $N$ dice rolled. The output is a sequence of dice values, e.g. $4_3d6$ yields $3, 4, 5$ if you rolled $1, 3, 4, 5$ on the four dice. (Note that I'm calling it a "sequence," but the order is not important here, particularly since all we care about in the end is the sum of the sequence.)

The probability $P(X_NdY = S)$ (or more specifically, $P(4_3d6 = S)$ ) is a simplified version of the original problem, where we are only considering a specific set of dice, and not all possible sets that add up to a given sum.

Suppose $S$ has $k+1$ distinct values, $s_0, s_1, \ldots, s_k$, such that $s_i > s_{i+1}$, and each $s_i$ has a count of $c_i$. For example, if $S = 3, 4, 4, 5$, then $(s_0,c_0) = (5,1)$, $(s_1,c_1) = (4,2)$, and $(s_2,c_2) = (3,1)$.

You can calculate $P(X_NdY = S)$ in the following way:

$$P(X_NdY = S) = \frac{ \left( \prod_{i=0}^{k-1} \binom{X - \sum_{h=0}^{i-1} c_h}{c_i} \right) \left( \sum_{j=0}^{X-N} \binom{c_k+X-N}{c_k+X-N-j} (s_k-1)^j \right)}{ Y^X }$$

That's pretty messy, I know.

The product expression $\prod_{i=0}^{k-1}$ is iterating through all but the lowest of the values in $S$, and calculating all the ways those values may be distributed among the dice. For $s_0$, that's just $X \choose c_0$, but for $s_1$, we have to remove the $c_0$ dice that have already been set aside for $s_0$, and likewise for $s_i$ you must remove $\sum_{h=0}^{i-1}c_h$.

The sum expression $\sum_{j=0}^{X-N}$ is iterating through all the possibilities of how many of the dropped dice were equal to $s_k$ , since that affects the possible combinations for the un-dropped dice with $s_k$ as their value.

By example, let's consider $P[4_3d6=(5,4,4)]$, for which $k=1$:

$$(s_0, c_0) = (5, 1)$$

$$(s_1, c_1) = (4, 2)$$

So using the formula above:

$$P[4_3d6=(5,4,4)] = \frac{ \binom{4}{1} \left( \binom{3}{3} \cdot 3^0 + \binom{3}{2} \cdot 3^1 \right) }{ 6^4 } = \frac{5}{162} = 0.0\overline{308641975}$$

The formula breaks down on a domain issue when $s_k=1$ and $j=0$ in the summation, leading to a first term of $0^0$ , which is indeterminate and needs to be treated as $1$ . In such a case, a summation is not actually necessary at all, and can be omitted, since all the dropped dice will also have a value of $s_k = 1$ .

Now here's where I do need to rely on some brute force. The original problem was to calculate the probability of the sum being some value, and $X_NdY$ represents the individual dice left after dropping. This means you must add up the probabilities for all possible sequences $S$ (ignoring ordering) whose sum is the given value. Perhaps there is a formula to calculate this across all such values of $S$ at once, but I haven't even tried broaching that yet.
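A sketch of how the formula and the summation over sequences might be coded is given below. It is not the author's Python implementation, which is not shown here; the function names are hypothetical, and the multisets $S$ are enumerated with `itertools.combinations_with_replacement`. Python evaluates `0 ** 0` as `1`, which matches the convention needed when $s_k = 1$.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations_with_replacement
from math import comb


def kept_multiset_count(kept, num_dice):
    """Number of ordered rolls whose highest len(kept) dice form the multiset kept."""
    counts = sorted(Counter(kept).items(), reverse=True)  # (value, count), high to low
    dropped = num_dice - len(kept)
    ways, remaining = 1, num_dice
    for _, count in counts[:-1]:  # every distinct value except the lowest kept one
        ways *= comb(remaining, count)
        remaining -= count
    lowest = counts[-1][0]
    # remaining == c_k + X - N here; j of those dice fall strictly below the lowest kept value
    tail = sum(comb(remaining, j) * (lowest - 1) ** j for j in range(dropped + 1))
    return ways * tail


def keep_highest_frequencies(num_dice=4, num_kept=3, sides=6):
    """Add up the per-multiset counts to get the frequency of every total."""
    freq = Counter()
    for kept in combinations_with_replacement(range(1, sides + 1), num_kept):
        freq[sum(kept)] += kept_multiset_count(kept, num_dice)
    return dict(sorted(freq.items()))


print(Fraction(kept_multiset_count((5, 4, 4), 4), 6 ** 4))
print(keep_highest_frequencies())
```

The outer loop runs over multisets of kept dice rather than over all ordered rolls, which is where the saving over pure brute force comes from.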

I've implemented this in Python first, and the above is an attempt to express it mathematically. My Python algorithm is accurate and reasonably efficient. There are some optimizations that could be made for the case of calculating the entire distribution of $\sum X_NdY$ , and maybe I'll do that later.

— Riley John Gibbs

As a programmer it might be easier for me to understand your Python code (although I've never used Python so it might be the same). Posting the code here is off topic but you could post a link to github etc.

— SkySpiral7

Your answer may be correct and it seems to reduce the complexity from O(Y^X) to O((Y+X-1)!/(X!*(Y-1)!)) but it still isn't as efficient as whuber's answer of O(c*X*log(X)). Thanks for your answer though +1.

— SkySpiral7