Github user srowen commented on a diff in the pull request:
https://github.com/apache/spark/pull/22090#discussion_r211133578
--- Diff: docs/mllib-evaluation-metrics.md ---
@@ -462,13 +462,13 @@ $$rel_D(r) = \begin{cases}1 & \text{if $r \in D$}, \\
0 & \text{otherwise}.\end{
<td>Normalized Discounted Cumulative Gain</td>
<td>
$NDCG(k)=\frac{1}{M} \sum_{i=0}^{M-1} {\frac{1}{IDCG(D_i,
k)}\sum_{j=0}^{n-1}
- \frac{rel_{D_i}(R_i(j))}{\text{ln}(j+1)}} \\
+ \frac{rel_{D_i}(R_i(j))}{\text{ln}(j+2)}} \\
\text{Where} \\
\hspace{5 mm} n =
\text{min}\left(\text{max}\left(|R_i|,|D_i|\right),k\right) \\
- \hspace{5 mm} IDCG(D, k) = \sum_{j=0}^{\text{min}(\left|D\right|,
k) - 1} \frac{1}{\text{ln}(j+1)}$
+ \hspace{5 mm} IDCG(D, k) = \sum_{j=0}^{\text{min}(\left|D\right|,
k)} \frac{1}{\text{ln}(j+2)}$
--- End diff --
@yueguoguo I think the "- 1" in the upper bounds of the sum needs to be
restored here?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]