[ https://issues.apache.org/jira/browse/MADLIB-934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15089836#comment-15089836 ]
Frank McQuillan commented on MADLIB-934:
----------------------------------------
Closing as "won't fix", since INT4 is used by design for memory management reasons.
> MADlib LDA should allow users to supply names of input columns
> --------------------------------------------------------------
>
> Key: MADLIB-934
> URL: https://issues.apache.org/jira/browse/MADLIB-934
> Project: Apache MADlib
> Issue Type: Bug
> Components: Module: Parallel Latent Dirichlet Allocation
> Reporter: Srivatsan
> Assignee: Rahul Iyer
> Fix For: v1.9
>
>
> I attempted to create my own input table for LDA. It had 4 columns: "docid",
> "wordid" and "count", plus a fourth column "word" (corresponding to the raw
> token). Of these, the type of the "count" column was bigint and not int. The
> lda_train function threw an error saying the input table did not contain
> docid, wordid and count columns. I am not sure what prompted the error: I did
> not check whether it was caused by the data type mismatch of the count column
> or by the additional column. Can you confirm which one it is?
> If it is just a bigint vs int issue, can we allow users to supply the names
> of the docid, wordid and count columns (instead of hard-coding them)?
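
For readers who hit the same error, here is a minimal sketch of one possible workaround. It assumes the failure comes from either the bigint "count" column or the extra "word" column; the report does not confirm which. The helper generates SQL that copies only the three expected columns into a new table and casts "count" to INT4. The helper names and the table names (my_docs, my_docs_lda) are hypothetical and are not part of MADlib.

```python
# Hypothetical helpers for illustration only; nothing here is a MADlib API.
# Identifiers are interpolated directly, so pass only trusted table/column names.

INT4_MIN = -2**31
INT4_MAX = 2**31 - 1


def fits_int4(value):
    """Return True if an integer can be stored in a 4-byte signed INT4 column."""
    return INT4_MIN <= value <= INT4_MAX


def build_lda_input_sql(source_table, target_table,
                        docid_col="docid", wordid_col="wordid",
                        count_col="count"):
    """Build SQL that keeps only docid, wordid and count, casting count to INT4.

    Columns with other names are renamed to the names the report says
    lda_train expects; any additional columns (such as "word") are dropped.
    """
    return (
        f"CREATE TABLE {target_table} AS\n"
        f"SELECT {docid_col} AS docid,\n"
        f"       {wordid_col} AS wordid,\n"
        f"       {count_col}::INT4 AS count\n"
        f"FROM {source_table};"
    )


if __name__ == "__main__":
    # Print the statement so it can be reviewed before running it in the database.
    print(build_lda_input_sql("my_docs", "my_docs_lda"))
```

The cast fails in the database if any count exceeds the INT4 range, so checking the maximum count beforehand (for example with fits_int4 on the result of a MAX query) may be worthwhile.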