Follow-up:
If we want to get at the document vectors after training, my reading of the code here:

    https://github.com/piskvorky/gensim/blob/develop/gensim/models/doc2vec.py#L254

suggests that you want the model's 'docvecs' attribute. We know it's a DocvecsArray because that is what gets assigned to it in the model here:

    https://github.com/piskvorky/gensim/blob/develop/gensim/models/doc2vec.py#L569

Given that, we should be able to print the first vector in the trained model like this:

    print(model.docvecs[0])

More generally, we should be able to do something like:

    for index in range(len(model.docvecs)):
        print(model.docvecs[index])

to get at the vectors for all the trained documents.

That being said, I have not executed any of this code on my machine. I'm only going by reading the source, so I might be misinterpreting something. Hence the suggestion to talk to folks who have actually used the library. :P
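To make the looping idea concrete without needing gensim installed, here is a minimal sketch. It assumes only what the reading above suggests: that the docvecs object supports len() and integer indexing. FakeDocvecs and iter_doc_vectors are hypothetical names I made up for illustration; neither is part of gensim.

```python
# Hypothetical stand-in for model.docvecs so the snippet runs without gensim.
# Assumption: the real object supports len() and integer indexing.
class FakeDocvecs:
    def __init__(self, vectors):
        self._vectors = list(vectors)

    def __len__(self):
        return len(self._vectors)

    def __getitem__(self, index):
        return self._vectors[index]


def iter_doc_vectors(docvecs):
    """Yield (index, vector) pairs from anything with len() and integer indexing."""
    for index in range(len(docvecs)):
        yield index, docvecs[index]


# Made-up two-dimensional vectors, purely for demonstration.
docvecs = FakeDocvecs([[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]])
for index, vector in iter_doc_vectors(docvecs):
    print(index, vector)
```

With a trained model, the same helper would presumably be called as iter_doc_vectors(model.docvecs), though I have not run that against gensim either.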