I do this exact thing. "text" (the second argument to the Field constructor) is the text that I've extracted from the MS Word document:

textField = new org.apache.lucene.document.Field(textFieldName, text,
        org.apache.lucene.document.Field.Store.NO,
        org.apache.lucene.document.Field.Index.TOKENIZED);
doc.add(textField);

Donna L. Gresh
Services Research, Mathematical Sciences Department
IBM T.J. Watson Research Center
(914) 945-2472
http://www.research.ibm.com/people/g/donnagresh
[EMAIL PROTECTED]