I am trying to extract text from various files using tikaOnDotnet . I have 
referred to the link 

http://www.dovetailsoftware.com/blogs/kmiller/archive/2010/07/02/using-the-
tika-java-library-in-your-net-application-with-ikvm

for guidance. I am storing the parsed content of a file "pdffile.pdf" in 
another file "test123.txt" .This is the code : 

 
public void TestMethod1()
        {
            TextExtractor _cut = new TextExtractor();
            var textExtractionResult = _cut.Extract(@"D:\Tika.pdf");
                      
            System.IO.StreamWriter file = new 
System.IO.StreamWriter("c:\\test123.txt");  
            
            file.WriteLine(textExtractionResult);
            file.Close();
        }

But the problem is that I am not getting the text of the file and the metadata  
in the test123.txt file . Only the Content-Length, Content-Type and 
resourcename are shown whereas the text and metadat are missing.
Plz help me rectify the problem

Reply via email to