yes, but if I use TikaEntityProcessor the result of my full-import is

<str name="Total Requests made to DataSource">0</str>
 <str name="Total Rows Fetched">1</str>

<str name="Total Documents Skipped">0</str>

<str name="">Indexing failed. Rolled back all changes.</str>




2012/2/16 alessio crisantemi <alessio.crisant...@gmail.com>

> Hi all,
> I have a problem to configure a pdf indexing from a directory in my solr
> wit DIH:
>
> with this data-config
>
>
> <dataConfig>
>  <dataSource type="BinFileDataSource" />
>  <document>
>   <entity
>     name="tika-test"
>     processor="FileListEntityProcessor"
>     baseDir="D:\gioconews_archivio\marzo2011"
>     fileName=".*pdf"
>     recursive="true"
>     rootEntity="false"
>     dataSource="null"/>
>   <entity processor="FileListEntityProcessor"
> url="D:\gioconews_archivio\marzo2011" format="text" >
>    <field column="author"  name="author" meta="true"/>
>    <field column="title" name="title" meta="true"/>
>      <field column="description" name="description" />
>      <field column="comments" name="comments" />
>
>      <field column="content_type" name="content_type" />
>      <field column="last_modified" name="last_modified" />
>   </entity>
>  </document>
> </dataConfig>
>
> I obtain this result:
>
>
>
>   <str name="command">full-import</str>
>
>   <str name="status">idle</str>
>
>   <str name="importResponse" />
>
> - <lst name="statusMessages">
>
>   <str name="Time Elapsed">0:0:2.44</str>
>
>   <str name="Total Requests made to DataSource">0</str>
>
>   <str name="Total Rows Fetched">43</str>
>
>   <str name="Total Documents Skipped">0</str>
>
>   <str name="Full Dump Started">2012-02-12 19:06:00</str>
>
>   <str name="">Indexing failed. Rolled back all changes.</str>
>
>   <str name="Rolledback">2012-02-12 19:06:00</str>
>   </lst>
>
>
> suggestions?
> thank you
> alessio
>

Reply via email to