[ 
https://issues.apache.org/jira/browse/GORA-24?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13546402#comment-13546402
 ] 

Lewis John McGibbney commented on GORA-24:
------------------------------------------

As the gora-sql module is now deprecated (due to licensing issues).
Please correct but my outlook on this one is as follows
- write support for MEDIUMBLOB into new gora-sql module
- accompany this with better error handling/message logging and additionally 
some additional guidance in the gora-sql-mapping.xml file

There is little we can do about this in Gora until the gora-sql module is 
written, therefore any problems which are experienced using gora-sql with Nutch 
2.x (or any other client applications for that matter) will need to be 
addressed at that level not within Gora.
                
> Throwing EOFException with MEDIUMBLOB type for inlinks column
> -------------------------------------------------------------
>
>                 Key: GORA-24
>                 URL: https://issues.apache.org/jira/browse/GORA-24
>             Project: Apache Gora
>          Issue Type: Bug
>          Components: storage-sql
>         Environment: MySQL
>            Reporter: Alexis
>             Fix For: 0.4
>
>
> I had an exception with DbUpdaterJob complaining that inlinks column of type 
> BLOB in webpage table was not big enough to store all the incoming links. So 
> I changed the column definition in gora-sql-mapping.xml from BLOB to 
> MEDIUMBLOB:
>     <field name="inlinks" column="inlinks" jdbc-type="MEDIUMBLOB"/>
> Now I systematically get an exception in the update step:
> java.io.IOException: java.sql.BatchUpdateException: Error reading from 
> InputStream java.io.EOFException
>       at org.apache.gora.sql.store.SqlStore.flush(SqlStore.java:341)
>       at org.apache.gora.sql.store.SqlStore.close(SqlStore.java:185)
>       at 
> org.apache.gora.mapreduce.GoraRecordWriter.close(GoraRecordWriter.java:55)
>       at 
> org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:567)
>       at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:408)
>       at 
> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
> Caused by: java.sql.BatchUpdateException: Error reading from InputStream 
> java.io.EOFException
>       at 
> com.mysql.jdbc.PreparedStatement.executeBatchSerially(PreparedStatement.java:2020)
>       at 
> com.mysql.jdbc.PreparedStatement.executeBatch(PreparedStatement.java:1451)
>       at org.apache.gora.sql.store.SqlStore.flush(SqlStore.java:329)
>       ... 5 more

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to