HI again,

 

I got Nutch 2.2.1 configured with MySQL and running nicely individual
commands but whe running the Crawl script it crashes. I seems that it tries
to create a new schema in Mysql and it fails, it seems it would be an easy
fix, where does Nutch get the MySql schema info from? gora-sql-mapping.xml?
If so whats the proper syntax to specify Column type?

 

Thnx

 

bin/crawl urls TEST http://127.0.0.1:8888/solr/collection1/ 3

 

2013-09-25 16:58:23,632 INFO  crawl.InjectorJob - InjectorJob: starting at
2013-09-25 16:58:23

2013-09-25 16:58:23,633 INFO  crawl.InjectorJob - InjectorJob: Injecting
urlDir: urls

2013-09-25 16:58:25,526 INFO  store.SqlStore - creating schema: TEST_webpage

2013-09-25 16:58:25,632 ERROR crawl.InjectorJob - InjectorJob:
org.apache.gora.util.GoraException: java.io.IOException:
com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: Column length too
big for column 'text' (max = 16383); use BLOB or TEXT instead

            at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java
:167)

            at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java
:135)

            at
org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:75)

            at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221)

            at
org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)

            at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)

            at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)

            at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)

Caused by: java.io.IOException:
com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: Column length too
big for column 'text' (max = 16383); use BLOB or TEXT instead

            at
org.apache.gora.sql.store.SqlStore.createSchema(SqlStore.java:226)

            at
org.apache.gora.sql.store.SqlStore.initialize(SqlStore.java:172)

            at
org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.
java:102)

            at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java
:161)

            ... 7 more

Caused by: com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: Column
length too big for column 'text' (max = 16383); use BLOB or TEXT instead

            at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native
Method)

            at
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAcces
sorImpl.java:57)

            at
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstruc
torAccessorImpl.java:45)

            at
java.lang.reflect.Constructor.newInstance(Constructor.java:526)

            at com.mysql.jdbc.Util.handleNewInstance(Util.java:411)

            at com.mysql.jdbc.Util.getInstance(Util.java:386)

            at
com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1052)

            at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3609)

            at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3541)

            at com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:2002)

            at com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2163)

            at
com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2624)

            at
com.mysql.jdbc.PreparedStatement.executeInternal(PreparedStatement.java:2127
)

            at
com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2427)

            at
com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2345)

            at
com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2330)

            at
org.apache.gora.sql.store.SqlStore.createSchema(SqlStore.java:224)

            ... 10 more

Reply via email to