[
https://issues.apache.org/jira/browse/NUTCH-2124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2124:
---
Priority: Blocker (was: Major)
> redirect following same link again and again , max redirect
[
https://issues.apache.org/jira/browse/NUTCH-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14899934#comment-14899934
]
Sebastian Nagel commented on NUTCH-2110:
Hi Asitang, the Injector is already able to store
[
https://issues.apache.org/jira/browse/NUTCH-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903524#comment-14903524
]
Sebastian Nagel commented on NUTCH-2110:
Ok, understood. One point to consider: shall all
[
https://issues.apache.org/jira/browse/NUTCH-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14847281#comment-14847281
]
Sebastian Nagel commented on NUTCH-2106:
Avoiding conflicting dependencies is the reason for the
[
https://issues.apache.org/jira/browse/NUTCH-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2106:
--
Assignee: Sebastian Nagel
> Runtime to contain Selenium and dependencies only once
>
[
https://issues.apache.org/jira/browse/NUTCH-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2106.
Resolution: Fixed
Committed to trunk, r1704425. Thanks, Lewis!
> Runtime to contain
[
https://issues.apache.org/jira/browse/NUTCH-2124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943609#comment-14943609
]
Sebastian Nagel commented on NUTCH-2124:
I've tested the patch with the mentioned URL as only seed
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943637#comment-14943637
]
Sebastian Nagel commented on NUTCH-2132:
No question, this is a significant improvement over
[
https://issues.apache.org/jira/browse/NUTCH-2179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15034511#comment-15034511
]
Sebastian Nagel commented on NUTCH-2179:
+1: SolrIndexWriter should queue the deletions the same
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2172:
---
Attachment: NUTCH-2172-1.patch
Patch to add a template for conf/contenttype-mapping.txt
[
https://issues.apache.org/jira/browse/NUTCH-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2107:
--
Assignee: Sebastian Nagel
> plugin.xml to validate against plugin.dtd
>
[
https://issues.apache.org/jira/browse/NUTCH-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2107.
Resolution: Fixed
Fix Version/s: (was: 1.12)
(was: 2.4)
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2172:
---
Component/s: indexer
> index-more: document format of contenttype-mapping.txt
>
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2172:
--
Assignee: Sebastian Nagel
> Parsing whitespace not just tabs in
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2172:
---
Fix Version/s: 1.12
> Parsing whitespace not just tabs in contenttype-mapping.txt
>
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2172:
---
Issue Type: Improvement (was: Bug)
> Parsing whitespace not just tabs in
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2172:
---
Summary: index-more: document format of contenttype-mapping.txt (was:
Parsing whitespace not
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2172.
Resolution: Fixed
Committed to trunk, r1718223.
Thanks, [~nicola.tonellotto]! Although
[
https://issues.apache.org/jira/browse/NUTCH-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15047481#comment-15047481
]
Sebastian Nagel commented on NUTCH-2076:
After a second look: the problem is the return statement
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2172:
---
Attachment: NUTCH-2172-2.patch
It is about MIME types which are already normalized either by
[
https://issues.apache.org/jira/browse/NUTCH-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15034352#comment-15034352
]
Sebastian Nagel commented on NUTCH-2172:
This could be an improvement if we assume that MIME types
[
https://issues.apache.org/jira/browse/NUTCH-2193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2193:
---
Attachment: NUTCH-2193.patch
> Upgrade feed parser plugin to use rome 1.5
>
Sebastian Nagel created NUTCH-2193:
--
Summary: Upgrade feed parser plugin to use rome 1.5
Key: NUTCH-2193
URL: https://issues.apache.org/jira/browse/NUTCH-2193
Project: Nutch
Issue Type:
[
https://issues.apache.org/jira/browse/NUTCH-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15085327#comment-15085327
]
Sebastian Nagel commented on NUTCH-2143:
Excellent! Please, attach a
[
https://issues.apache.org/jira/browse/NUTCH-2168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15083285#comment-15083285
]
Sebastian Nagel commented on NUTCH-2168:
Hi [~kalanya], looks like the indexed raw content of the
[
https://issues.apache.org/jira/browse/NUTCH-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15083603#comment-15083603
]
Sebastian Nagel commented on NUTCH-2191:
As [~haraldk] mentioned in [this
[
https://issues.apache.org/jira/browse/NUTCH-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2143:
---
Attachment: NUTCH-2143-v3.patch
Ok, with the patch applied the unit testFetch() fails because
[
https://issues.apache.org/jira/browse/NUTCH-2168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2168.
Resolution: Fixed
Committed to 2.x, r1723851. Opened NUTCH-2198 to track the problem when
Sebastian Nagel created NUTCH-2198:
--
Summary: Indexing binary content by index-html causes Solr
Exception
Key: NUTCH-2198
URL: https://issues.apache.org/jira/browse/NUTCH-2198
Project: Nutch
[
https://issues.apache.org/jira/browse/NUTCH-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15090625#comment-15090625
]
Sebastian Nagel commented on NUTCH-2198:
Tried to reproduce the Solr exception by indexing on of
[
https://issues.apache.org/jira/browse/NUTCH-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2198:
---
Description:
(reported by [~kalanya] in NUTCH-2168)
If raw binary is indexed using the plugin
[
https://issues.apache.org/jira/browse/NUTCH-2169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2169.
Resolution: Fixed
Assignee: Sebastian Nagel
Committed to 2.x, r1723794.
> Integrate
[
https://issues.apache.org/jira/browse/NUTCH-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2143.
Resolution: Fixed
Committed to 2.x, r1723626. Thanks!
> GeneratorJob ignores batch id
[
https://issues.apache.org/jira/browse/NUTCH-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15068723#comment-15068723
]
Sebastian Nagel commented on NUTCH-2189:
+1 makes the urlfilter-domain more robust, patch looks
[
https://issues.apache.org/jira/browse/NUTCH-2065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15068757#comment-15068757
]
Sebastian Nagel commented on NUTCH-2065:
* in general: wouldn't a URL normalizer be preferable? If
[
https://issues.apache.org/jira/browse/NUTCH-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071661#comment-15071661
]
Sebastian Nagel commented on NUTCH-2189:
Yes, you're right!
> Domain filter must deactivate if no
[
https://issues.apache.org/jira/browse/NUTCH-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071662#comment-15071662
]
Sebastian Nagel commented on NUTCH-2189:
Yes, you're right!
> Domain filter must deactivate if no
[
https://issues.apache.org/jira/browse/NUTCH-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15023131#comment-15023131
]
Sebastian Nagel edited comment on NUTCH-2158 at 11/26/15 7:28 AM:
--
Patch
[
https://issues.apache.org/jira/browse/NUTCH-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2175:
--
Assignee: Sebastian Nagel
> Typos in property descriptions in nutch-default.xml
>
[
https://issues.apache.org/jira/browse/NUTCH-2177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15032108#comment-15032108
]
Sebastian Nagel commented on NUTCH-2177:
Rely on {{mapred.job.tracker}}, cf.
[
https://issues.apache.org/jira/browse/NUTCH-2177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15033595#comment-15033595
]
Sebastian Nagel commented on NUTCH-2177:
Yes, of course, I was just unable to copy-paste the right
[
https://issues.apache.org/jira/browse/NUTCH-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2158.
Resolution: Fixed
Thanks! Committed to trunk, r1716573.
> Upgrade to Tika 1.11
>
[
https://issues.apache.org/jira/browse/NUTCH-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15023123#comment-15023123
]
Sebastian Nagel commented on NUTCH-2158:
We need to the pass the rendered HTML, returned by the
[
https://issues.apache.org/jira/browse/NUTCH-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2158:
---
Attachment: NUTCH-2158-test-protocol-http.patch
Patch to adjust tests of protocol-http:
-
[
https://issues.apache.org/jira/browse/NUTCH-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2175:
---
Issue Type: Improvement (was: Bug)
> Typos in property descriptions in nutch-default.xml
>
[
https://issues.apache.org/jira/browse/NUTCH-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2175.
Resolution: Fixed
And a spell checker detected some more obvious misspellings...
Committed
[
https://issues.apache.org/jira/browse/NUTCH-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2175:
---
Summary: Typos in property descriptions in nutch-default.xml (was:
Misspelling at word
[
https://issues.apache.org/jira/browse/NUTCH-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-1712 started by Sebastian Nagel.
--
> Use MultipleInputs in Injector to make it a single mapreduce job
>
[
https://issues.apache.org/jira/browse/NUTCH-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15092924#comment-15092924
]
Sebastian Nagel commented on NUTCH-1712:
The merging is done together with minor improvements
[
https://issues.apache.org/jira/browse/NUTCH-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15331434#comment-15331434
]
Sebastian Nagel commented on NUTCH-2272:
Not included in [1.12 release
[
https://issues.apache.org/jira/browse/NUTCH-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328204#comment-15328204
]
Sebastian Nagel commented on NUTCH-827:
---
Hi [~stevegy], would you mind to open a new Jira for this
Sebastian Nagel created NUTCH-2281:
--
Summary: Support non-default FileSystem
Key: NUTCH-2281
URL: https://issues.apache.org/jira/browse/NUTCH-2281
Project: Nutch
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/NUTCH-2281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341680#comment-15341680
]
Sebastian Nagel commented on NUTCH-2281:
I tried to fix all tools but haven't tested all of them
Sebastian Nagel created NUTCH-2286:
--
Summary: CrawlDbReader -stats fetch time and interval
Key: NUTCH-2286
URL: https://issues.apache.org/jira/browse/NUTCH-2286
Project: Nutch
Issue Type:
[
https://issues.apache.org/jira/browse/NUTCH-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2272:
---
Fix Version/s: (was: 1.12)
1.13
> Index checker server to optionally
[
https://issues.apache.org/jira/browse/NUTCH-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2286:
---
Summary: CrawlDbReader -stats to show fetch time and interval (was:
CrawlDbReader -stats
[
https://issues.apache.org/jira/browse/NUTCH-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346585#comment-15346585
]
Sebastian Nagel commented on NUTCH-2272:
Not included in released 1.12: removed from CHANGES.txt,
[
https://issues.apache.org/jira/browse/NUTCH-2269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15351824#comment-15351824
]
Sebastian Nagel commented on NUTCH-2269:
Thanks for reporting the problems. Afaics, they can be
[
https://issues.apache.org/jira/browse/NUTCH-2269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2269:
---
Comment: was deleted
(was: The message
{noformat}
WARN output.FileOutputCommitter - Output
[
https://issues.apache.org/jira/browse/NUTCH-2269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2269:
---
Comment: was deleted
(was: The message
{noformat}
WARN output.FileOutputCommitter - Output
[
https://issues.apache.org/jira/browse/NUTCH-2269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2269:
---
Comment: was deleted
(was: The message
{noformat}
WARN output.FileOutputCommitter - Output
[
https://issues.apache.org/jira/browse/NUTCH-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1314:
---
Fix Version/s: 1.12
> Impose a limit on the length of outlink target urls
>
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157655#comment-15157655
]
Sebastian Nagel commented on NUTCH-2228:
The name of the failing test "testInvalidPatterns"
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2228:
---
Attachment: NUTCH-2228.patch
> index-replace unit test fails
> -
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157655#comment-15157655
]
Sebastian Nagel edited comment on NUTCH-2228 at 2/22/16 8:38 PM:
-
The name
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2228:
---
Patch Info: Patch Available
> index-replace unit test fails
> -
>
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157632#comment-15157632
]
Sebastian Nagel commented on NUTCH-2228:
That's only a problem if Nutch is built with Java 8.
[
https://issues.apache.org/jira/browse/NUTCH-2220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157831#comment-15157831
]
Sebastian Nagel commented on NUTCH-2220:
0 / +1
Since this breaks existing crawl configurations: a
[
https://issues.apache.org/jira/browse/NUTCH-2221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157816#comment-15157816
]
Sebastian Nagel commented on NUTCH-2221:
+1
Just to consider: the additional argument to
[
https://issues.apache.org/jira/browse/NUTCH-2216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1515#comment-1515
]
Sebastian Nagel commented on NUTCH-2216:
* this was the case before, but shouldn't
[
https://issues.apache.org/jira/browse/NUTCH-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1712.
Resolution: Fixed
Fix Version/s: 1.12
Committed to trunk (f5e430e).
> Use
[
https://issues.apache.org/jira/browse/NUTCH-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2204:
---
Attachment: NUTCH-2204.patch
> remove junit lib from runtime
> -
Sebastian Nagel created NUTCH-2204:
--
Summary: remove junit lib from runtime
Key: NUTCH-2204
URL: https://issues.apache.org/jira/browse/NUTCH-2204
Project: Nutch
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/NUTCH-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2204:
---
Summary: Remove junit lib from runtime (was: remove junit lib from runtime)
> Remove junit
[
https://issues.apache.org/jira/browse/NUTCH-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2204.
Resolution: Fixed
Committed to trunk, r1726318.
> remove junit lib from runtime
>
[
https://issues.apache.org/jira/browse/NUTCH-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15146685#comment-15146685
]
Sebastian Nagel commented on NUTCH-2144:
Hi [~thammegowda],
thanks! Everything looks good with the
[
https://issues.apache.org/jira/browse/NUTCH-2060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174628#comment-15174628
]
Sebastian Nagel commented on NUTCH-2060:
Afaics from the mentioned thread on the user mailing
[
https://issues.apache.org/jira/browse/NUTCH-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15210136#comment-15210136
]
Sebastian Nagel commented on NUTCH-2242:
Hi Jurian, thanks for reporting this problem. This is
[
https://issues.apache.org/jira/browse/NUTCH-2237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15178587#comment-15178587
]
Sebastian Nagel commented on NUTCH-2237:
Good idea! Nice patch, including unit tests. A few
[
https://issues.apache.org/jira/browse/NUTCH-2237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2237:
---
Fix Version/s: 1.12
> DeduplicationJob: Add extra order criteria based on slug
>
[
https://issues.apache.org/jira/browse/NUTCH-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2256:
--
Assignee: Sebastian Nagel
> Inconsistent log level practice
>
[
https://issues.apache.org/jira/browse/NUTCH-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264274#comment-15264274
]
Sebastian Nagel commented on NUTCH-2256:
Good catch, will fix right now. Thanks, [~songwang]!
>
[
https://issues.apache.org/jira/browse/NUTCH-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2256:
---
Fix Version/s: 2.3.2
1.12
2.4
> Inconsistent log level
[
https://issues.apache.org/jira/browse/NUTCH-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2256:
---
Affects Version/s: 1.11
> Inconsistent log level practice
> ---
>
[
https://issues.apache.org/jira/browse/NUTCH-2254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2254.
Resolution: Fixed
Committed, r6d2bfa9. Thanks, [~fedechicco]!
> Charset issues when using
[
https://issues.apache.org/jira/browse/NUTCH-2254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15256225#comment-15256225
]
Sebastian Nagel commented on NUTCH-2254:
Hi [~fedechicco], the patch should work. Thanks!
I'll add
[
https://issues.apache.org/jira/browse/NUTCH-2254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2254:
--
Assignee: Sebastian Nagel
> Charset issues when using -addBinaryContent and -base64
[
https://issues.apache.org/jira/browse/NUTCH-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2256.
Resolution: Fixed
Fix Version/s: (was: 2.3.2)
Fixed and committed to 1.x
[
https://issues.apache.org/jira/browse/NUTCH-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel closed NUTCH-2256.
--
Also did a grep on all Java files for errors of the same kind - nothing found.
Thanks,
[
https://issues.apache.org/jira/browse/NUTCH-2164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2164:
---
Fix Version/s: 1.13
> Inconsistent 'Modified Time' in crawl db
>
[
https://issues.apache.org/jira/browse/NUTCH-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15291591#comment-15291591
]
Sebastian Nagel commented on NUTCH-1858:
It's hardly a work for a single person. First steps could
[
https://issues.apache.org/jira/browse/NUTCH-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reopened NUTCH-2252:
Tests fail to compile
[[1|https://builds.apache.org/job/Nutch-trunk/3365/console]]:
{noformat}
[
https://issues.apache.org/jira/browse/NUTCH-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280076#comment-15280076
]
Sebastian Nagel commented on NUTCH-2242:
Opened pull request
[
https://issues.apache.org/jira/browse/NUTCH-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15279942#comment-15279942
]
Sebastian Nagel commented on NUTCH-2242:
[~markus17]: Sorry, I didn't upload a final patch, simply
[
https://issues.apache.org/jira/browse/NUTCH-1785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15250812#comment-15250812
]
Sebastian Nagel commented on NUTCH-1785:
The class o.a.n.indexer.NutchField supports only a couple
[
https://issues.apache.org/jira/browse/NUTCH-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2191.
Resolution: Fixed
Merged pull request #105. Build should succeed now. Thanks,
[
https://issues.apache.org/jira/browse/NUTCH-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reopened NUTCH-2191:
Build fails because protocol-htmlunit's build.xml claims to have unit tests but
there aren't
[
https://issues.apache.org/jira/browse/NUTCH-2297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15411716#comment-15411716
]
Sebastian Nagel commented on NUTCH-2297:
The wrong values are already in the temporary output of
Sebastian Nagel created NUTCH-2297:
--
Summary: CrawlDbReader -stats wrong values for earliest fetch time
and shortest interval
Key: NUTCH-2297
URL: https://issues.apache.org/jira/browse/NUTCH-2297
Sebastian Nagel created NUTCH-2291:
--
Summary: Fix mrunit dependencies
Key: NUTCH-2291
URL: https://issues.apache.org/jira/browse/NUTCH-2291
Project: Nutch
Issue Type: Bug
801 - 900 of 3271 matches
Mail list logo