Re: OOM problem

2014-02-10 Thread Ken Krugler
; } - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions training

Re: Client stuck

2013-09-25 Thread Ken Krugler
interesting. -- Ken -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions training Hadoop, Cascading, Cassandra Solr

Re: [POLL] Minimal JRE level as of HttpClient 4.4

2013-09-16 Thread Ken Krugler
to upgrade. [X] upgrade to Java 1.6: one step at a time. [ ] upgrade to Java 1.7: new features are more important. -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions training Hadoop, Cascading, Cassandra Solr

Re: HttpClient performance with multiple threads; Re: AbstractNIOConnPool memory leak?

2013-01-07 Thread Ken Krugler
as a large-scale crawl. -- Ken On Jan 7, 2013, at 2:39am, Oleg Kalnichevski wrote: On Sun, 2013-01-06 at 15:48 -0800, Ken Krugler wrote: Hi Oleg, [snip] Ken, You might want to have a look at the lest code in SVN trunk (to be released as 4.3). Several classes such as the scheme registry

Re: HttpClient performance with multiple threads; Re: AbstractNIOConnPool memory leak?

2013-01-06 Thread Ken Krugler
-- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions training Hadoop, Cascading, Cassandra Solr

Re: AbstractNIOConnPool memory leak?

2013-01-05 Thread Ken Krugler
On Jan 5, 2013, at 2:11pm, sebb wrote: On 5 January 2013 21:33, vigna vi...@di.unimi.it wrote: But why would you want a web crawler to have 10-20K simultaneously opened connections in the first place? (I thought I answered this, but it's not on the archive. Boh.) Having a few thousands

Re: AbstractNIOConnPool memory leak?

2013-01-05 Thread Ken Krugler
of the server, which could be a better approach to constraining the # of keep-alive requests. -- Ken -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions training Hadoop, Cascading, Mahout Solr

Re: AbstractNIOConnPool memory leak?

2013-01-05 Thread Ken Krugler
On Jan 5, 2013, at 3:31pm, vigna wrote: On 5 Jan 2013, at 3:10 PM, Ken Krugler kkrugler_li...@transpac.com wrote: So on a large box (e.g. 24 more powerful cores) I could see using upward of 10K threads being the optimal number. We are working to make 20-30K connections work on 64 cores

Re: java.net.URISyntaxException: Illegal character in query

2012-06-04 Thread Ken Krugler
http://about.me/kkrugler +1 530-210-6378 -- Ken Krugler http://www.scaleunlimited.com custom big data solutions training Hadoop, Cascading, Mahout Solr

Re: How to use logging abilities of HttpClient?

2012-04-10 Thread Ken Krugler
is the code: http://pastebin.com/H1PWqdBc Question same as always: why doesn't it work? -- Ken Krugler http://www.scaleunlimited.com custom big data solutions training Hadoop, Cascading, Mahout Solr

Re: No CookiePolicy.IGNORE

2012-01-31 Thread Ken Krugler
On Jan 31, 2012, at 7:30am, Oleg Kalnichevski wrote: On Mon, 2012-01-30 at 17:56 -0800, Ken Krugler wrote: OK, answering my own question - because ignoreCookies isn't a supported policy. I assume either one should be added, or the documentation fixed up. -- Ken Hi Ken

Re: Get value of a form field?

2012-01-30 Thread Ken Krugler
the value of a form field of a website? I guess that's already considered parsing and therefore not part of the httpclient? I am just a little bit irritated as it's possible to set form values when using http post. Thanks for you help :-) Best regards Stefan -- Ken Krugler

Re: Get value of a form field?

2012-01-30 Thread Ken Krugler
didn't see how to do that? 2012/1/30 Ken Krugler kkrugler_li...@transpac.com Assuming you've already read http://hc.apache.org/httpcomponents-client-ga/primer.html, could you provide more details of what exactly you're trying to do with HttpClient? -- Ken On Jan 30, 2012, at 12:56pm

No CookiePolicy.IGNORE

2012-01-30 Thread Ken Krugler
to use that policy. Thanks, -- Ken -- Ken Krugler http://www.scaleunlimited.com custom big data solutions training Hadoop, Cascading, Mahout Solr

Re: No CookiePolicy.IGNORE

2012-01-30 Thread Ken Krugler
OK, answering my own question - because ignoreCookies isn't a supported policy. I assume either one should be added, or the documentation fixed up. -- Ken On Jan 30, 2012, at 4:46pm, Ken Krugler wrote: Is there any reason why CookiePolicy doesn't include public static final String IGNORE

Re: Possibility of using non-blocking calls for connection pools?

2012-01-06 Thread Ken Krugler
during web crawling, when I had 300+ threads sharing one connection pool. Would it work to go for finer-grained locking, by using atomic counters to track enforce limits on per route/total connections? -- Ken -- Ken Krugler http://www.scaleunlimited.com custom big data

Re: HttpClient 4.1.2 and redirect handling...

2011-09-28 Thread Ken Krugler
profile at: http://www.linkedin.com/in/rjeffreyvincent I ♥ DropBox http://db.tt/9O6LfBX !! -- Ken Krugler +1 530-210-6378 http://bixolabs.com custom big data solutions training Hadoop, Cascading, Mahout Solr

Re: Obtaining charset of page from HttpResponse.

2011-08-16 Thread Ken Krugler
- To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org -- Ken Krugler +1 530-210-6378 http://bixolabs.com custom data mining solutions

Re: Trying to send raw requests without URL encodings?

2011-04-03 Thread Ken Krugler
...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Bytes Missing from HTTP Response

2011-04-02 Thread Ken Krugler
in section 4.3.3 of [XML] that directly address this contingency. Which means it will look for a byte-order-mark and encoding declaration inside of the XML content. -- Ken -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

HttpClient 3.1 safe as replacement for 3.0.1?

2011-03-10 Thread Ken Krugler
-in replacement for HttpClient 3.0.1? I haven't seen anything that would lead me to believe otherwise, but I also haven't found anybody using HC 3.1 with Hadoop in a large-scale cluster. Thanks, -- Ken -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b

Re: Header and Content parsing and saving as html page

2011-02-11 Thread Ken Krugler
-- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h

Re: any open source crawlers using hc?

2011-01-27 Thread Ken Krugler
Apache Droids also uses HttpClient 4.x And Nutch uses HttpClient 3.1. -- Ken -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail

Re: Get Server Response Header without downloading

2011-01-06 Thread Ken Krugler
, e-mail: httpclient-users-h...@hc.apache.org -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org

Re: Connection Pool like JDBC

2010-09-12 Thread Ken Krugler
Does httpclient has a connection pool like JDBC? Yes. -- Ken -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users

Re: Invalid URL causes no Exception - instance keeps still

2010-09-09 Thread Ken Krugler
. -- Ken -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail

Re: Why doesn't httpclient follow redirects on this URL?

2010-09-08 Thread Ken Krugler
hostUrl = new URI(host.toURI()).toURL(); return new URL(hostUrl, finalRequest.getURI().toString()).toExternalForm(); -- Ken http://ken-blog.krugler.org +1 530-265-2225 -- Ken Krugler +1 530-210-6378 http

Re: What is the determining factor for Max Connections?

2010-09-06 Thread Ken Krugler
the entire environment, not just HttpClient parameters. -- Ken -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr

Re: SSL support in HttpCore 4.0.1

2010-09-05 Thread Ken Krugler
: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org http://ken-blog.krugler.org +1 530-265-2225 -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c

Re: ThreadSafeClientConnManager + jdk Executor

2010-08-06 Thread Ken Krugler
...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org http://ken-blog.krugler.org +1 530-265-2225 Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m

Re: Connection Shutdown Immediately on HttpClient 4.01

2010-06-24 Thread Ken Krugler
Hi Oleg, On Jun 24, 2010, at 6:54am, Oleg Kalnichevski wrote: On Thu, 2010-06-24 at 05:42 -0700, Ken Krugler wrote: On Jun 23, 2010, at 8:24pm, ctg3 wrote: I am having an issue with HttpClient 4.01. When I try to access www.google.com or any other website I get a java.net.SocketException

Re: GET request using httpclient4.0.1

2010-06-09 Thread Ken Krugler
to offer. I might wind up building my own version of HttpClient that uses a custom URI class, which wraps the standard URI class other than changes needed for non-standard subdomains. If so, I'll post my notes about how that worked. -- Ken Ken Krugler wrote: Hi Udit, I believe the problem

Re: A basic question about httpclient and httpcore

2010-05-24 Thread Ken Krugler
. Thanks Sachin Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional

Re: BindException while using HttpClient under load

2010-05-24 Thread Ken Krugler
at Nabble.com. - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org Ken Krugler +1 530-210-6378 http

Re: stale connection

2010-05-12 Thread Ken Krugler
, for any loss or damage arising in any way from its use. If you received this transmission in error, please immediately contact the sender and destroy the material in its entirety, whether in electronic or hard copy format. Thank you. Ken Krugler +1

Re: stale connection

2010-05-11 Thread Ken Krugler
idempotent = !(request instanceof HttpEntityEnclosingRequest); // Retry if the request is considered idempotent return idempotent; } } Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i

Re: defaulthttpclient - connected ?

2010-05-04 Thread Ken Krugler
or an authorized designee, you may not copy or use it, or disclose it to anyone else. If you received it in error please notify us immediately and then destroy it. Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Distinguish between temporary and permanent redirects.

2010-04-28 Thread Ken Krugler
http://ken-blog.krugler.org +1 530-265-2225 Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Simulating connection timeout?

2010-03-30 Thread Ken Krugler
On Mar 29, 2010, at 1:59pm, KARR, DAVID (ATTSI) wrote: -Original Message- From: Ken Krugler [mailto:kkrugler_li...@transpac.com] Sent: Monday, March 29, 2010 1:42 PM To: HttpClient User Discussion Subject: Re: Simulating connection timeout? On Mar 29, 2010, at 1:12pm, Sam Crawford

Re: Simulating connection timeout?

2010-03-29 Thread Ken Krugler
a connection timeout, as no response will be received. Alternatively, if you have a large regional network, try connecting to a host in another city/country. Thanks, Sam On 29 March 2010 20:53, Ken Krugler kkrugler_li...@transpac.com wrote: On Mar 29, 2010, at 12:31pm, KARR, DAVID (ATTSI) wrote: I'm

Re: URL/URI syntax issue in HC 4.0.1

2010-03-23 Thread Ken Krugler
: Ken Krugler [mailto:kkrugler_li...@transpac.com] Sent: Monday, March 22, 2010 2:39 PM To: HttpClient User Discussion Subject: Re: URL/URI syntax issue in HC 4.0.1 On Mar 22, 2010, at 2:15pm, natarajan_va...@emc.com natarajan_va...@emc.com wrote: The URL, http://win2k3_64_ora:9300/app, seems

Re: HttpClient Reset

2010-03-23 Thread Ken Krugler
. - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: HttpClient Reset

2010-03-23 Thread Ken Krugler
of the last request made that succeeded. -- Ken On Tue, Mar 23, 2010 at 4:01 PM, Ken Krugler kkrugler_li...@transpac.com wrote: I haven't used HttpClient 3.1 for a while now, so I'm not up on the typical connection management problems. I don't think that two threads will share the same

Re: HTTP pipelining example

2010-03-12 Thread Ken Krugler
to pipeline a bunch together). Thanks in advance, Brian Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient

Re: Too Many Files Open

2010-03-07 Thread Ken Krugler
...@hc.apache.org Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org

Re: responding to form login fields

2010-03-04 Thread Ken Krugler
it to anyone else. If you received it in error please notify us immediately and then destroy it. Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: isolate (possible) problems with webserver accessed via httpclient

2010-03-02 Thread Ken Krugler
for the failure case, so you'd see exactly what's coming back from the server. -- Ken Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Apache httpclient read timeout

2010-02-17 Thread Ken Krugler
Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h

Re: Logging into a website issues with 4.0.1

2010-02-15 Thread Ken Krugler
/div /div Any help will be greatly appreciated. Thanks, Robert Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Logging into a website issues with 4.0.1

2010-02-15 Thread Ken Krugler
, Robert On Mon, Feb 15, 2010 at 10:36 AM, Ken Krugler kkrugler_li...@transpac.comwrote: In the automated login, I don't see the POST parameters: txtUserName=myUserNametxtPassword=myPassWord -- Ken On Feb 15, 2010, at 7:36am, Robert Stone wrote: *Thanks to Jeff* for getting me on my way

Re: new to the list

2010-02-12 Thread Ken Krugler
redirect = new HttpGet(uri); httpClient.execute(redirect, context); return context; thanks Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: HttpClient 4.0 encoding madness

2010-01-29 Thread Ken Krugler
-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: HttpClient 4.0 encoding madness

2010-01-29 Thread Ken Krugler
On Jan 29, 2010, at 3:35am, sebb wrote: On 29/01/2010, Ken Krugler kkrugler_li...@transpac.com wrote: On Jan 28, 2010, at 10:09pm, amoldavsky wrote: Hi Oleg, Thank you for the quick reply. So if there is a possibility that not the whole buffer is filled how can I insure or force

Re: When returning page is 404 no InputStream can be read

2010-01-27 Thread Ken Krugler
don't. The browser might be synthesizing something (depending on the browser) for empty 404 pages. What happens when you do a GET request to that server via curl (with the -v option)? -- Ken Ken Krugler +1 530-210-6378 http://bixolabs.com e l

Re: Best-Practices for Multithreaded use of HttpClient (with Cookies)?

2010-01-27 Thread Ken Krugler
Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: getting only the header

2010-01-26 Thread Ken Krugler
Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: getting only the header

2010-01-26 Thread Ken Krugler
with the HttpGet object, and that in turn calls request.abort(). -- Ken Ken Krugler wrote: On Jan 26, 2010, at 3:54am, Claudio Martella wrote: As I mentioned in the previous post, i'm using httpclient for a webcrawler i'm writing. at the moment i'm doing something like this: while

Re: method.releaseConnection()

2010-01-25 Thread Ken Krugler
http://ken-blog.krugler.org +1 530-265-2225 Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: HttpComponents documentation

2010-01-24 Thread Ken Krugler
-client/primer.html And then provide some input on what's so bad about it. -- Ken Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: escaped absolute path not valid error with URLs having GB18030 or japansese characters in them

2010-01-22 Thread Ken Krugler
. - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Efficiently repeating identical requests

2010-01-09 Thread Ken Krugler
it to the socket. Is this possible with HttpClient, and if so how would I go about implementing this? Many thanks, Tony Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Efficiently repeating identical requests

2010-01-09 Thread Ken Krugler
, Tony Message du 09/01/10 21:03 De : Ken Krugler A : HttpClient User Discussion Copie à : Objet : Re: Efficiently repeating identical requests Hi Tony, I'm wondering why you need to do this level of optimization - are you running into issues with this type of POST request chewing up too many

Re: Connections per host and redirects

2010-01-09 Thread Ken Krugler
Hi Oleg, On Jan 9, 2010, at 1:35pm, Oleg Kalnichevski wrote: Ken Krugler wrote: I wanted to verify some behavior I'm seeing with HttpClient 4.0 I occasionally get a ConnectionPoolTimeoutException, even when I've got spare connections in my ThreadSafeClientConnManager pool. Looking

Blocked threads during requests

2010-01-09 Thread Ken Krugler
.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:319) If anybody (well, OK, Oleg) has input on things I could be doing wrong to trigger this type of behavior, and/or ways to avoid it, I'm all ears. -- Ken Ken Krugler +1 530-210-6378 http

Connections per host and redirects

2010-01-08 Thread Ken Krugler
that would only mask the problem. Thanks, -- Ken Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users

Re: Authentication fails in HttpClient but OK in browser

2010-01-06 Thread Ken Krugler
-User mailing list archive at Nabble.com. Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr

Re: Retry on ConnectTimeoutException

2010-01-06 Thread Ken Krugler
Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org

Re: Setting cookie policy with HttpClient 4.0

2009-12-03 Thread Ken Krugler
Hi Oleg, On Dec 3, 2009, at 2:40am, Oleg Kalnichevski wrote: On Wed, 2009-12-02 at 19:15 -0800, Ken Krugler wrote: Below is an email from August 7th, which I'm reviving due to this becoming a bigger issue over in Bixo-land. I've continued to run into this issue with my crawls, but so far I'm

Re: Not able to upload bigger file (max size 4kb successful)

2009-10-05 Thread Ken Krugler
(Unknown Source) The exception has been caused by the server shutting down the connection prematurely most likely due to a error or configuration issue on the server side. This has nothing to do with HttpClient. Oleg -- Ken Krugler TransPac Software, Inc. http

Re: Last redirect URL

2009-09-30 Thread Ken Krugler
/tutorial/html/fundamentals.html#d4e205 The key bit of documentation was at: http://hc.apache.org/httpcomponents-client/tutorial/html/httpagent.html#d4e1022 -- Ken -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378

Re: Parallel Webcrawler Implementation

2009-09-24 Thread Ken Krugler
the number of parallel request to one host to be two. Not sure if that would be a factor in your case, given how you're creating a new client for each request. -- Ken -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378

Re: HttpClient 3.1 PutMethod - HttpClient 4 HttpPut

2009-09-23 Thread Ken Krugler
: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378

Re: French accents problem while sending postMethod

2009-09-22 Thread Ken Krugler
, to avoid issues with terminals set to us-ascii. Often problems like this are caused by server configuration - e.g. there are countless posts on the Solr mailing list about needing to configure Tomcat to treat incoming URLs as UTF-8. -- Ken -- Ken Krugler

Re: Last redirect URL

2009-09-21 Thread Ken Krugler
() and HttpResponse#getAllHeaders() - couldn't find anything. From past posts on the list, I thought httpMethod.getURI() would return the final URL. -- Ken -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378

Re: Charset trouble, questionmarks

2009-09-02 Thread Ken Krugler
Hi Magnus, On Sep 2, 2009, at 1:22am, Magnus Olstad Hansen wrote: Hello, I'm using HttpClient 4.0 to download a webpage the same way as shown in one of the examples. This is my method to return a webpage as a string: protected static String leechUrl(String url) throws

Re: Charset trouble, questionmarks

2009-09-02 Thread Ken Krugler
...@hc.apache.org -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient

Re: Downloading HTML frameset pages via HTTPClient

2009-08-25 Thread Ken Krugler
: httpclient-users-h...@hc.apache.org -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional

Re: Downloading HTML frameset pages via HTTPClient

2009-08-25 Thread Ken Krugler
they will help. Thanks, Melroy -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e

Re: Downloading HTML frameset pages via HTTPClient

2009-08-25 Thread Ken Krugler
On Aug 25, 2009, at 3:39pm, melroyr wrote: I have no idea how to set the user agent in HTTPClient The (really good) on-line documentation is your friend. http://hc.apache.org/httpcomponents-client/tutorial/html/ -- Ken melroyr wrote: I have written a program to download html pages

Re: Downloading HTML frameset pages via HTTPClient

2009-08-24 Thread Ken Krugler
Hi Melroy, On Aug 24, 2009, at 12:20pm, melroyr wrote: I have written a program to download html pages from harristeeter. However, when I run my program, I get the following !DOCTYPE HTML PUBLIC -//W3C//DTD HTML 4.01 Frameset//EN http://www.w3.org/TR/html4/frameset.dtd; html head

Re: [ANNOUNCEMENT] HttpComponents HttpClient 4.0 (GA) release

2009-08-22 Thread Ken Krugler
...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr

Re: Best Practice to Use HttpClient in Multithreaded Environment

2009-08-17 Thread Ken Krugler
of stack space used per thread, or DNS lookups becoming slow, etc. -- Ken Ken Krugler wrote: Hi Yan Cheng, I haven't used HttpClient 3.x for a while - switched to 4.0 and haven't looked back. But in general method A is going to work better. You can configure

Re: Reading and terminating stream in HttpClient 4

2009-08-17 Thread Ken Krugler
On Aug 17, 2009, at 11:27am, droidin.net wrote: Here's what I need to do 1. Read response as a stream 2. Feed it into SAX-based HTML parser on the fly 3. When certain tag is detected - terminate the stream In other words - I'm reading large documents from which I only need top 5%, can I do

Re: Invalid redirect location: http://wapp.baidu.com/f?kw=???????

2009-08-12 Thread Ken Krugler
them up before following them. -- Ken -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional

Setting cookie policy with HttpClient 4.0

2009-08-07 Thread Ken Krugler
, -- Ken http://ken-blog.krugler.org +1 530-265-2225 -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378

Safe close of connection with partial read of response

2009-08-03 Thread Ken Krugler
-- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h

When to use ConnectionKeepAliveStrategy

2009-08-03 Thread Ken Krugler
with a stale connection too optimistic? Thanks, -- Ken -- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org

Use of HttpEntity.getContentLength()

2009-08-03 Thread Ken Krugler
-- Ken Krugler TransPac Software, Inc. http://www.transpac.com +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org

Re: HttpConnectionManager ?

2009-07-08 Thread Ken Krugler
, which is definitely a speed win. Note that with the MultiThreadedHttpConnectionManager, you create one HttpClient, and then re-use it for all of your (multi-threaded) requests. -- Ken -- Ken Krugler +1 530-210-6378

Re: some detailed migration questions...

2009-07-08 Thread Ken Krugler
with 4.0. -- Ken -- Ken Krugler +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands, e-mail: httpclient-users-h...@hc.apache.org

Re: some detailed migration questions...

2009-07-08 Thread Ken Krugler
setup. Is there a way to use HttpParams to change values on a per-request basis? [snip] Thanks, -- Ken -- Ken Krugler +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org For additional commands

Proper handling of stale connections

2009-07-07 Thread Ken Krugler
, I really only want to retry the request when I get an IOException. And that there's no point in retrying more than once. Thanks, -- Ken -- Ken Krugler +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr

RE: Wire Logging

2009-06-03 Thread Ken Krugler
, with HttpClient 3.1 there was a weird thing where httpclient.wire was the name of the wire logger, not org.apache.whatever -- Ken -- Ken Krugler +1 530-210-6378 - To unsubscribe, e-mail: httpclient-users-unsubscr...@hc.apache.org

Re: Setting timeouts in 4.0

2009-05-05 Thread Ken Krugler
On Mon, May 04, 2009 at 04:30:15PM -0700, Ken Krugler wrote: Hi all, In Http 3.1, the Nutch code base would configure timeouts using the following snippet of code: MultiThreadedHttpConnectionManager connectionManager = new MultiThreadedHttpConnectionManager