Also look at Mechanize, which I believe can do a headless JS scrape of a site. 
It's purely a scraper, so less likely to be so test-centric.

Walter

> On Jun 19, 2017, at 10:18 AM, Jason Fleetwood-Boldt <[email protected]> 
> wrote:
> 
> 
> 
> I think he's scraping someone else's site. 
> 
> You obviously can't do this with Ruby alone, as there is no headless web 
> browser written entirely in Ruby (that's just nonsense)
> 
> If you can get phantomjs working on your production site, that's probably the 
> way to go. Look deep into the internals of Capybara to understand how it 
> drives phantomjs. With phantomjs, you basically have a headless web browser 
> and you can use Capybara's DSL to access parts of the page, including 
> evaluating scripts and parsing the DOM.
> 
> Just keep in mind phantomjs is an actual executable so it needs to be 
> compiled and built for your production environment explicitly, which might be 
> a little tricky depending on where your site is. 
> 
> But a little birdie told me a few months ago that the phantomjs team has 
> decided that once Chrome has a headless mode, which I believe is forthcoming, 
> they plan to abandon phantomjs in favor of Chrome's headless mode. Not sure 
> if that's really true or when that will happen. 
> 
> -Jason
> 
> 
> 
>> On Jun 18, 2017, at 8:36 AM, Colin Law <[email protected]> wrote:
>> 
>> On 18 June 2017 at 12:21, Martin Luy <[email protected]> wrote:
>>> I'm extracting content from some websites. Currently I evaluate HTML code
>>> using Nokogiri. But the relevant content is not contained in the responded
>>> body of the HTTP GET request. This is because there is some Javascript code
>>> like $(window).load() or $(document).ready() that will send some Ajax
>>> requests and fill the original HTML code.
>>> 
>>> So I'm searching for some library that automatically executes Javascript
>>> code and Ajax requests just like a normal browser.
>> 
>> Understood. Don't think I can help I am afraid. Does the site not work
>> with js disabled in the browser?
>> 
>> Colin
>> 
>>> 
>>> Martin
>>> 
>>> Von meinem Samsung Gerät gesendet.
>>> 
>>> 
>>> -------- Ursprüngliche Nachricht --------
>>> Von: Colin Law <[email protected]>
>>> Datum: 18.06.17 09:42 (GMT+01:00)
>>> An: "Ruby on Rails: Talk" <[email protected]>
>>> Betreff: Re: [Rails] browser simulator independent of web framework
>>> 
>>> On 17 June 2017 at 22:58, Martin L. <[email protected]> wrote:
>>>> Hi all,
>>>> 
>>>> Is there any browser simulator that fulfills these requirements:
>>>> 
>>>> - gem written in Ruby
>>>> - automatically performing Ajax and Javascript code (XSS is not an issue
>>>> in
>>>> my case)
>>>> - independent of the frameworks used by the website (Rails, JavaEE,
>>>> ASP.NET,
>>>> ...)
>>>> - only client-side
>>>> - no testing
>>>> - no browser dependency
>>> 
>>> What do you mean 'no testing'? If not for testing then what is it for?
>>> 
>>> Colin
>>> 
>>> --
>>> You received this message because you are subscribed to a topic in the
>>> Google Groups "Ruby on Rails: Talk" group.
>>> To unsubscribe from this topic, visit
>>> https://groups.google.com/d/topic/rubyonrails-talk/H_YImOIzNNo/unsubscribe.
>>> To unsubscribe from this group and all its topics, send an email to
>>> [email protected].
>>> To post to this group, send email to [email protected].
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/rubyonrails-talk/CAL%3D0gLvqHArcWpbZ5gsfCiPg0EF%3D4kD8QzZbC3KnicN58uAZ8A%40mail.gmail.com.
>>> For more options, visit https://groups.google.com/d/optout.
>>> 
>>> --
>>> You received this message because you are subscribed to the Google Groups
>>> "Ruby on Rails: Talk" group.
>>> To unsubscribe from this group and stop receiving emails from it, send an
>>> email to [email protected].
>>> To post to this group, send email to [email protected].
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/rubyonrails-talk/smyckbsskql3om4h9odf01hm.1497784510166%40email.android.com.
>>> 
>>> For more options, visit https://groups.google.com/d/optout.
>> 
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "Ruby on Rails: Talk" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected].
>> To post to this group, send email to [email protected].
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/rubyonrails-talk/CAL%3D0gLsaS8DzC2o0%3DHkKt_%2BMvvzqutm0mSHUj5jf7CoKvnqJEg%40mail.gmail.com.
>> For more options, visit https://groups.google.com/d/optout.
>> 
> 
> ----
> 
> Jason Fleetwood-Boldt
> [email protected]
> http://www.jasonfleetwoodboldt.com/writing
> 
> If you'd like to reply by encrypted email you can find my public key on 
> jasonfleetwoodboldt.com (more about setting GPG: https://gpgtools.org) 
> 
> 
> -- 
> You received this message because you are subscribed to the Google Groups 
> "Ruby on Rails: Talk" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to [email protected].
> To post to this group, send email to [email protected].
> To view this discussion on the web visit 
> https://groups.google.com/d/msgid/rubyonrails-talk/A0A534AE-3ECA-41F9-9170-432A98DD743D%40datatravels.com.
> For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups "Ruby 
on Rails: Talk" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/rubyonrails-talk/7E161DC2-A4E6-4A12-8C13-1FE180EE57CB%40wdstudio.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to