I think he's scraping someone else's site. 

You obviously can't do this with Ruby alone, as there is no headless web 
browser written entirely in Ruby (that's just nonsense)

If you can get phantomjs working on your production site, that's probably the 
way to go. Look deep into the internals of Capybara to understand how it drives 
phantomjs. With phantomjs, you basically have a headless web browser and you 
can use Capybara's DSL to access parts of the page, including evaluating 
scripts and parsing the DOM.

Just keep in mind phantomjs is an actual executable so it needs to be compiled 
and built for your production environment explicitly, which might be a little 
tricky depending on where your site is. 

But a little birdie told me a few months ago that the phantomjs team has 
decided that once Chrome has a headless mode, which I believe is forthcoming, 
they plan to abandon phantomjs in favor of Chrome's headless mode. Not sure if 
that's really true or when that will happen. 

-Jason



> On Jun 18, 2017, at 8:36 AM, Colin Law <[email protected]> wrote:
> 
> On 18 June 2017 at 12:21, Martin Luy <[email protected]> wrote:
>> I'm extracting content from some websites. Currently I evaluate HTML code
>> using Nokogiri. But the relevant content is not contained in the responded
>> body of the HTTP GET request. This is because there is some Javascript code
>> like $(window).load() or $(document).ready() that will send some Ajax
>> requests and fill the original HTML code.
>> 
>> So I'm searching for some library that automatically executes Javascript
>> code and Ajax requests just like a normal browser.
> 
> Understood. Don't think I can help I am afraid. Does the site not work
> with js disabled in the browser?
> 
> Colin
> 
>> 
>> Martin
>> 
>> Von meinem Samsung Gerät gesendet.
>> 
>> 
>> -------- Ursprüngliche Nachricht --------
>> Von: Colin Law <[email protected]>
>> Datum: 18.06.17 09:42 (GMT+01:00)
>> An: "Ruby on Rails: Talk" <[email protected]>
>> Betreff: Re: [Rails] browser simulator independent of web framework
>> 
>> On 17 June 2017 at 22:58, Martin L. <[email protected]> wrote:
>>> Hi all,
>>> 
>>> Is there any browser simulator that fulfills these requirements:
>>> 
>>> - gem written in Ruby
>>> - automatically performing Ajax and Javascript code (XSS is not an issue
>>> in
>>> my case)
>>> - independent of the frameworks used by the website (Rails, JavaEE,
>>> ASP.NET,
>>> ...)
>>> - only client-side
>>> - no testing
>>> - no browser dependency
>> 
>> What do you mean 'no testing'? If not for testing then what is it for?
>> 
>> Colin
>> 
>> --
>> You received this message because you are subscribed to a topic in the
>> Google Groups "Ruby on Rails: Talk" group.
>> To unsubscribe from this topic, visit
>> https://groups.google.com/d/topic/rubyonrails-talk/H_YImOIzNNo/unsubscribe.
>> To unsubscribe from this group and all its topics, send an email to
>> [email protected].
>> To post to this group, send email to [email protected].
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/rubyonrails-talk/CAL%3D0gLvqHArcWpbZ5gsfCiPg0EF%3D4kD8QzZbC3KnicN58uAZ8A%40mail.gmail.com.
>> For more options, visit https://groups.google.com/d/optout.
>> 
>> --
>> You received this message because you are subscribed to the Google Groups
>> "Ruby on Rails: Talk" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To post to this group, send email to [email protected].
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/rubyonrails-talk/smyckbsskql3om4h9odf01hm.1497784510166%40email.android.com.
>> 
>> For more options, visit https://groups.google.com/d/optout.
> 
> -- 
> You received this message because you are subscribed to the Google Groups 
> "Ruby on Rails: Talk" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to [email protected].
> To post to this group, send email to [email protected].
> To view this discussion on the web visit 
> https://groups.google.com/d/msgid/rubyonrails-talk/CAL%3D0gLsaS8DzC2o0%3DHkKt_%2BMvvzqutm0mSHUj5jf7CoKvnqJEg%40mail.gmail.com.
> For more options, visit https://groups.google.com/d/optout.
> 

----

Jason Fleetwood-Boldt
[email protected]
http://www.jasonfleetwoodboldt.com/writing

If you'd like to reply by encrypted email you can find my public key on 
jasonfleetwoodboldt.com (more about setting GPG: https://gpgtools.org) 

-- 
You received this message because you are subscribed to the Google Groups "Ruby 
on Rails: Talk" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/rubyonrails-talk/A0A534AE-3ECA-41F9-9170-432A98DD743D%40datatravels.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to