Hi,

When you install the patch, did you see any fails? No fail is tolerated. I
am guessing there is something wrong with ivy.xml. I am suggesting
that checkout ALL
files in Nutch and then try it again.

Best,
Jiaxin

On Tuesday, February 17, 2015, Jaydeep Bagrecha <[email protected]> wrote:

> Hi all,
> I am trying to install and build selenium with nutch1.10 on Mac Yosemite.
>
>  having following error after downloading selenium patch(
> https://issues.apache.org/jira/browse/NUTCH-1933) and while using “ant
> runtime” command (as mentioned by Jiaxin below).Any suggestions to avoid it?
>
>  error: package org.openqa.selenium does not exist
>     [javac] import org.openqa.selenium.By;
>     [javac]                           ^
>  error: package org.openqa.selenium does not exist
>     [javac] import org.openqa.selenium.WebDriver;
>     [javac]                           ^
>  error: package org.openqa.selenium.firefox does not exist
>     [javac] import org.openqa.selenium.firefox.FirefoxDriver;
>     [javac]                                   ^
>  error: package org.openqa.selenium.firefox does not exist
>     [javac] import org.openqa.selenium.firefox.FirefoxProfile;
> error: cannot find symbol
>     [javac]   public static ThreadLocal<WebDriver> threadWebDriver = new
> ThreadLocal<WebDriver>() {
>     [javac]                             ^
>     [javac]   symbol:   class WebDriver
>     [javac]   location: class HttpWebClient
>  error: cannot find symbol
>     [javac]     protected WebDriver initialValue()
>     [javac]               ^
>     [javac]   symbol: class WebDriver
>  error: cannot find symbol
>     [javac]       FirefoxProfile profile = new FirefoxProfile();
>     [javac]       ^
>     [javac]   symbol: class FirefoxProfile
> error: cannot find symbol
>     [javac]       WebDriver driver = new FirefoxDriver(profile);
>     [javac]                              ^
>     [javac]   symbol: class FirefoxDriver
>  error: cannot find symbol
>     [javac]       driver = new FirefoxDriver();
>     [javac]                    ^
>     [javac]   symbol:   class FirefoxDriver
>     [javac]   location: class HttpWebClient
>
>  error: cannot find symbol
>     [javac]       new WebDriverWait(driver, 3);
>     [javac]           ^
>     [javac]   symbol:   class WebDriverWait
>     [javac]   location: class HttpWebClient
>
>  error: cannot find symbol
>     [javac]       String innerHtml =
> driver.findElement(By.tagName("body")).getAttribute("innerHTML");
>     [javac]                                             ^
>     [javac]   symbol:   variable By
>     [javac]   location: class HttpWebClient
>
> Thanks,
> Jaydeep
>
> On Feb 12, 2015, at 11:37 PM, Jiaxin Ye <[email protected]
> <javascript:_e(%7B%7D,'cvml','[email protected]');>> wrote:
>
> Sure. I will do it once I confirm it works...
>
> On Thursday, February 12, 2015, Mattmann, Chris A (3980) <
> [email protected]
> <javascript:_e(%7B%7D,'cvml','[email protected]');>> wrote:
>
>> This is great, Jiaxin, can you please make a wiki page on the Nutch
>> wiki that has this information?
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: [email protected]
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>>
>>
>>
>> -----Original Message-----
>> From: Jiaxin Ye <[email protected]>
>> Reply-To: "[email protected]" <[email protected]>
>> Date: Thursday, February 12, 2015 at 9:39 PM
>> To: "[email protected]" <[email protected]>
>> Subject: Nutch-Selenium in Nutch 1.10
>>
>> >Hi Li, Shuo. You are so right. I finished installing and successfully run
>> >the butch with selenium and Firefox. I have a question though, does your
>> >Firefox plug out for always all the urls we crawled?
>> >
>> >
>> >Hi Prof Mattmann. I think here is the way we install selenium on MAC with
>> >OS higher than 10.6 I think...
>> >
>> >
>> >1. Download XQuatz, it's a dmp file, install it directly
>> >2. Download Nutch 1.10
>> >3. Download the patch and put it on the Nutch project directory
>> >4. patch -p0 < THE PATCH NAME
>> >5. DO NOT update the build.xml and the ivy.xml as the selenium tutorial
>> >in the github told you. The patch basically updated those .xml file for
>> >us. And the patch also installs lib-selenium and protocol selenium for us
>> >(Correct me if
>> > I am wrong)
>> >6. Update tika dependency if needed
>> >7. Go to the Nutch project directory and run ant runtime
>> >8. Download Firefox
>> >9. Open a new terminal and type
>> >    xvfb -screen scrn 1024x758x34 (I think you can set it smaller if you
>> >want...)
>> >    There should be some errors after entering the command (for me at
>> >least). Manually sudo create a /tmp/.X11-unix folder, and then set the
>> >mode to 1777. Rerun the command. xvfb should be working.
>> >10. Go to nutch > runtime > local and run the crawling command
>> >
>> >
>> >Hope it helps. :)
>> >
>> >
>> >Best,
>> >Jiaxin
>> >
>> >
>> >
>> >
>> >
>> >On Thu, Feb 12, 2015 at 1:08 PM, Shuo Li
>> ><[email protected] <javascript:_e(%7B%7D,'cvml','[email protected]');>> wrote:
>> >
>> >I think I have possibly finished installing.
>> >
>> >
>> >What you need to do:
>> >0. git status and checkout what you have modified.
>> >1. patch -p0 < YOUR_PATCH_FILE
>> >2. ant clean jar
>> >3. ant runtime
>> >
>> >
>> >Will try crawling using selenium later on. Hope this helped. >_<
>> >
>> >
>> >On Thu, Feb 12, 2015 at 9:20 AM, Mattmann, Chris A (3980)
>> ><[email protected]
>> ><javascript:_e(%7B%7D,'cvml','[email protected]');>> wrote:
>> >
>> >Yes I believe you need to install X11 - why don't you try and report back
>> >what you find thanks.
>> >
>> >Sent from my iPhone
>> >
>> >On Feb 12, 2015, at 8:28 AM, Jiaxin Ye <[email protected]
>> ><javascript:_e(%7B%7D,'cvml','[email protected]');>> wrote:
>> >
>> >
>> >
>> >Hi professor, but can we use Selenium on Mac?
>> >
>> >On Thursday, February 12, 2015, Mattmann, Chris A (3980)
>> ><[email protected]
>> ><javascript:_e(%7B%7D,'cvml','[email protected]');>> wrote:
>> >
>> >You need Selenium Jiaxin, in order to crawl dynamic pages in the
>> >polar dataset you have been assigned in my CSCI 572 search engines class.
>> >
>> >The instructions for integrating Selenium with Nutch 1.10-trunk
>> >are here:
>> >
>> >https://issues.apache.org/jira/browse/NUTCH-1933
>> >
>> >
>> >Cheers,
>> >Chris
>> >
>> >
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >Chris Mattmann, Ph.D.
>> >Chief Architect
>> >Instrument Software and Science Data Systems Section (398)
>> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >Office: 168-519, Mailstop: 168-527
>> >Email: [email protected]
>> >WWW:  http://sunset.usc.edu/~mattmann/
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >Adjunct Associate Professor, Computer Science Department
>> >University of Southern California, Los Angeles, CA 90089 USA
>> >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> >
>> >
>> >
>> >
>> >
>> >
>> >-----Original Message-----
>> >From: Jiaxin Ye <[email protected]>
>> >Reply-To: "[email protected]" <[email protected]>
>> >Date: Thursday, February 12, 2015 at 12:46 AM
>> >To: "[email protected]" <[email protected]>
>> >Subject: Re: Nutch-Selenium in Nutch 1.10
>> >
>> >>Well, good choice. I am thinking changing to ubuntu now. The thing is
>> why
>> >>do we need Selenium anyway? Just easier to perform crawling?
>> >>
>> >>On Thu, Feb 12, 2015 at 12:25 AM, Shuo Li
>> >><[email protected]> wrote:
>> >>
>> >>Interestingly, I'm a mac user but I don't want to screw my laptop so I'm
>> >>using vagrant with Ubuntu Trusty. It doesn't have GUI but Xvfb can still
>> >>be installed properly. The issue would be I don't know how to integrate
>> >>Selenium with Nutch 1.10.
>> >>
>> >>On Thu, Feb 12, 2015 at 12:04 AM, Jiaxin Ye
>> >><[email protected]> wrote:
>> >>
>> >>Hi all,
>> >>
>> >>
>> >>Anyone here knows where to find the setup tutorial for Selenium on Mac
>> ??
>> >>I find it difficult to install Xvfb on mac.
>> >>
>> >>
>> >>Best,
>> >>Jiaxin
>> >>
>> >>
>> >>On Tue, Feb 10, 2015 at 9:42 PM, Sapnashri Suresh
>> >><[email protected]> wrote:
>> >>
>> >>Hi Shuo Li,
>> >>
>> >>
>> >>We were facing a similar issue. Prof. Mattman suggested we look into
>> this
>> >>patch for Selenium on Nutch 1.10 :
>> >>https://issues.apache.org/jira/browse/NUTCH-1933.
>> >>
>> >>
>> >>Hope this helps!
>> >>
>> >>
>> >>Thanks,
>> >>Sapna
>> >>
>> >>On Tue, Feb 10, 2015 at 9:36 PM, Shuo Li
>> >><[email protected]> wrote:
>> >>
>> >>Yop,
>> >>
>> >>
>> >>I'm trying to install selenium in Nutch 1.10. However, this error pops
>> >>out:
>> >>
>> >>
>> >>error: package org.apache.nutch.storage does not exist
>> >>
>> >>
>> >>
>> >>I can only find this package in Nutch 2.x. Is there a way to use
>> Selenium
>> >>in 1.10?
>> >>
>> >>
>> >>Any advice would be appreciated.
>> >>
>> >>
>> >>Regards,
>> >>Shuo Li
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>--
>> >>Graduate Student
>> >>MS in CS (Data Science)
>> >>Viterbi School of Engineering
>> >>University of Southern California
>> >>
>> >>
>> >>Phone:
>> >>+1 650-307-9848 <tel:%2B1%20650-307-9848> <tel:%2B1%20650-307-9848>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>>
>>
>

Reply via email to