[ 
https://issues.apache.org/jira/browse/NUTCH-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adrian Kunz updated NUTCH-3098:
-------------------------------
    Description: 
The commands in the apache/nutch Docker Readme do not work.
{code:java}
$ docker run -t -i -p 8080:8080 -p 8081:8081 --name nutchcontainer apache/nutch 
# inside the container shell (note no -d):
~ # cat /tmp/supervisord.log
cat: can't open '/tmp/supervisord.log': No such file or directory
{code}
Outside the container shell:
{code:java}
$ curl http://localhost:8080/admin 
curl: (52) Empty reply from server{code}
If I built from source (which takes almost 10 minutes, btw), it seems to work:
{code:java}
docker run -it -p 8180:8080 -p 8181:8081 --name nutch <my-registry>/apache/nutch
WARNING: The requested image's platform (linux/amd64) does not match the 
detected host platform (linux/arm64/v8) and no specific platform was requested
2024-12-09 15:53:15,432 INFO Set uid to user 0 succeeded
2024-12-09 15:53:15,436 INFO supervisord started with pid 1
2024-12-09 15:53:16,440 INFO spawned: 'nutchserver' with pid 7
2024-12-09 15:53:16,442 INFO spawned: 'nutchwebapp' with pid 8
2024-12-09 15:53:17,443 INFO success: nutchserver entered RUNNING state, 
process has stayed up for > than 0 seconds (startsecs)
2024-12-09 15:53:17,443 INFO success: nutchwebapp entered RUNNING state, 
process has stayed up for > than 0 seconds (startsecs){code}
Now I can access the server:
{code:java}
$ curl http://localhost:8181/admin 
{"startDate":1733759756634,"configuration":["default"],"jobs":[],"runningJobs":[]}
$ curl http://localhost:8180
$ # but not the web interface{code}
Another thing:

The Readme seems contradictory in it's use of ports:
 
{code:java}
You can now access the webapp at `http://localhost:8080` and you can interact 
with the REST API e.g. 
```bash curl http://localhost:8080/admin 
{"startDate":1625118207995,"configuration":["default"],"jobs":[],"runningJobs":[]}
 ```{code}
 
Here 8080 is both the web app and the server port, vs the default suggestion, 
where 8081 is server and 8080 is web app:
{code:java}
docker build -t apache/nutch . --build-arg BUILD_MODE=2 --build-arg 
SERVER_PORT=8081 --build-arg SERVER_HOST=0.0.0.0 --build-arg 
WEBAPP_PORT=8080{code}
 

  was:
The commands in the apache/nutch Docker Readme do not work.
{code:java}
$ docker run -t -i -p 8080:8080 -p 8081:8081 --name nutchcontainer apache/nutch 
# inside the container shell (note no -d):
~ # cat /tmp/supervisord.log
cat: can't open '/tmp/supervisord.log': No such file or directory
{code}
Outside the container shell:
{code:java}
$ curl http://localhost:8080/admin 
curl: (52) Empty reply from server{code}
If I built from source (which takes an enormous amount of time, btw), it seems 
to work:
{code:java}
docker run -it -p 8180:8080 -p 8181:8081 --name nutch <my-registry>/apache/nutch
WARNING: The requested image's platform (linux/amd64) does not match the 
detected host platform (linux/arm64/v8) and no specific platform was requested
2024-12-09 15:53:15,432 INFO Set uid to user 0 succeeded
2024-12-09 15:53:15,436 INFO supervisord started with pid 1
2024-12-09 15:53:16,440 INFO spawned: 'nutchserver' with pid 7
2024-12-09 15:53:16,442 INFO spawned: 'nutchwebapp' with pid 8
2024-12-09 15:53:17,443 INFO success: nutchserver entered RUNNING state, 
process has stayed up for > than 0 seconds (startsecs)
2024-12-09 15:53:17,443 INFO success: nutchwebapp entered RUNNING state, 
process has stayed up for > than 0 seconds (startsecs){code}
Now I can access the server:
{code:java}
$ curl http://localhost:8181/admin 
{"startDate":1733759756634,"configuration":["default"],"jobs":[],"runningJobs":[]}
$ curl http://localhost:8180
$ # but not the web interface{code}
Another thing:

The Readme seems contradictory in it's use of ports:
 
{code:java}
You can now access the webapp at `http://localhost:8080` and you can interact 
with the REST API e.g. 
```bash curl http://localhost:8080/admin 
{"startDate":1625118207995,"configuration":["default"],"jobs":[],"runningJobs":[]}
 ```{code}
 
Here 8080 is both the web app and the server port, vs the default suggestion, 
where 8081 is server and 8080 is web app:
{code:java}
docker build -t apache/nutch . --build-arg BUILD_MODE=2 --build-arg 
SERVER_PORT=8081 --build-arg SERVER_HOST=0.0.0.0 --build-arg 
WEBAPP_PORT=8080{code}
 


> Docker Setup from Readme does not work
> --------------------------------------
>
>                 Key: NUTCH-3098
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3098
>             Project: Nutch
>          Issue Type: Bug
>          Components: docker
>            Reporter: Adrian Kunz
>            Priority: Major
>
> The commands in the apache/nutch Docker Readme do not work.
> {code:java}
> $ docker run -t -i -p 8080:8080 -p 8081:8081 --name nutchcontainer 
> apache/nutch 
> # inside the container shell (note no -d):
> ~ # cat /tmp/supervisord.log
> cat: can't open '/tmp/supervisord.log': No such file or directory
> {code}
> Outside the container shell:
> {code:java}
> $ curl http://localhost:8080/admin 
> curl: (52) Empty reply from server{code}
> If I built from source (which takes almost 10 minutes, btw), it seems to work:
> {code:java}
> docker run -it -p 8180:8080 -p 8181:8081 --name nutch 
> <my-registry>/apache/nutch
> WARNING: The requested image's platform (linux/amd64) does not match the 
> detected host platform (linux/arm64/v8) and no specific platform was requested
> 2024-12-09 15:53:15,432 INFO Set uid to user 0 succeeded
> 2024-12-09 15:53:15,436 INFO supervisord started with pid 1
> 2024-12-09 15:53:16,440 INFO spawned: 'nutchserver' with pid 7
> 2024-12-09 15:53:16,442 INFO spawned: 'nutchwebapp' with pid 8
> 2024-12-09 15:53:17,443 INFO success: nutchserver entered RUNNING state, 
> process has stayed up for > than 0 seconds (startsecs)
> 2024-12-09 15:53:17,443 INFO success: nutchwebapp entered RUNNING state, 
> process has stayed up for > than 0 seconds (startsecs){code}
> Now I can access the server:
> {code:java}
> $ curl http://localhost:8181/admin 
> {"startDate":1733759756634,"configuration":["default"],"jobs":[],"runningJobs":[]}
> $ curl http://localhost:8180
> $ # but not the web interface{code}
> Another thing:
> The Readme seems contradictory in it's use of ports:
>  
> {code:java}
> You can now access the webapp at `http://localhost:8080` and you can interact 
> with the REST API e.g. 
> ```bash curl http://localhost:8080/admin 
> {"startDate":1625118207995,"configuration":["default"],"jobs":[],"runningJobs":[]}
>  ```{code}
>  
> Here 8080 is both the web app and the server port, vs the default suggestion, 
> where 8081 is server and 8080 is web app:
> {code:java}
> docker build -t apache/nutch . --build-arg BUILD_MODE=2 --build-arg 
> SERVER_PORT=8081 --build-arg SERVER_HOST=0.0.0.0 --build-arg 
> WEBAPP_PORT=8080{code}
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to