Edit report at https://bugs.php.net/bug.php?id=74181&edit=1

ID:                 74181
Comment by:         [email protected]
Reported by:        james at jameswestby dot net
Summary:            Different configurations for serving www.php.net?
Status:             Open
Type:               Bug
Package:            Website problem
Operating System:   Linux
PHP Version:        Irrelevant
Block user comment: N
Private report:     N

New Comment:

This doesn't address the problem you're having, but maybe you could modify the bot to use this to get the new versions, rather than scraping the site?

http://php.net/releases/index.php?json

Check out the notes in the right-hand sidebar here for details on other query-string options to provide:

http://php.net/releases/

Previous Comments:
------------------------------------------------------------------------
[2017-02-28 14:47:29] james at jameswestby dot net

Description:
------------
Hi,

We have a bot that watches for new versions of PHP by scraping the website occasionally. It sometimes reported that it couldn't find the latest release in the page.

After debugging for a while I found that it was sometimes being redirected to `/search.php` by `http://www.php.net`.

Testing lots of queries against `www.php.net` shows different behaviours, which perhaps indicates a mix of servers behind it with different configurations?

Test script:
---------------
$ curl -I http://www.php.net/

Expected result:
----------------
Same behaviour whichever backend server is reached.

Actual result:
--------------
Here are two consecutive responses I received with curl -I:

HTTP/1.1 200 OK
Date: Tue, 28 Feb 2017 14:43:00 GMT
Server: Apache
Last-Modified: Tue, 28 Feb 2017 13:30:11 GMT
Content-language: en
X-Frame-Options: SAMEORIGIN
Set-Cookie: COUNTRY=NA%2C86.31.132.166; expires=Tue, 07-Mar-2017 14:43:00 GMT; path=/; domain=.php.net
Set-Cookie: LAST_NEWS=1488292980; expires=Wed, 28-Feb-2018 14:43:00 GMT; path=/; domain=.php.net
Link: <http://php.net/index>; rel=shorturl
Connection: close
Content-Type: text/html; charset=utf-8

HTTP/1.1 301 Moved Permanently
Server: nginx/1.6.2
Date: Tue, 28 Feb 2017 14:43:01 GMT
Content-Type: text/html
Content-Length: 184
Connection: keep-alive
Location: http://php.net/

You can see that the nginx server is behaving differently from the Apache one.

The bot was sometimes seeing a redirect to `/search.php`, but I haven't seen that with curl yet.

------------------------------------------------------------------------

--
Edit this bug report at https://bugs.php.net/bug.php?id=74181&edit=1

--
PHP Webmaster List Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php
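
The suggestion in the new comment above, reading the JSON release feed instead of scraping the home page, could look roughly like the sketch below. Only the feed URL comes from this report. The shape of the response is an assumption (a JSON object whose values each carry a "version" string), and the helper names `version_key`, `latest_version` and `fetch_releases` are made up for illustration, so check the real payload before relying on it.

```python
import json
from urllib.request import urlopen

# Feed URL quoted in the comment above.
RELEASES_URL = "http://php.net/releases/index.php?json"


def version_key(version):
    """Turn '7.1.2' into (7, 1, 2) so versions compare numerically.

    Non-numeric parts (for example an 'RC1' suffix) are ignored.
    """
    return tuple(int(part) for part in version.split(".") if part.isdigit())


def latest_version(releases):
    """Return the highest 'version' string found in the decoded feed.

    Assumption: `releases` is a dict whose values are dicts that carry a
    'version' string. Entries without one are skipped.
    """
    versions = [
        entry["version"]
        for entry in releases.values()
        if isinstance(entry, dict) and isinstance(entry.get("version"), str)
    ]
    if not versions:
        raise ValueError("no 'version' fields found in release feed")
    return max(versions, key=version_key)


def fetch_releases(url=RELEASES_URL, timeout=10):
    """Download and decode the feed. urlopen follows redirects by default,
    so a 301 from www.php.net to php.net would not break the bot."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

The bot would then call `latest_version(fetch_releases())` on its usual schedule. Keeping the parsing separate from the download means the comparison logic can be tested against a saved copy of the feed without touching the network.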
