Spaarsh commented on issue #13896:
URL: https://github.com/apache/datafusion/issues/13896#issuecomment-2561346887

   @alihan-synnada This is more of an issue on the the CWI website. I went to 
the http://homepages.cwi.nl/~boncz/job/ URL (removing the last part of the url 
in the code), and found the imdb.tgz dataset link. Upon hovering over it, I 
came across the link that actually hosts the data: 
https://event.cwi.nl/da/job/imdb.tgz
   
   I changed the url in the script 
[here](https://github.com/apache/datafusion/blob/6cfd1cf1e030ccfe3b17621cc51fdcefcceae018/benchmarks/bench.sh#L463)
 and now the code works as expected:
   
![20241225_001005](https://github.com/user-attachments/assets/4ff8020c-8bae-4e2d-9c03-1b28a933426b)
   
   If this is an acceptable solution, I will make a PR.
   
   Here is the content of the page displayed at 
http://homepages.cwi.nl/~boncz/job/ (note that even the protocol is wrong, 
maybe due to changes in the host website, it should be https), notice the 
imdb.tgz at the bottom:
   
![image](https://github.com/user-attachments/assets/1562223a-4917-4d8a-8c7f-857ee19767d1)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to