kunwp1 opened a new issue, #3832:
URL: https://github.com/apache/texera/issues/3832

   ### What happened?
   
   The dataset upload on the Hub website is extremely slow compared to the 
local environment. When testing with a 5 GB file:
   
   - Local (single-node, latest main branch): Upload completes in approximately 
15 seconds.
   - Hub website: Upload takes over 10 minutes (estimated).
   
   The Hub deployment is currently 18 commits behind the latest main branch:
   
   - Hub commit: b0075f6350e86ed363209350f2a31ca6a4b48b9a
   - Local commit: 1c812a5e2d72a42a8b6698d286f41f618095a331
   
   No code changes related to dataset upload were made between these commits. I 
suspect that the slowdown may be caused by environmental factors (e.g., 
network, deployment configuration, infrastructure). 
   
   ### How to reproduce?
   
   1. Upload a 5 GB dataset using the Hub website.
   2. Upload the same 5 GB dataset using the local (single-node) environment on 
the latest main branch.
   3. Compare the upload durations.
   
   **Local Upload (Fast)**
   
https://github.com/user-attachments/assets/ae8c5af4-519d-44cc-acb7-de0a682c47f7
   
   **Hub Upload (Slow)**
   
https://github.com/user-attachments/assets/63433788-e2ff-4264-ad1d-527c746cb167
   
   ### Version
   
   1.1.0-incubating (Pre-release/Master)
   
   ### Commit Hash (Optional)
   
   _No response_
   
   ### What browsers are you seeing the problem on?
   
   _No response_
   
   ### Relevant log output
   
   ```shell
   
   ```
   
   ### Code of Conduct
   
   - [x] I agree to follow Apache Code of Conduct


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to