Re: [PR] [SPARK-50945][ML][PYTHON][CONNECT] Support Summarizer and SummaryBuilder on Connect [spark]

2025-02-07 Thread via GitHub


zhengruifeng commented on PR #49847:
URL: https://github.com/apache/spark/pull/49847#issuecomment-2644387096

   thanks, merged to master/4.0


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] [SPARK-50945][ML][PYTHON][CONNECT] Support Summarizer and SummaryBuilder on Connect [spark]

2025-02-07 Thread via GitHub


zhengruifeng closed pull request #49847: [SPARK-50945][ML][PYTHON][CONNECT] Support Summarizer and SummaryBuilder on Connect
URL: https://github.com/apache/spark/pull/49847





Re: [PR] [SPARK-50945][ML][PYTHON][CONNECT] Support Summarizer and SummaryBuilder on Connect [spark]

2025-02-07 Thread via GitHub


xinrong-meng commented on PR #49847:
URL: https://github.com/apache/spark/pull/49847#issuecomment-2643729415

   LGTM. Can we add an example (like the one below) to the user-facing change section of the PR description? Thank you!
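   ```
   from pyspark.ml.linalg import Vectors
   from pyspark.ml.stat import Summarizer

   # Assumes an active SparkSession named `spark`.
   data = [
       [Vectors.dense([1, 0, 0, -2]), 1.0],
       [Vectors.dense([4, 5, 0, 3]), 2.0],
       [Vectors.dense([6, 7, 0, 8]), 1.0],
       [Vectors.dense([9, 0, 0, 1]), 1.0],
   ]
   df = spark.createDataFrame(data, ["features", "weight"])

   # Build a summarizer for the requested metrics and apply it to the vector column.
   summarizer = Summarizer.metrics("mean", "count")
   res1 = df.select(summarizer.summary(df.features))
   ```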
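
For readers without a Spark session at hand, here is a rough plain-Python sketch of what the "mean" and "count" metrics would be expected to report for the four feature vectors above when no weight column is passed. The helper name `summarize` is hypothetical and is not part of the PySpark API; it only illustrates the arithmetic, not how Summarizer is implemented.

```python
# Plain-Python illustration only; `summarize` is a hypothetical helper,
# not a PySpark function. It mirrors the unweighted "mean" and "count" metrics.
rows = [
    [1, 0, 0, -2],
    [4, 5, 0, 3],
    [6, 7, 0, 8],
    [9, 0, 0, 1],
]


def summarize(vectors):
    """Return the per-dimension mean and the number of vectors."""
    count = len(vectors)
    # zip(*vectors) regroups the values by dimension (column).
    mean = [sum(column) / count for column in zip(*vectors)]
    return {"mean": mean, "count": count}


summary = summarize(rows)
print(summary)
```

The weight column in the example DataFrame is not used here, matching the call `summarizer.summary(df.features)`, which passes only the feature column.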
   





Re: [PR] [SPARK-50945][ML][PYTHON][CONNECT] Support Summarizer and SummaryBuilder on Connect [spark]

2025-02-06 Thread via GitHub


zhengruifeng commented on PR #49847:
URL: https://github.com/apache/spark/pull/49847#issuecomment-2641993211

   cc @HyukjinKwon and @wbo4958 

