Apache spark does have a template for PR's
https://github.com/apache/spark/blob/master/.github/PULL_REQUEST_TEMPLATE

søn. 25. aug. 2024 kl. 13:41 skrev Mich Talebzadeh <
mich.talebza...@gmail.com>:

> Unfortunately it is not that straight forward
>
>
>    1. Committer Votes: The PR needs a sufficient number of "+1" votes
>    from *committers.*
>    2. Review Process: Address feedback from the community and committers
>    to ensure the PR meets the necessary standards.
>    3. Approval: Once approved by committers, the PR can be merged into
>    the main codebase.
>
>
> HTH
>
>
>
> On Sun, 25 Aug 2024 at 08:17, Mark Andreev <mark.andr...@gmail.com> wrote:
>
>> Thank you for your review.
>>
>> Could you explain how to merge this commit into the upstream? I don't
>> want this PR to be abandoned.
>>
>> Best regards,
>> Mark Andreev
>>
>>
>> On Wed, 21 Aug 2024 at 23:08, Mich Talebzadeh <mich.talebza...@gmail.com>
>> wrote:
>>
>>> Hi Mark,
>>>
>>> You have already done that and have made the request for review.
>>>
>>> +1 for me
>>>
>>> Mich Talebzadeh,
>>>
>>> Architect | Data Engineer | Data Science | Financial Crime
>>> PhD <https://en.wikipedia.org/wiki/Doctor_of_Philosophy> Imperial
>>> College London <https://en.wikipedia.org/wiki/Imperial_College_London>
>>> London, United Kingdom
>>>
>>>
>>>    view my Linkedin profile
>>> <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/>
>>>
>>>
>>>  https://en.everybodywiki.com/Mich_Talebzadeh
>>>
>>>
>>>
>>> *Disclaimer:* The information provided is correct to the best of my
>>> knowledge but of course cannot be guaranteed . It is essential to note
>>> that, as with any advice, quote "one test result is worth one-thousand
>>> expert opinions (Werner
>>> <https://en.wikipedia.org/wiki/Wernher_von_Braun>Von Braun
>>> <https://en.wikipedia.org/wiki/Wernher_von_Braun>)".
>>>
>>>
>>> On Wed, 21 Aug 2024 at 22:20, Mark Andreev <mark.andr...@gmail.com>
>>> wrote:
>>>
>>>> Thank you, Mich.
>>>>
>>>> What is the correct procedure to request a review?
>>>>
>>>> On Tue, 20 Aug 2024 at 22:57, Mich Talebzadeh <
>>>> mich.talebza...@gmail.com> wrote:
>>>>
>>>>> Hi Mark,
>>>>>
>>>>> Added a comment to Jira to provide more clarity to Description
>>>>>
>>>>> When encountering mixed schema rows, the current error message
>>>>> "{actual} is not a valid external type for schema of {expected}" lacks
>>>>> sufficient detail to identify the problematic column. This ambiguity
>>>>> hinders troubleshooting and increases development time.
>>>>>
>>>>> To enhance error clarity, we propose incorporating the source column
>>>>> name into the error message. For example: "Column 'my_column' has an 
>>>>> actual
>>>>> type of {actual} which is not a valid external type for the expected 
>>>>> schema
>>>>> of {expected}."
>>>>>
>>>>> By providing this additional context, developers can more efficiently
>>>>> pinpoint and resolve schema mismatches.
>>>>>
>>>>>
>>>>> HTH
>>>>>
>>>>> Mich Talebzadeh,
>>>>>
>>>>> Architect | Data Engineer | Data Science | Financial Crime
>>>>> PhD <https://en.wikipedia.org/wiki/Doctor_of_Philosophy> Imperial
>>>>> College London <https://en.wikipedia.org/wiki/Imperial_College_London>
>>>>>
>>>>> London, United Kingdom
>>>>>
>>>>>
>>>>>    view my Linkedin profile
>>>>> <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/>
>>>>>
>>>>>
>>>>>  https://en.everybodywiki.com/Mich_Talebzadeh
>>>>>
>>>>>
>>>>>
>>>>> *Disclaimer:* The information provided is correct to the best of my
>>>>> knowledge but of course cannot be guaranteed . It is essential to note
>>>>> that, as with any advice, quote "one test result is worth one-thousand
>>>>> expert opinions (Werner
>>>>> <https://en.wikipedia.org/wiki/Wernher_von_Braun>Von Braun
>>>>> <https://en.wikipedia.org/wiki/Wernher_von_Braun>)".
>>>>>
>>>>>
>>>>> On Tue, 20 Aug 2024 at 21:59, Mark Andreev <mark.andr...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> Hi,
>>>>>>
>>>>>> Could you review my small PR [SPARK-49044][SQL] ValidateExternalType
>>>>>> should return a child in error (
>>>>>> https://github.com/apache/spark/pull/47522 )?  Changes contain tests
>>>>>> that verify results.
>>>>>>
>>>>>> TLDR: After fix error message will contain extra information: [B is
>>>>>> not a valid external type for schema of string at
>>>>>> getexternalrowfield(assertnotnull(input[0, org.apache.spark.sql.Row,
>>>>>> true]), 1, f3)
>>>>>> If you need more information, please let me know. If you're busy,
>>>>>> please let me know the best time to reach you again.
>>>>>>
>>>>>> On Mon, 29 Jul 2024 at 18:15, Mark Andreev <mark.andr...@gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>> Hi Spark Devs,
>>>>>>>
>>>>>>> Please review my PR [ https://github.com/apache/spark/pull/47522 ]
>>>>>>> that relates to ticket [
>>>>>>> https://issues.apache.org/jira/browse/SPARK-49044 ].
>>>>>>>
>>>>>>> Context: When we have mixed schema rows, the error message "{actual}
>>>>>>> is not a valid external type for schema of {expected}" doesn't help to
>>>>>>> understand the column with the problem. I suggest adding information 
>>>>>>> about
>>>>>>> the source column.
>>>>>>>
>>>>>>> Example:
>>>>>>> https://github.com/mrk-andreev/example-spark-schema/blob/main/spark_4.0.0/src/test/scala/ErrorMsgSuite.scala
>>>>>>>
>>>>>>> Before fix: [B is not a valid external type for schema of string
>>>>>>> After fix: [B is not a valid external type for schema of string at
>>>>>>> getexternalrowfield(assertnotnull(input[0, org.apache.spark.sql.Row,
>>>>>>> true]), 1, f3)
>>>>>>>
>>>>>>> --
>>>>>>> Best regards,
>>>>>>> Mark Andreev
>>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> Best regards,
>>>>>> Mark Andreev
>>>>>>
>>>>>
>>>>
>>>> --
>>>> Best regards,
>>>> Mark Andreev
>>>>
>>>

-- 
Bjørn Jørgensen
Vestre Aspehaug 4, 6010 Ålesund
Norge

+47 480 94 297

Reply via email to