[jira] [Commented] (SPARK-48009) Specifications for Apache Spark hadoop Avro append operation

Rushikesh Kavar (Jira) Fri, 26 Apr 2024 06:06:03 -0700


    [ 
https://issues.apache.org/jira/browse/SPARK-48009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17841213#comment-17841213
 ]


Rushikesh Kavar commented on SPARK-48009:
-----------------------------------------

import org.apache.spark.sql.SaveMode;
import org.example.avro.Writer;

import java.util.ArrayList;
import java.util.List;

public class OverrideAvro {

    public static void main(String[] args) {
            // C:\Users\kavarus\testing\spark-testing\data
        Writer.writeAvro(getMockData(), 
"C:\\Users\\kavarus\\testing\\spark-testing\\data", SaveMode.Overwrite);
    }

    public static List<Modal> getMockData() {
        List<Modal> lst = new ArrayList<>();
        lst.add(new Modal("1", "Test1", 26));
        lst.add(new Modal("2", "Test2", 28));
        return lst;
    }

}

> Specifications for Apache Spark hadoop Avro append operation
> ------------------------------------------------------------
>
>                 Key: SPARK-48009
>                 URL: https://issues.apache.org/jira/browse/SPARK-48009
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 3.4.3
>            Reporter: Rushikesh Kavar
>            Priority: Minor
>
> Consider a path /a/b/c 
> Assume, I write the avro to folder using apache spark.
> After it is written, Assume I try to append dataset to this to folder. 
> I want to see the specification of what happens in case of append.
> After doing PoC, I found out that when dataet which is appended is having 
> same schema as of existing data, data gets just appended. But I want to see 
> clear docs of what happens exactly in case of append.
> I am attaching my testing java code.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SPARK-48009) Specifications for Apache Spark hadoop Avro append operation

Reply via email to