RE: Beginner searching for guidance with Jira and issues

2018-03-21 Thread Efim Poberezkin
Thanks a lot Jose,

I’ll look into the issue you’ve recommended then. Will comment on Jira to 
indicate I’m working on it and ask further questions there if needed.

BR,
Efim

From: Joseph Torres [mailto:joseph.tor...@databricks.com]
Sent: Tuesday, March 20, 2018 8:41 PM
To: Efim Poberezkin 
Cc: dev@spark.apache.org
Subject: Re: Beginner searching for guidance with Jira and issues

Hi!

I can't speak for the other tasks, but SPARK-23444 I'd expect to be pretty 
complicated. It's not obvious what the right strategy is, and there's a bunch 
of minor stuff that needs to be cleaned up (e.g. tasks shouldn't print 
cancellation warnings when cancellation is expected).

If you're interested in working on continuous processing, 
https://issues.apache.org/jira/browse/SPARK-23503 could be a good newbie task. 
It's a pretty localized change to the EpochCoordinator class; basically, it 
needs to wait to call query.commit(n + 1) until after query.commit(n). I'm not 
sure how well I've managed to document the existing implementation, but I'd be 
happy to answer any questions about it.

Jose

On Tue, Mar 20, 2018 at 9:01 AM, Efim Poberezkin 
> wrote:
Good time of day,

I’d like to contribute to Spark development, but find it difficult to get into 
the process. I’m somewhat overwhelmed by Spark’s Jira as it’s hard for me to 
figure out the complexity of tasks and choose an appropriate one.
I’ve surfed Jira for some time and have selected a few issues I think I could 
try to solve:

https://issues.apache.org/jira/browse/SPARK-23444
https://issues.apache.org/jira/browse/SPARK-23693
https://issues.apache.org/jira/browse/SPARK-23673 - although for this one 
there’s an uncertainty that it’s needed at all, according to the comment

Also I think it would be interesting to work on Continuous Processing if there 
were some newbie tasks, but I wasn’t able to find them.
If you could give me some directions on any of these issues I’ve linked, or 
just point to some tasks that are suitable for a beginner, that’d help me a 
lot, I would appreciate any advice.

Best regards,
Efim



Re: Beginner searching for guidance with Jira and issues

2018-03-20 Thread Joseph Torres
Hi!

I can't speak for the other tasks, but SPARK-23444 I'd expect to be pretty
complicated. It's not obvious what the right strategy is, and there's a
bunch of minor stuff that needs to be cleaned up (e.g. tasks shouldn't
print cancellation warnings when cancellation is expected).

If you're interested in working on continuous processing, https://issues.
apache.org/jira/browse/SPARK-23503 could be a good newbie task. It's a
pretty localized change to the EpochCoordinator class; basically, it needs
to wait to call query.commit(n + 1) until after query.commit(n). I'm not
sure how well I've managed to document the existing implementation, but I'd
be happy to answer any questions about it.

Jose

On Tue, Mar 20, 2018 at 9:01 AM, Efim Poberezkin 
wrote:

> Good time of day,
>
>
>
> I’d like to contribute to Spark development, but find it difficult to get
> into the process. I’m somewhat overwhelmed by Spark’s Jira as it’s hard for
> me to figure out the complexity of tasks and choose an appropriate one.
>
> I’ve surfed Jira for some time and have selected a few issues I think I
> could try to solve:
>
>
>
> https://issues.apache.org/jira/browse/SPARK-23444
>
> https://issues.apache.org/jira/browse/SPARK-23693
>
> https://issues.apache.org/jira/browse/SPARK-23673 - although for this one
> there’s an uncertainty that it’s needed at all, according to the comment
>
>
>
> Also I think it would be interesting to work on Continuous Processing if
> there were some newbie tasks, but I wasn’t able to find them.
>
> If you could give me some directions on any of these issues I’ve linked,
> or just point to some tasks that are suitable for a beginner, that’d help
> me a lot, I would appreciate any advice.
>
>
>
> Best regards,
>
> Efim
>


Beginner searching for guidance with Jira and issues

2018-03-20 Thread Efim Poberezkin
Good time of day,

I'd like to contribute to Spark development, but find it difficult to get into 
the process. I'm somewhat overwhelmed by Spark's Jira as it's hard for me to 
figure out the complexity of tasks and choose an appropriate one.
I've surfed Jira for some time and have selected a few issues I think I could 
try to solve:

https://issues.apache.org/jira/browse/SPARK-23444
https://issues.apache.org/jira/browse/SPARK-23693
https://issues.apache.org/jira/browse/SPARK-23673 - although for this one 
there's an uncertainty that it's needed at all, according to the comment

Also I think it would be interesting to work on Continuous Processing if there 
were some newbie tasks, but I wasn't able to find them.
If you could give me some directions on any of these issues I've linked, or 
just point to some tasks that are suitable for a beginner, that'd help me a 
lot, I would appreciate any advice.

Best regards,
Efim