https://cwiki.apache.org/confluence/display/HUDI/20201103+Bi+Weekly+Sync+Minutes
Next meeting date : Nov 17, 2020 8PM PST
be better.
>
>
> On Mon, Nov 2, 2020 at 7:56 PM Vinoth Chandar wrote:
>
> > Looks like a lot of us are in favor.
> > Anyone objecting?
> >
> > On Mon, Nov 2, 2020 at 9:11 AM Mani Jindal wrote:
> >
> > > +1
> > >
> > > On Mon
> +1
> > >
> > > On Mon, Nov 2, 2020 at 11:57 AM Vinoth Chandar
> > wrote:
> > >
> > > > +1
> > > >
> > > > On Mon, Nov 2, 2020 at 8:44 AM Balaji Varadarajan
> > > > wrote:
> > > >
> > > >
+1
On Mon, Nov 2, 2020 at 8:44 AM Balaji Varadarajan
wrote:
> +1
> On Sunday, November 1, 2020, 09:13:44 PM PST, Gary Li <
> garyli1...@outlook.com> wrote:
>
> +1 for biweekly meeting.
> Gary Li
> From: Vinoth Chandar
> Sent: Friday, October 30, 2020 2:01:22 P
+ users list as well.
On Thu, Oct 29, 2020 at 10:59 PM Bhavani Sudha
wrote:
> Hello all,
> I was wondering if it would make sense to move the weekly sync meeting to
> bi-weekly to amortize time and be efficient, especially since people across
> different time zones attend. We could still retain
https://cwiki.apache.org/confluence/display/HUDI/20201027+Weekly+Sync+Minutes
Thanks
Vinoth
On Wed, 21 Oct 2020 at 7:45 AM, Vinoth Chandar <
> mail.vinoth.chan...@gmail.com> wrote:
>
> > For now, bloom filters are not actually leveraged in the read/query path
> > but only by the writer performing the index lookup for upserting. Hudi is
> > write optimized like a
https://cwiki.apache.org/confluence/display/HUDI/20201020+Weekly+Sync+Minutes
Thanks
Vinoth
For now, bloom filters are not actually leveraged in the read/query path
but only by the writer performing the index lookup for upserting. Hudi is
write optimized like an OLTP store and read optimized like OLAP, if
that makes sense.
As for bloom index performance, our tuning guide and FAQ talk abo
Hello all,
Can you please add any recent talks to our powered by/talks page?
I know we had an ApacheCon and maybe one more talk?
I opened https://github.com/apache/hudi/pull/2155 for the PrestoCon panel
Thanks
Vinoth
https://cwiki.apache.org/confluence/display/HUDI/20200929+Weekly+Sync+Minutes
Thanks
Vinoth
> >
> > > You have mentioned Spark3.0 support in next release. We were actually
> >
> > > thinking of moving to Spark 3.0 but thought it’s too early with 0.6
> >
> > > release. Is 0.6 not fully tested with Spark 3.0 ?
> >
>
>
> On Sat, 19 Sep 2020 at 11:35 PM, Vinoth Chandar wrote:
>
> > We could add support to validate the data frame against a schema string
> >
> > passed
> >
> > to the data source writer. I guess you want the dataframe to be also
> >
> > converted int
https://cwiki.apache.org/confluence/display/HUDI/20200922+Weekly+Sync+Minutes
Thanks
Vinoth
Hello all,
Pursuant to our conversation around release planning, I am happy to share
the initial set of proposals for the next minor/major releases (minor
release ofc can go out based on time)
*Next Minor version 0.6.1 (with stuff that did not make it to 0.6.0..) *
Flink/Writer common refactoring
ach dataset as each
>
> dataset is unique in our case and hence the need to validate the schema for
>
> each dataset while writing.
>
>
>
> On Tue, 15 Sep 2020 at 2:53 AM, Vinoth Chandar wrote:
>
>
>
> > Hi,
>
> >
>
> >
>
> >
>
>
I don't have deep thoughts on this. Uniformity is good; being less dependent
on other libraries is good. As long as we can implement the same
functionality and
take care of concurrency etc., I am a +1
On Mon, Sep 14, 2020 at 9:26 AM Pratyaksh Sharma
wrote:
> Hi Raymond,
>
>
>
> I was actually loo
it feasible. Or it may be easier to
>
> hit by limiting the minor version to bug fix and docs update.
>
>
>
> On Tue, Sep 8, 2020 at 10:41 PM Pratyaksh Sharma
>
> wrote:
>
>
>
> > Missed this thread, the plan looks good to me as well.
>
> >
Wiki page updated with new timing!
https://cwiki.apache.org/confluence/display/HUDI/Apache+Hudi+Community+Weekly+Sync
On Thu, Sep 17, 2020 at 3:41 PM Gary Li wrote:
> I am okay with both :)
>
> Gary Li
> ____
> From: Vinoth Chandar
> Sent: Thur
n kid care duties some days. We can go ahead with the change
> if
> > timing is fine with everyone.
> >
> > Thanks,
> > Sudha
> >
> > On Tue, Sep 15, 2020 at 7:08 AM Vinoth Chandar
> wrote:
> >
> > > Folks,
> > >
> >
https://cwiki.apache.org/confluence/display/HUDI/20200915+Weekly+Sync+Minutes
Thanks
Vinoth
Folks,
Please chime in with your opinions. I still can see some regulars (e.g
Nishith, Sudha, Gary) who have not chimed in
On Tue, Sep 15, 2020 at 12:22 AM Pratyaksh Sharma
wrote:
> Hi,
>
> Just wanted to confirm the time for this week's sync up. @Vinoth Chandar
>
>
>
landing with the ACID semantic.
> Yes, through the scheduler engine like Apache Airflow, we can read these
> data from the storage then process them. But the key difference, we can
> avoid re-reading these data from the persistent storage again.
>
> [1] https://github.com/linkedin/
nks Vinoth. Yes that’s always an option with me to validate myself. I
> just wanted to confirm if Spark does it for me for all my datasets and I
> wonder why they haven’t provided it for write but provided it for read.
>
> On Sat, 12 Sep 2020 at 9:02 PM, Vinoth Chandar
Actually, we merged this PR which should make this possible.
https://github.com/apache/hudi/pull/1638#issuecomment-629990534
udit/wenning, can you comment on where we are on this?
On Thu, Sep 10, 2020 at 11:28 AM Pratyaksh Sharma
wrote:
> Hi Selvaraj,
>
> Currently Hudi works with Hadoop 2.7.
Hi,
IIUC, you want to be able to pass in a schema to write? AFAIK, Spark
Datasource V1 at least does not allow for passing in the schema.
Hudi writing will just use the schema for the df you pass in.
Just throwing it out there. Can you write a step to drop all unnecessary
columns before issuing th
Hi Sanjay,
Overall the two proposals sound reasonable to me. Thanks for describing
them so well.
General comment, it seems like you are implementing multi AZ replication by
matching commit times across AZs?
I do want to name these properties to be consistent with other Hudi
terminology. but we ca
it's possible that some of the commands are not erroring gracefully for
missing parameters?
hudi:tablename->savepoint create
for eg, would need a commit time for creating the savepoint,
if you are able to connect to the dataset, then it should all be working,
On Wed, Sep 9, 2020 at 3:27 AM Prat
aking progress. Just some questions pop into my
> head. I think my questions are all solvable. Happy to discuss more in the
> RFC if we move forward :)
>
> Best,
> Gary
>
> Gary Li
>
> From: Vinoth Chandar
> Sent: Wednesday, September 2
n.
>
> So, in short, this proposal tries to bring something:
>
> - performance: better performance when processing after data ingestion;
> - focus and fluent: inline ingestion and processing logic in some
>   scenarios;
> - boundary: at a high level, introduce more abili
https://cwiki.apache.org/confluence/display/HUDI/20200908+Weekly+Sync+Minutes
Please find this week's sync notes
can give it a try.
>
> On Tue, Sep 8, 2020 at 5:55 PM Mehrotra, Udit
> wrote:
>
> > +1 on the process.
> >
> > On 9/8/20, 5:11 PM, "Vinoth Chandar" wrote:
> >
> > CAUTION: This email originated from outside of the organization. Do
> > n
r 1, 2020, 04:56:55 PM PDT, Gary Li <
> >> garyli1...@outlook.com> wrote:
> >>
> >> +1
> >> Gary Li
> >> From: Bhavani Sudha
> >> Sent: Wednesday, September 2, 2020 3:11:06 AM
> >> To: us...@hudi.apache.org
> >> Cc: dev@hudi.apache.org
>
Anyone else wants to chime in for a new time, that works for everyone?
Personally, I can do this time.
love to hear more inputs.
On Wed, Sep 2, 2020 at 10:16 AM Pratyaksh Sharma
wrote:
> Hi everyone,
>
> Currently we are having weekly sync ups between 9 PM - 10 PM PST on
> tuesdays. Since I h
That does sound like a backwards compatible change.
@prashant , any ideas here? (since you have the best context on the schema
validation checks)
On Thu, Sep 3, 2020 at 8:12 PM cadl wrote:
> Hi All,
>
> I want to change the type of one column in my COW table, from int to long.
> When I set “hood
Hi all,
I am really excited to share the good news about our new committers on the
project!
*Udit Mehrotra *: Udit has travelled with the project since sept/oct last
year and immensely helped us make Hudi work well with the AWS ecosystem.
His most notable contributions are towards driving large
wrote:
>
> Aah, yes. That’s right.
>
> On Sat, Aug 22, 2020 at 2:43 AM Vinoth Chandar
> wrote:
>
> > All of the remaining meta fields compress very very nicely. They have
> >
> > almost no overhead.
> >
> >
> >
Hello all,
Put together a list to formalize the things we follow in code review
process today. Please chime in on the PR review, for comments.
https://github.com/apache/hudi/pull/2061
Thanks
Vinoth
Hi all,
Love to start a discussion around how we can formalize the release process,
timelines more so that we can ensure timely and quality releases.
Below is an outline of an idea that was discussed in the last community
sync (also in the weekly sync notes).
- We will do a "feature driven" majo
Hi,
While I agree on bringing more of these capabilities to Hudi natively, I
have a few questions/concerns on the specific approach.
> And these calculation functions should be engine independent. Therefore,
I plan to introduce some new APIs that allow users to directly define
Today, if I am a Spa
Great! More docs here.
https://hudi.apache.org/docs/writing_data.html#key-generation
On Tue, Sep 1, 2020 at 3:26 AM Raghvendra Dhar Dubey
wrote:
> I got it working by adding an option
> hoodie.datasource.write.keygenerator.class =
> org.apache.hudi.keygen.ComplexKeyGenerator
>
> On Tue, Sep 1,
+1 this is a great way to also ramp on the code base
On Sun, Aug 30, 2020 at 8:00 AM Sivabalan wrote:
> As Hudi matures as a project, we need to get our devX and test infra rock
> solid. Availability of test utils and base classes for ease of writing more
> tests, stable integration tests, ease
https://cwiki.apache.org/confluence/display/HUDI/20200825+Weekly+Sync+Minutes
ting will make it more convenient for
> developers
>
> > to deal with code styles. On the other hand, it will also make the
>
> > community more complicated when considering related conventions and weigh
>
> > more factors.
>
> >
>
> > Best,
>
>
> We have tried a couple of solutions, but so far without success :
>
> > - replay the data omitting the data of the persons who have requested to
>
> > be forgotten. We wanted to manipulate the commit times to rebuild the
>
> > history.
>
> > We found that we co
Hello all,
I am looking for help squashing the following flaky tests.
https://api.travis-ci.org/v3/job/719453775/log.txt
[INFO]
[ERROR] Failures:
[ERROR]
TestKeyRangeLookupTree.testFileGroupLookUpManyEntriesWithSameStartValue:76->testRangeOfInputs:154
expected: <[580cf7f7-9269-4670-a11a-66ce66e6f
- announce
Folks, please keep the follow ups to dev@ and users@
On Mon, Aug 24, 2020 at 9:26 PM vino yang wrote:
> Great news!
>
> Thanks to Bhavani Sudha for driving the release! And thanks to every one of
> the whole community!
>
> Best,
> Vino
>
> Bhavani Sudha 于2020年8月25日周二 上午11:37写道:
>
Hi folks,
As you may have noticed, the 0.6.0 release is out. Huge shoutout to
our RM, Sudha, for pulling this off!
As always, thanks to all our users/contributors. Congrats, everyone!
Onwards and upwards to the next one.
Thanks
Vinoth
On Thu, Aug 20, 2020 at 11:32 AM Vinoth Chandar wrote
+1 (binding)
- Ran the RC checks I typically do
- Ran a smoke test on both COW and MOR tables
- by running a lot of commits over a longer period of time,
- verifying the state of the dataset
- count validation matches.
On Sat, Aug 22, 2020 at 6:08 AM leesf wrote:
> +1 (binding)
> - mvn clean
Sharma
> > wrote:
> >
> > > This is a good option to have. :)
> > >
> > > On Thu, Aug 20, 2020 at 11:25 PM Vinoth Chandar
> > wrote:
> > >
> > > > > IIRC _hoodie_record_key was supposed to be this standardized key field.
> :)
> >
story) and save storage space
> using Hudi.
>
> Can anyone see a way to achieve this?
>
> Kind Regards,
> David Rosalia
>
>
> Get Outlook for Android<https://aka.ms/ghei36>
>
>
> From: Vinoth Chandar
> Sent: Friday, August
has been keeping checkstyle, IDE and spotless
> >>> agreeing
> >>>> on the same thing.
> >>>>
> >>>> Yes, it's the key thing. But, IMO, we can ignore the IDE here, if it
> >>> breaks
> >>>> the code style, chec
cumented here. but
> this
> > >> is
> > >>
> > >>
> > >> the ticket AFAIK: https://issues.apache.org/jira/browse/HUDI-1177
> > >>
> > >>
> > >>
> > >>
> > >>
> > >> On Wed, Aug 1
I would love for all these new things to be revamped on top of Spark 3's newer
APIs
(it's kind of frustrating that the datasource APIs don't stabilize easily
in Spark)
I am thinking we can implement a "hudi3" format using Spark 3, with support
for SQL Merges, existing functionality and a redone Spark S
IIRC _hoodie_record_key was supposed to be this standardized key field. :)
Anyways, it's good to provide this option to the user.
So +1 for an RFC/further discussion.
To level set, I want to also share some of the benefits of having an
explicit key column.
a) if you build your data lake using a bunch o
>
> On Fri, Aug 14, 2020 at 5:56 PM Vinoth Chandar wrote:
>
> > Thanks Sudha! This means master is now open for regular PRs. Thanks
> for
> > your patience, everyone.
> >
> > On Fri, Aug 14, 2020 at 3:51 PM Bhavani Sudha
> > wrote:
> >
> > > Hel
https://cwiki.apache.org/confluence/display/HUDI/20200818+Weekly+Sync+Minutes
> > +1 on standardizing code formatting. On Tuesday, August 18, 2020,
> > 03:58:42 PM PDT, Vinoth Chandar wrote:
> >
> > can more people please chime in? This will affect all of us on a daily
> > basis :)
> >
> > On Thu, Aug 13, 2020 at
can more people please chime in? This will affect all of us on a daily
basis :)
On Thu, Aug 13, 2020 at 8:25 AM Gary Li wrote:
> Vote for mvn spotless:apply to do the auto fix.
>
> On Thu, Aug 13, 2020 at 1:13 AM Vinoth Chandar wrote:
>
> > Hi,
> >
> > Anyone has
but I am now able to remove
> flatMap and could use Dataset joins.
>
> Thanks again for all your help as always !!
>
>
>
>
> On Thu, Aug 13, 2020 at 1:42 PM Vinoth Chandar wrote:
>
> > Hi Tanuj,
> >
> > From this example, it appears as if you are tryin
tabilization and will cut the release
> > once we stabilize the builds hopefully tonight/tomorrow.
> > Thanks,Balaji.V
> > On Tuesday, August 11, 2020, 09:15:05 PM PDT, Vinoth Chandar <
> > vin...@apache.org> wrote:
> >
> > Hello all,
> >
> >
+1 thanks leesf. I actually find these very useful when composing the
reports also. :)
On Sun, Aug 9, 2020 at 5:32 PM vino yang wrote:
> Thanks to leesf for continuously updating Hudi weekly.
>
> It is great to see that more and more improvements are being proposed in
> the community.
>
> Best,
>
Hi,
On re-ingesting, do you mean to say you want to overwrite the table, while
not getting the changes in the incremental query? This has not come up
before.
As you can imagine, it'd be a tricky scenario, where we need some special
handling/action type introduced.
Yes, yes on the next two questions.
Hi,
Does anyone have thoughts on this?
Especially leesf/vinoyang, given you both drove much of the initial cleanups.
On Mon, Aug 10, 2020 at 7:16 PM Shiyan Xu
wrote:
> in that case, yes, all for automation.
>
> On Mon, Aug 10, 2020 at 7:12 PM Vinoth Chandar wrote:
>
> > Overall,
Hi Tanuj,
From this example, it appears as if you are trying to use sparkSession from
within the executor? This will be problematic. Can you please open a
support ticket with the full stack trace?
I think what you are describing is a join between Kafka and Hudi tables. So
I'd read from Kafka fir
for this is tomorrow night PST (Aug 12, PST).
We will keep this thread posted!
Thanks
Vinoth
On Tue, Aug 4, 2020 at 9:47 PM Vinoth Chandar wrote:
> Small correction:
>
> >> Vinoth working on code review, tests for PR 1876,
> This is landed!
>
>
> On Tue, Aug 4, 20
https://cwiki.apache.org/confluence/display/HUDI/20200811+Weekly+Sync+Minutes
Cheers
Vinoth
Overall, I think we should standardize this across the project.
But most importantly, maybe revive the long-dormant spotless effort first
to enable autofixing of checkstyle issues, before we add more checking?
On Mon, Aug 10, 2020 at 7:04 PM Shiyan Xu
wrote:
> Hi all,
>
> I noticed that through
rticle, I just tried to use Intellij IDEA to access
> Github features.
>
>
> leesf 于2020年8月8日周六 下午5:54写道:
>
> > helpful and thanks for writing up.
> >
> > Vinoth Chandar 于2020年8月8日周六 下午12:53写道:
> >
> > > Hello all,
> > >
> > > Few p
Hello all,
Few people have asked me this on separate occasions. So thought I'll add a
wiki page on how to checkout, push changes to PRs. Would be useful for all
committers.
https://cwiki.apache.org/confluence/display/HUDI/Resources#Resources-PushingChangesToPRs
Thanks
vinoth
Hi,
IIUC, what you want is for the deletes to be applied on different versions
of the data? so that no time travel query can read the deleted field again.
I am afraid this cannot be achieved as-is today and would need logging
these deletes for older base files - that might be one way to achieve th
Great job, Sudha/Brandon! This is great. Now we can keep improving the
performance from here on.
On Wed, Aug 5, 2020 at 10:12 PM Bhavani Sudha
wrote:
> This PR is landed today and will be available in the next Presto release.
> Thanks to Brandon for the Presto fixes.
>
> - Sudha
>
>
> On Tue, Jul 14,
DI-845 : Parallel writing i.e allow multiple writers (Pushed out of
>0.6.0)
>- HUDI-860 : Small File Handling without memory caching (Pushed out of
>0.6.0)
>
>
> Thanks,
> Sudha
>
> On Mon, Aug 3, 2020 at 3:41 PM Vinoth Chandar wrote:
>
> > +1 (w
https://cwiki.apache.org/confluence/display/HUDI/20200804+Weekly+Sync+Minutes
Thanks
Vinoth
nd bump its priority.
>
> Please share your thoughts or concerns.
>
> Thanks,
> Sudha
>
>
> On Mon, Aug 3, 2020 at 8:19 AM Vinoth Chandar wrote:
>
> > Given enough time has passed, Sudha can be our RM for 0.6.0.
> >
> > On the release blocker progress,
Given enough time has passed, Sudha can be our RM for 0.6.0.
On the release blocker progress, we landed a few blockers over the weekend,
with some almost ready for landing.
Will send out a status update again tomorrow night PST!
On Mon, Aug 3, 2020 at 8:17 AM Vinoth Chandar wrote:
> Hi an
> released? Can't find any dates on Hudi related pages.
>
> On Thu, Jul 30, 2020 at 10:36 AM Vinoth Chandar wrote:
>
> > Is anyone able to help with the at risk items? :)
> >
> > On Thu, Jul 30, 2020 at 7:07 AM leesf wrote:
> >
> > > @Vinoth Chandar
020 at 11:52 PM, Zijing Guo
> > >
> > > > wrote:
> > > >
> > > > > Thanks for the great session Vinoth! Can we have those session
> in a
> > > > > regular basis? I personally find today's session are super helpful!
> > >
from cursory looks of the parent pom.xml I
> > couldn't find anything wrong.
> >
> > Thanks,
> > Nishith
> >
> > On Fri, Jul 31, 2020 at 8:23 AM Vinoth Chandar
> wrote:
> >
> > > Hello all,
> > >
> > > integ-tests are currently fa
Hello all,
integ-tests are currently failing due to exceeding the log limit on master
branch. Nishith is actively debugging what's going on.
I request you to hold off merging more PRs in the meantime, until we
resolve this.
@nishith, please update this thread when master is stable again
than
Is anyone able to help with the at risk items? :)
On Thu, Jul 30, 2020 at 7:07 AM leesf wrote:
> @Vinoth Chandar Thanks for the reminder, marked to
> blocker, and next week would be ok to me.
>
> Vinoth Chandar 于2020年7月30日周四 上午11:35写道:
>
> > @leesf can we please mark
Thanks everyone who joined!
I am hanging out in #general on slack, if we want to finish off any
remaining questions. Please @vc me for questions.
On Thu, Jul 30, 2020 at 8:00 AM Vinoth Chandar wrote:
> yes! Please join
>
> On Thu, Jul 30, 2020 at 7:35 AM Pratyaksh Sharma
> wr
yes! Please join
On Thu, Jul 30, 2020 at 7:35 AM Pratyaksh Sharma
wrote:
> Hi Vinoth,
>
> Is this happening now?
>
> On Mon, Jul 27, 2020 at 3:50 AM Vinoth Chandar wrote:
>
> > Hi all,
> >
> > We will be using the conference link we use for
Q2
> > > > has been really hard with COVID and everything going on. Given that
> we
> > > are
> > > > at this point, I feel by delaying the RC by a week or so more if we
> can
> > > get
> > > > some of the 'At risk' items i
Hello all,
Just wanted to kickstart a thread to firm up the RC cut date for 0.6.0 and
pick an RM. (Any volunteers? If not, I nominate myself.)
Here's an update on where we are at with the remaining release blockers. I
have marked items as "At risk" assuming we cut RC sometime next week.
Please
https://cwiki.apache.org/confluence/display/HUDI/20200728+Weekly+Sync+Minutes
Thanks!
Vinoth
Thanks for being so awesome, Raymond!
On Tue, Jul 28, 2020 at 4:23 PM Shiyan Xu
wrote:
> yup i can make a PR for this.
>
> On Tue, Jul 28, 2020 at 2:30 PM Vinoth Chandar wrote:
>
> > Makes sense. Can we update some docs with this IDE setup?
> >
> > On Tue, Jul
+1 we could support a built in Kafka based notification mechanism..
Could we keep that in hudi-utilities instead of hudi-client?
On Thu, Jul 23, 2020 at 11:18 PM wangxianghu wrote:
> Hi Gray
> Thanks for reply. It is the latter. We can use the callback to publish
> write commit message to extern
Makes sense. Can we update some docs with this IDE setup?
On Tue, Jul 28, 2020 at 10:32 AM Shiyan Xu
wrote:
> Sure... here it is
> https://gist.github.com/xushiyan/db4d4067657abe6b8872ef12473b7087
>
> On Tue, Jul 28, 2020 at 9:53 AM Vinoth Chandar wrote:
>
> > Unfortunate
> - Plugs in at the time of spark query planning to allow for automatic
> > > indexing optimizations based on the created index (something I found
> > > interesting and worth exploring especially for RFC-08)
> > >
> > > +1 on stepping the gas on RFC-08/15 for record level +
>> On Mon, Jul 27, 2020 at 11:11 PM Y Ethan Guo
> >> wrote:
> >>
> >>> I see. I'll check the travis CI setup later. I'm unblocked now for
> >>> running the unit tests locally.
> >>>
> >>> Thanks,
> >>> - Ethan
>
> > +1
> >
> > Having the metrics flexibly in common will help in building observability
> > in other modules.
> >
> > Thanks,
> > Nishith
> >
> > > On Jul 28, 2020, at 7:28 AM, Vinoth Chandar wrote:
> > >
> > > +1 as well.
+1 as well.
Given we support many reporters now, could you please further
improve/retain modularity?
On Mon, Jul 27, 2020 at 6:30 PM vino yang wrote:
> Hi Modi,
>
> +1 for this proposal.
>
> I agree with your opinion that the metric report should not only report the
> client's metrics.
>
> And
this error, set spark.driver.allowMultipleContexts = true. The
> currently running SparkContext was created at:
>
> - "mvn test -Punit-tests -pl hudi-client -B" from CI: It encounters
> "java.lang.OutOfMemoryError: GC overhead limit exceeded".
>
> What I do now
ng roadmap and would be
> good to see some collaboration here as well.
>
> Thanks,
> Nishith
>
> On Sun, Jul 26, 2020 at 3:28 PM Vinoth Chandar wrote:
>
> > Hello all,
> >
> > In case you have not followed Hyperspace is a new indexing subsystem for
> > S
Hi Ethan,
For purposes of unblocking yourself, can you try running them locally via
mvn command via terminal?
thanks
vinoth
On Sun, Jul 26, 2020 at 4:12 PM Y Ethan Guo
wrote:
> Hi,
>
> I'm working on hudi-client module and I notice that if I run all unit tests
> under hudi-client locally in In
Hello all,
In case you have not followed, Hyperspace is a new indexing subsystem for
Spark from Microsoft. It seemed like a very interesting project and I tried
to explore if it can help us with an indexing option inside Hudi.
TL;DR :
- Was exploring if hyperspace can be used as an alternative fo
7:53 AM Adam Feldman wrote:
> Great! Thank you
>
> On Thu, Jul 23, 2020, 10:49 Vinoth Chandar wrote:
>
> > Hi Adam,
> >
> > Next week. July 30th 8AM PST.
> >
> > I will be sending dial in information over the weekend.
> >
> >
> >
> >
> >
> >
> > Sent from Yahoo Mail for iPhone
> >
> >
> > On Wednesday, July 15, 2020, 11:42 PM, Vinoth Chandar >
> > wrote:
> >
> > Great! Moving on to date. Would July 23/30 Thursday 8 AM PST work for
> > everyone?
> >
> > O