Hi everyone, I’m reaching out to let the community know that I’ve started working on *ASTERIXDB-3745*: "Handle sampling errors when train_list contains incomplete or incompatible vectors*.*"
I noticed that the vector index creation can fail when the sampler encounters null or dimensionally mismatched vectors in an open-type dataset. I'm currently working on a patch to gracefully filter these out during the sampling phase to make the index creation more robust. My Jira account request is currently pending. Once it is approved, I will formally assign the issue to myself. In the meantime, I wanted to provide this update to avoid any duplicated effort on this ticket. Best regards, Tejesh Sakhamuri <https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail> Virus-free.www.avast.com <https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail> <#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>
