ollemartensson commented on PR #560: URL: https://github.com/apache/arrow-julia/pull/560#issuecomment-3239844435
> Could you split this to smaller PRs for easy to review? > > For example, we don't need to mix C data interface support and tensor support, right? > Yes definitely, the work is already divided into three separate commits so I can split this PR into three distinct PRs instead. > > While having a career worth of coding experience, the code is mostly generated using claude. > > > > Can we license the code as ASF copyrighted Apache license 2.0 safely? > > (Can we ensure that the code doesn't include any copyrighted code?) A valid concern of course. And I the truth is that there probably is no way to get certain since these models have been trained on a mix of copyrighted code. However the same goes for human written code as well I suppose. For this particular case I can be quite sure where the inspiration comes from since implementation instructions are created as a result of analysing the existing code bases mentioned in the PR and that no big architecture patterns are applied. In other words the AI cannot just come up with these types of solutions(yet) and need to be boxed in by instructions. It's a very important and interesting topic and honestly I have no ensurance to give. What I can do(and are willing to do) is to put in more work analysing each part of the contributed code for potential violations and document the process. So let me come back with three separate PRs, go over each of them searching for and cleaning up potential license violations. On the positive side my initial tests exceeded my expectations performance wise. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
