Hi Dave Thanks for stepping in, I was hoping for someone to help us do this properly. (And some weeks ago googled a bit, but it seems we should be aligned with the ASF policies I read them, but have forgotten now which they were...)
On Wed, Aug 6, 2025 at 2:14 AM Dave Fisher <w...@apache.org> wrote: > Before 2 there is some due diligence required. Maybe the following has > been discussed and I missed it, but: > > No, this is the first time we discuss it on this list, properly. (And it wasn't until this thread that I've been convinced we have to merge these two pieces of code. To start with the basics, we currently depend on this pip package. https://pypi.org/project/signal-processing-algorithms/ The public source code repo is https://github.com/mongodb/signal-processing-algorithms If you dig into it, you will quickly realize this is not MongoDB's main development repo, rather a public repo where new code is periodically published, perhaps in relation to some article being published at a conference. In particular, the repo doesn't use tags to denote release versions, so we would have to go by date and just tribal knowledge to check out the correct point in time that corresponds to the rather old version 1.something that Otava still depends on. It is the very heart of Otava, the math, while the code in apache/otava repo adds a CLI interface, support for CSV input as well as several databases, configuration files, and I believe some notification mechanism? Slack / email??? 2a. Is this signal processing code Apache License, v2.0? > Yes, as can be clearly verified from both of the above links. > 2b. What is the IP provenance of this signal processing code? Who holds > the copyright? > This was written 100% by MongoDB employees and published with the appropriate approval processes. For the version we use most of the code was written by William Brown and Jim O'Leary. Later code I was no longer there, but I believe was written by David Bradford and Alexander Costas. (David Daly will now better, and we're still hoping the current author would actually show up here...) > 2c. How large is this codebase? > > The initial dump seems to be 2600 lines. The commit history after that is relatively short, and in any case we don't currently use the newer features. For context, an old version of Otava I happened to have checked out as I write this, is 6200 lines. For historical context, we did approach MongoDB when we were submitting the proposal for the incubator project. As far as I could tell they were generally positive, but on a corporate level the response was more like "what's the point" / "what's the benefit". Once I confirmed that we don't need MongoDB's participation, as the code is already Apache licensed anyway, the discussion quickly died out. The two people who cared about this, Davdid Daly and Matt Asay, have since both left MongoDB. henrik -- *nyrkio.com <http://nyrkio.com/>* ~ *git blame for performance* Henrik Ingo, CEO hen...@nyrkio.com LinkedIn: www.linkedin.com/in/heingo +358 40 569 7354 Twitter: twitter.com/h_ingo