Re: Points of order on this WG

Nikunj R. Mehta Fri, 26 Jun 2009 10:29:48 -0700

Please don't skimp on due diligence before making such strongstatements. It creates unnecessary friction. More details below.


On Jun 25, 2009, at 10:49 PM, Maciej Stachowiak wrote:

On Jun 25, 2009, at 5:23 PM, Jonas Sicking wrote:
On Thu, Jun 25, 2009 at 12:42 PM, Nikunj R.
Mehta<[email protected]> wrote:
On Jun 24, 2009, at 11:35 PM, Ian Hickson wrote:
I have proposed to Mozilla a solution that provides access to anorganizedkey-value database such as that provided in the (open source)Berkeley DB.In essence, a database is a simple B-tree - it keeps keys sortedand permitsduplicate keys. It is able to find a key or a key prefix, whichenables
efficient searching through a very large number of items. If we are
ambitious (i.e., need more functionality), we can add indexes andjoins tothis spec. There is unlikely to be an interoperability nightmare,such asthat which is the most likely outcome with SQL, it does notmandate the useof any query language, and there is at least 40 years ofexperience with it,including in highly resource-constrained environments. (There are200
million copies of Berkeley DB in deployment [1]).
This is what so far seems like the best solution to me. I.e.something
that is more backend-ish than what a SQL API would be.

I'd love to see something that allows you to implement a SQL API on
top of. But that also allows you to implement something like MungoDB
very effectively.
I doubt you can efficiently or correctly implement SQL on top of aBerkeley-DB-style API.


If you are worrying, that's great; we can address your worries.

Just take a look at the top two hits for "MySQL Berkeley DB". Thefirst one talks about MySQL with the BDB storage engine - so yes, youcan correctly implement SQL on top of Berkeley DB [1]. The second onetalks about MySQL discontinuing BDB support due to extra-technicalreasons, so there are no efficiency reasons either [2].

As a side note, it should be noted Berkeley DB itself could not beused by WebKit or Gecko to implement the spec, because even thoughit is open source, the license is not compatible with the LGPL. Itseems unlikely that non-open-source browser engines could use iteither, unless they are willing to pay Oracle for a commerciallicense. So it's very important for the spec to be clear anddetailed, because everyone will have to implement it from scratch.

Huh? what? I hope you had read Oracle's BDB license document [3] andopen source FAQ [4]. IANAL, but I can get answers for your specificconcerns in the context of open source Berkeley DB. AFAICT, someonelike Mozilla would not face any trouble with the open source licenseof Berkeley DB. YMMV.

It's also not clear to me if a BDB-level API is sufficient fordeveloper needs. As I understand it, it's basically a giantdictionary with unstructured keys and values. So it's not providingmuch over LocalStorage, except for prefix matching and the abilityto hold large amounts of records or records that are individuallylarge. There's no way to efficiently query by one of several fields,as I understand it.

I trust that you are relatively new to storing data with B-trees. Theyare at the heart of Oracle's indices so efficiency is out of question.If you are wondering how can people store complex data items withmultiple fields and repeating values, look at Berkeley DB JavaEdition, which supports the EJB 3 persistence model [5]. FYI, there isno significant difference between the APIs of BDB Java Edition and theoriginal BDB. They also have identical licensing requirements.


[1] http://dev.mysql.com/doc/refman/5.0/en/bdb-storage-engine.html
[2] http://www.linux.com/archive/articles/56835
[3] 
http://www.oracle.com/technology/software/products/berkeley-db/htdocs/oslicense.html
[4] 
http://www.oracle.com/technology/software/products/berkeley-db/htdocs/licensing.html
[5] http://www.oracle.com/technology/products/berkeley-db/je/index.html

Re: Points of order on this WG

Reply via email to