Re: Is it possible to collect object usage information during compilation?

DaveG via Digitalmars-d Sat, 10 Jan 2015 12:55:49 -0800

On Saturday, 10 January 2015 at 18:31:18 UTC, Paolo Invernizzi wrote:

On Saturday, 10 January 2015 at 17:31:42 UTC, DaveG wrote:
On Saturday, 10 January 2015 at 13:19:19 UTC, Martin Nowak wrote:
Here is a sketch for an optimal solution. I'm actually eagerly waiting for someone to finally implement it.
http://dpaste.dzfl.pl/cd375ac594cf
I would also have to sell the idea of writing an ORM, which is certainly not on the roadmap, but this will certainly help my argument.
Maybe not; something simpler than a full ORM should also be compelling.
I guess you know about the ORM Vietnam [1], but also this [2] can be of some help in selling a simple D solution.
I would like to see, someday, something in D that:

 - can check at compile time the syntax of SQL;
 - can check at compile time the SQL query statement against the current DB schema;
 - can read the output of a DB schema dump at CT, and parse it into what is needed for the previous points (more complicated);

The first point should be easy today, the second and the last one involve more work...
[1] http://blogs.tedneward.com/2006/06/26/The+Vietnam+Of+Computer+Science.aspx
[2] http://wozniak.ca/what-orms-have-taught-me-just-learn-sql
---
Paolo

I have no intention of writing anything as massive as Entity Framework or Hibernate. We have been successful over the past 4 years with just a small collection of functions to reduce some of the pain (and redundancy) in writing a lot of dynamic SQL. Now that we have an opportunity to start fresh we have a chance to do something better.

The traditional problems with ORMs in general are well known and these are the reasons why I have never used one in production.

1. Complexity. You basically need to learn an entire new language (sometimes literally). This is an investment which can be worth it if the abstraction is successful. The following problems are why I think the investment is not worth it.

2. Limitations. Unfortunately, too often you need to drop into SQL to really get things done. This alone is a non-starter. If I need to bypass the abstraction to do anything really interesting or complex, it has failed. Sometimes (usually) this is for performance; other times it's because there is simply no way (or it's too complicated) to express what I want through the abstraction.

3. Compilation/Translation. The time to translate commands to SQL (or whatever backend) can be a high price. Most ORMs do some type of caching now, which is generally sufficient. In D most of the work can be done at compile time, which is even better.

4. Unnecessary Data. Greedy data retrieval is way too common; the default is usually to get everything. For small queries and data sets you can write it off as "not a problem", but when your model gets large and interconnected, this can be catastrophic. Again, thanks Martin for the clever basis for a solution in D.

5. DB Performance. The efficiency of the SQL that is actually generated. People seem to focus on this because the generated SQL is generally quite verbose. Interestingly, in my experience, this is often the smallest performance problem because the query optimizer (at least in SQL Server with good indexes and statistics) will generate the same execution plan regardless. This is also a code gen problem that can be tweaked without breaking user code.
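
To make points 3 and 4 a little more concrete, here is a minimal sketch of building a query at compile time that names only the columns a given type needs. It is not Martin's approach from the dpaste link, just an illustration under stated assumptions: the `UserSummary` struct, the `users` table and the `selectFor` helper are all hypothetical names, and field names are assumed to match column names exactly.

```d
import std.traits : FieldNameTuple;

// Hypothetical row type: it declares only the columns this code path needs.
struct UserSummary
{
    int id;
    string name;
}

// Builds "SELECT <fields of T> FROM <table>".
// An ordinary function; it runs at compile time when its result
// is used to initialise an enum (CTFE).
string selectFor(T)(string table)
{
    string cols;
    foreach (i, name; FieldNameTuple!T)
    {
        if (i > 0)
            cols ~= ", ";
        cols ~= name;
    }
    return "SELECT " ~ cols ~ " FROM " ~ table;
}

// Evaluated by the compiler: no translation cost is paid at run time.
enum userSummaryQuery = selectFor!UserSummary("users");
static assert(userSummaryQuery == "SELECT id, name FROM users");

void main()
{
    import std.stdio : writeln;
    writeln(userSummaryQuery);
}
```

Because the query text is an `enum`, the translation happens once in the compiler, and the narrow struct keeps the query from pulling every column by default.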

You may have noticed that 4 of 5 problems are about performance. That's because, at least in our case, it is that important and it is that much of a problem. Current ORMs often look great, but in my experience, the price is always too high. Some "micro-ORMs" avoid the performance problems, but they do so by sacrificing most of the features (you still have to write raw SQL, for example). Some of the problems are inherent to the solution and cannot be "solved", but they can be reduced.

For a long time I thought some of these problems were fundamental and had basically written off the concept of ORMs [see: Vietnam of Computer Science]. The good news is most of the problems appear to be solvable.

#1 is unavoidable; obviously there will be something new (whether it's a DSL or just an API).

#2 is really dependent on the other problems and implementation.
#3 is "just" implementation.
#4 has a conceptual solution, now it's "just" implementation.

#5 does not have a solution because it will depend on the backend, but I think it's reasonable to expect a solution that works for almost all cases. It will be impossible to know without testing.

One final note. You may have noticed I didn't mention the schema syncing problem (keeping database and code in sync). There was a time I would have said that was essential, and while it would be nice in a perfect world, I'm comfortable keeping them in sync manually (or semi-manually with scripts). I can generate a bunch of classes from an existing database fairly easily, and when I change a table I can manually update a class. If I was writing SQL directly I would have to update my query; this is really no different. Doing validation in unit tests is perfectly acceptable to me.
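
As a rough idea of what validation in unit tests could look like, the sketch below checks that every field of a type has a matching column. Everything in it is an assumption for illustration: the `User` struct, the `usersColumns` list (hard-coded here, where a real project would take it from a schema dump or a script), and the `missingColumns` helper.

```d
import std.algorithm.searching : canFind;
import std.traits : FieldNameTuple;

// Hypothetical type mirroring a "users" table.
struct User
{
    int id;
    string name;
    string email;
}

// Assumption: column names as reported by the database. Hard-coded to keep
// the sketch self-contained; extra columns in the table are allowed.
immutable string[] usersColumns = ["id", "name", "email", "created_at"];

// Returns the fields of T that have no column of the same name.
string[] missingColumns(T)(const(string)[] columns)
{
    string[] missing;
    foreach (name; FieldNameTuple!T)
    {
        if (!columns.canFind(name))
            missing ~= name;
    }
    return missing;
}

unittest
{
    // Fails the test run as soon as the struct and the table drift apart.
    assert(missingColumns!User(usersColumns).length == 0);
}

void main() {}
```

Compiling with `-unittest` runs the check before `main`, so a renamed or dropped column shows up in the test run rather than in production.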



Sorry for long post.
-Dave
