Re: Does OpenJPA replace Collections?

Paul Copeland Mon, 13 Apr 2009 20:52:25 -0700

Hi Craig -

Do you mean a JIRA about the case where "OpenJPA will decide to replacethe collection"? I agree it is not a bug, just a potential issue forthe unwary. Would it be useful to create a small example and post it here?


- Paul

On 4/13/2009 7:41 PM, Craig L Russell wrote:

Hi Paul,

On Apr 13, 2009, at 5:15 PM, Paul Copeland wrote:
Craig - Thanks for the responses. This confirms that for a new Entitya collection field may be null unless the application initializes it.
When you say "flushed" does that include callingEntityManager.flush() before the transaction is committed? The specsays the field can be null until it is "fetched". My expectation isthat the field may remain null even after calling EntityManager.flush().
The surprising thing is that when you add elements to such anapplication initialized empty collection, in some situations OpenJPAwill decide to replace the collection. At that point if you areholding a reference to the value returned by getMyPcList() thatcollection will then be stale, possibly leading to inconsistentresults for the caller.
This is worth a JIRA if only to clarify the code and the behavior thatthe code exposes.
Craig
- Paul

(other comments below)

On 4/13/2009 4:18 PM, Craig L Russell wrote:
Hi Paul,

On Apr 13, 2009, at 9:04 AM, Paul Copeland wrote:
Are there any responses from the OpenJPA experts on my twoassertions below? If the assertions seem wrong I will put togetherexamples that demonstrate the behavior. If the assertions arecorrect that is not necessary.
From JPA Spec Section 2.1.7 - "If there are no associated entitiesfor a multi-valued relationship of an entity fetched from thedatabase,the persistence provider is responsible for returning an emptycollection as the value of the relationship."
Note the words "fetched from the database". My reading of this isthat if the Entity is new and has not been flushed to the database(even though persist() has been called) the value could be nullrather than an empty collection. So the behavior of OpenJPAreturning null (assertion #1) would be consistent with the spec.
That's how I read it as well. Until the new entity is flushed,there's no reason to have the entity manager provider messing withthe values.
- Paul

On 4/9/2009 12:22 PM, Paul Copeland wrote:
Thanks for the assistance Craig -
Here are two assertions that I have observed in my testing withOpenJPA 1.2.1 -
(1) A Field Access persistent collection has a null value when thefield is accessed if the collection is empty. This is the state ofthe field in the transaction after the entity is first persistedbefore the transaction is committed (these are the conditions thatoccur in my process). Corollary - the null field is NOTautomatically changed to an empty Collection when first accessed.A method returning the collection field will return null.
This is discussed above. The entity is not "fetched" but rathernewly persisted.
(2) The value of a null collection field (state as in #1 above)that has been assigned to an initialized non-null value may beautomatically replaced before the transaction is committed atwhich point references to the assigned value will be stale and nolonger updated (for instance when entity's are added to thecollection).
This is discussed above. Until flush, any user changes to thecollection should be reflected in the database.
But one other thing to consider. It's the application'sresponsibility to manage both sides of a relationship to beconsistent at commit. So if you're looking to update only the otherside of a relationship you're in trouble unless you use some OpenJPAspecial techniques.
Good point about updating both sides of the relation. In this case Iam using the OpenJPA API to detect if the other side has not beenloaded yet and only updating the other side when necessary. This isto avoid loading a potentially very large collection that is notgoing to be used during the life of that EntityManager. If and whenthe other side is loaded OpenJPA will include the new elements then.
This does not change the question about null or empty collectionshowever.
Craig
If the experts believe either of these assertions are incorrectthen I definitely want to investigate further.
- Paul

(further comments below)


On 4/9/2009 11:13 AM, Craig L Russell wrote:
Hi Paul,

On Apr 9, 2009, at 9:40 AM, Paul Copeland wrote:
Couple of clarifications -
A lazily loaded FIELD ACCESS collection is a null value wheninitially accessed if the Collection is EMPTY (I said "null"incorrectly below).
My comment below was intended to compare your "if null theninitialize" paradigm with my "initialize to an empty collectionduring construction". So if the first time you access thecollection it is null your code sets the value to an emptycollection. My recommended code would never encounter a nullcollection.
Your way works (as do other ways).  :-)
The test I have shows this behavior for a newly persisted Entityduring the same transaction where em.persist(entity) is called.This is with a LAZY loaded collection.
During persist, the provider should not replace fields. Replacingfields behavior should happen at commit (flush) time. So if younever explicitly initialize a field, it should have its Javadefault value until flush.
This is NOT what I am seeing. In fact the replacement happensduring the transaction under certain conditions where the proxy isapparently created during the transaction some time after the callto em.persist(entity) and before commit.
If you're talking about wrapping the persistent collection withan unmodifiable collection then you're talking about adding moreobjects. I thought you were trying to avoid any object construction?
I would construct the unmodifiable collection (if the idiomworked) only if and when the value is accessed and has alreadybeen loaded. Other things being equal, I don't want to constructtens of thousands of Collections in a tight loop that are neverused. Given database latencies it is a small point in overallperformance. As I said, there are good arguments either way andyour recommendation is one reasonable approach, but apparently nota JPA requirement.
In some applications there is a difference between an emptycollection and a null collection. There are properties that allowthat behavior to be implemented as well, although that'snon-standard and a bit more complicated.
It might be easier to look at a test case because I think we'retalking past each other.
Craig
On 4/9/2009 9:26 AM, Paul Copeland wrote:
Hi Craig -
My experience is not what you are describing. A lazily loadedFIELD ACCESS collection is a null value when initially accessedif the Collection is null (possibly a PROPERTY ACCESScollection behaves differently as mentioned by Pinaki , Ihaven't tested that).
To repeat what is below -

                 getMyPcList()
returns null if the Collection is empty unless you initializethe value with "new ArrayList()". This is what my testingshows with 1.2.1 - I wish it weren't this way since that mightit make it possible to use the Collections.unmodifiedList()idiom (as it is that idiom has unreliable behavior). If theexperts are pretty sure that I am wrong about this then Idefinitely want to investigate it further. I'd like to hear more.
I don't think you have given a reason to require initializingthe Collection at construction time or at first access -- thereare reasonable aesthetic and performance arguments either way.
- Paul


On 4/9/2009 7:01 AM, Craig L Russell wrote:
Hi Paul,
I like to think of entities as POJOs first, so I can test themwithout requiring them to be persistent. So if you want codeto be able to add elements to collections, the collectionsmust not be null.
If you construct the field as null and then "lazily"instantiate an empty collection, then anyway you end up withan empty collection the first time you access the field. Andconstructing an empty collection should not be even a blip onyour performance metric.
Considering everything, I still recommend that you instantiatean empty collection when you construct an entity.
Craig

On Apr 8, 2009, at 10:21 AM, Paul Copeland wrote:
Pinaki -
I tried your suggestion of not initializing the value ofmyPcList and I get a null pointer exception when adding to anempty list.
I noticed your example was for Property access and Russell(and I) were talking about Field access. Do you agree thatit is necessary to initialize an empty list when using Fieldaccess?
On Craig's advice to always construct a new ArrayList(), whyis that necessary instead of just constructing it in thegetter when it tests to null? Otherwise you are constructingan ArrayList that is unnecessary when the List is NOT empty(usually) and also unnecessary in the case of LAZY loading ifthe List is never accessed (perhaps also a frequent case).In some applications you might create lots of these objectsand normal optimization is to avoid calling constructorsunnecessarily. Just want to be clear about whether it isnecessary.
- Paul

On 4/8/2009 9:43 AM, Paul Copeland wrote:
Thanks Pinaki -
I think you are saying that at some point the proxy objectdoes replace the local List. Is that right?
I have seen that model - if (myPcList == null) myPcList =new ArrayList() - in various examples (not sure where now).Thanks for clearing that up. But then Craig Russellcontradicts you in his reply (below) where he recommendsalways initializing the Collection in the constructor (whichseems like a performance anti-pattern of wasted constructorcalls since usually it will be replaced by the proxy). Areyou and Craig saying opposite things here?
In my testing when the List is empty - (myPcList == null) -does indeed evaluate to true.
          getMyPcList().add(new MyPcObject())
Therefore I thought the above would cause a null pointerexception when the List is empty. You say that won't happenso I'll give it a try!
- Paul


On 4/8/2009 3:16 AM, Pinaki Poddar wrote:
Hi,
According to JPA spec:
"If there are no associated entities for a multi-valuedrelationship of an entity fetched from the database,the persistence provider is responsible for returning anempty collection as the value of the relationship."
That is what OpenJPA does. So the application do not needto return an empty list for a null (initialized) list.
OpenJPA proxies all returned collections. So applicationcode can simply do the following
// In the domain class
private List<MyPcObject> myPcList = null; // neverexplictly initialized
@OneToMany (mappedBy="ownerSide", fetch=FetchType.LAZY,cascade=CascadeType.PERSIST)
public  List<Promotion> getMyPcList()  {
  return myPcList; // return as it is
}

// In the application
List<Promotion> list = owner.getMyPcList();
assertNotNull(list);
assertTrue(java.util.List.class.isInstance(list));
assertNotSame(java.util.ArrayList.class, list.getClass());
list.add(new MyPcObject());
owner.setMyPcList(list);




On Apr 7, 2009, at 11:10 PM, Paul Copeland wrote:
Can OpenJPA replace a Collection when it is loaded?
With the code below when the list is initially empty youneed to create a List (ArrayList) so you can add elementsto it. When I persisted new objects on the ManyToOne sideand added them to the List that worked. But the firsttime the List was loaded it seemed to replace myArrayList with the newly loaded data and made an olderreference to the ArrayList stale (no longer updated whenmore elements were added to myPcList). This was all inone transaction.
So now I wonder if the initial null List is a special caseor if OpenJPA might replace the Collection anytime itdecides to load it again. Anyone know the answer?
If the list is persistent and the class is enhanced, thecollection will always reflect what's in the database.
If I don't create an initial ArrayList how can I addelements when the List is empty?
I'd recommend always having a non-empty list. Initialize itin the constructor to an empty list and don't check itafter that.
Here's what it would look like:
@OneToMany (mappedBy="ownerSide", fetch=FetchType.LAZY,cascade=CascadeType.PERSIST)private List<MyPcObject> myPcList = newArrayList<MyPcObject>();
List<Promotion> getMyPcList()
{
 return myPcList;
}
Craig
Craig L Russell
Architect, Sun Java Enterprise System http://db.apache.org/jdo
408 276-5638 mailto:craig.russ...@sun.com
P.S. A good JDO? O, Gasp!





-----
Pinaki Poddarhttp://ppoddar.blogspot.com/http://www.linkedin.com/in/pinakipoddar
OpenJPA PMC Member/Committer
JPA Expert Group Member
Craig L Russell
Architect, Sun Java Enterprise System http://db.apache.org/jdo
408 276-5638 mailto:craig.russ...@sun.com
P.S. A good JDO? O, Gasp!
Craig L Russell
Architect, Sun Java Enterprise System http://db.apache.org/jdo
408 276-5638 mailto:craig.russ...@sun.com
P.S. A good JDO? O, Gasp!
Craig L Russell
Architect, Sun Java Enterprise System http://db.apache.org/jdo
408 276-5638 mailto:craig.russ...@sun.com
P.S. A good JDO? O, Gasp!
Craig L Russell
Architect, Sun Java Enterprise System http://db.apache.org/jdo
408 276-5638 mailto:craig.russ...@sun.com
P.S. A good JDO? O, Gasp!

Re: Does OpenJPA replace Collections?

Reply via email to