This sounds all find and good, but I still haven't heard anything
about the far more common cause of delay: the garbage collector
locking up the thread or pre-empting the CPU. Or has Amit already
solved this problem?

As for alignment, OpenGL as implemented by JSR-239 (therefore
available on Android) solves this problem by using the JSR-239
java.nio Buffer classes. If you use one of these, you are guaranteed
word-alignment on the ARM.

NB: I say "jsr-239 java.nio" because there may be subtle
implementation differences between those and the original java.nio
classes. For example, JSR-239 does not expect flip() to be useful, nor
is supporting Mark required.

NB: the online docs for Android's java.nio do not mention alignment: I
found this info in one of the OpenGL ES specs (I forget which one): I
forget whether or not the Buffer must be allocated Direct or not to
get this benefit.

On Aug 19, 12:40 pm, Marc <[email protected]> wrote:
> I have spent a lot of time optimizing signal processing algorithms
> using both arm assembly and memory optimizations. The memory
> optimizations can be huge. I remember one project the speed more than
> doubled once we only using cache memory. That is the entire working
> data set was less than 8k.
>
> The process is fairly straight forward: Determine the size of your
> cache, then use only that much memory. This typically requires doing
> tricks like making the output buffer overlap the input buffer.
> Basically you need to know exactly where every byte of memory is being
> used. Java tends to be allocate as you need memory model, but for
> cache optimizations you need tighter control of memory usage.
>
> On Aug 19, 11:47 am, DanH <[email protected]> wrote:
>
> > "Hate to sound like I'm harping on the same stuff, but then (assuming
> > that the JVM/JIT compiler is doing good enough), the memory bottleneck
> > still remains."
>
> > Yep, much of our effort on iSeries went into the memory bottleneck
> > area.  Eg, we got fairly astounding improvements (ca 20%) when we
> > "packed" objects so that the fields of "SubclassOfA" filled in the
> > "holes" left from aligning the fields of "A".  And even more
> > improvement by packing the Char array owned by a String into the
> > String and arranging it so that the two shared a single header.
>
> > (BTW, with regard to alignment, note that most processors can handle,
> > eg, unaligned ints and longs, but often the storage accesses are
> > several times longer if unaligned, so alignment may be very important,
> > even if "unnecessary".)
>
> > On Aug 19, 12:54 pm, Amit <[email protected]> wrote:
>
> > > Hi Dan,
>
> > > Thanks for the response
>
> > > > In general, JITed Java code is as fast as or faster than the
> > > > equivalent native code, if the JIT is reasonably good, and if the
> > > > specific application can be coded efficiently in Java.  
>
> > > I was actually banking on this. I don't know too much of the hairy
> > > details (am not really a compiler person), but from what I have read
> > > recent improvements by Google to the Dalvik VM make it *comparable* if
> > > not equal in performance to native code ...
>
> > > Hate to sound like I'm harping on the same stuff, but then (assuming
> > > that the JVM/JIT compiler is doing good enough), the memory bottleneck
> > > still remains.
>
> > > Thanks,
> > > Amit
>
> > > On Aug 19, 10:11 pm, DanH <[email protected]> wrote:
>
> > > > In general, JITed Java code is as fast as or faster than the
> > > > equivalent native code, if the JIT is reasonably good, and if the
> > > > specific application can be coded efficiently in Java.  The problem is
> > > > that some specific data processing patterns are not easy to code
> > > > efficiently in Java, and I suspect that certain of the bit-bashing
> > > > algorithms used in image processing fall into this category.
>
> > > > In such cases the most efficient approach is "native Java", but I only
> > > > know of one JVM (the IBM iSeries "classic" JVM) that permits this, and
> > > > then only for system code.  Otherwise it's a bit of a tradeoff to get
> > > > the right partitioning between Java and native, since crossing the
> > > > Java/native boundary tends to be relatively expensive.
>
> > > > On Aug 19, 7:03 am, Fabrizio Giudici <[email protected]>
> > > > wrote:
>
> > > > > -----BEGIN PGP SIGNED MESSAGE-----
> > > > > Hash: SHA1
>
> > > > > On 8/19/10 13:35 , Amit wrote:
>
> > > > > > Now, I know that native code will *not* yield any significant
> > > > > > performance improvement over Java code
>
> > > > > Well, specifically for image processing this won't be true, for sure
> > > > > up to 2.1 included (as the bytecode is purely interpreted); in 2.2 we
> > > > > have JIT, but can't speak as I haven't seen it yet.
>
> > > > > - --
> > > > > Fabrizio Giudici - Java Architect, Project Manager
> > > > > Tidalwave s.a.s. - "We make Java work. Everywhere."
> > > > > java.net/blog/fabriziogiudici -www.tidalwave.it/people
> > > > > [email protected]
> > > > > -----BEGIN PGP SIGNATURE-----
> > > > > Version: GnuPG/MacGPG2 v2.0.14 (Darwin)
> > > > > Comment: Using GnuPG with Mozilla -http://enigmail.mozdev.org/
>
> > > > > iEYEARECAAYFAkxtHakACgkQeDweFqgUGxe83wCfSDP1NEN+TLD0iOCZ/zSvQDRw
> > > > > I5cAoJOEoC7eREU5KuPU7m93/GDj9VUr
> > > > > =2ZDf
> > > > > -----END PGP SIGNATURE-----
>
>

-- 
You received this message because you are subscribed to the Google
Groups "Android Developers" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/android-developers?hl=en

Reply via email to