There is a SubArray implementation, sub(), but it has some performance issues related to indexing that should be fixed before it becomes default for slicing.
For small arrays I would guess that hardcoded multiplication is faster than calling a BLAS. You can find info about you julia by calling versioninfo()
