No, ARM does not have these types of instruction. The only direct programmer control of memory cacheability is via the page tables.However, do you have any numbers to suggest that writing to uncached buffered memory is any faster than writing to cached memory on ARM? Most recent ARM cores optimize caches for memcpy performance, as it is a pretty common use case.