I haven't updated my source tree in a long time, but the ARM devices were using bionic/libc/arch-arm/bionic/memset.S. This is pure ARM assembly optimized for the pipeline and cache.
My ARM assembly is a little rusty, but it looks like they implement memset and bzero, the latter just calling memset with R1=0.
My ARM assembly is a little rusty, but it looks like they implement memset and bzero, the latter just calling memset with R1=0.