New memristor design uses built-in oxygen gradient to bring stability to reinforcement learning Tech Xplore