Which gradient descent technique is more advantageous when the data is too large to fit in RAM all at once?