Adam optimization = Momentum + RMSprop (Adaptive moment estimation)
Hyperparameters choice for Adam optimization (Recommendations)
alpha : needs to be tuned B1 : 0.9 → (dw) B2 : 0.999 → (dw^2) Epsilon : 10^-8