I have noticed that in most models step parameter is used to indicate the no. of steps to run over data but I have also noticed that in practical usage, we mainly use the fit function N epochs.
What is the distinction between running one thousand steps with one epoch and running one hundred steps with ten epoch? Which one is better? Is there any logic changes between consecutive epochs? Data shuffling?