I have a web application written in Flask. As is widely recommended, I should not use Flask's built-in development server in production, so I am planning to serve the application with Gunicorn.
The Flask application loads several machine learning models, which together take up about 8 GB. The application may need to handle up to 1000 concurrent requests, and the machine has 15 GB of RAM.
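To make the constraint concrete, here is a rough back-of-the-envelope sketch of my concern. It assumes that every Gunicorn worker process loads its own copy of the models, and the 1 GB reserve for the OS and the function name `max_worker_processes` are just my own illustrative choices:

```python
def max_worker_processes(ram_gb: int, model_gb: int, reserve_gb: int = 1) -> int:
    """How many worker processes fit in RAM if each one holds
    its own full copy of the models (no memory sharing assumed)."""
    usable_gb = ram_gb - reserve_gb  # leave some room for the OS (assumed value)
    return max(usable_gb // model_gb, 0)


RAM_GB = 15    # machine RAM, from the question
MODEL_GB = 8   # combined size of the models, from the question

workers = max_worker_processes(RAM_GB, MODEL_GB)
print(f"Worker processes that fit without sharing: {workers}")
```

If this assumption holds, only one worker process fits, so I cannot simply raise the number of workers to reach the concurrency I need.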
Given these constraints, what is the best way to run this application?