This is the second poster I presented at EuroPython 2018. The gist is how to deploy the infrastructure required for a TensorFlow cluster and then provision the software needed to train a Deep Learning model. To do this, I used the Infrastructure Manager (IM) (http://www.grycap.upv.es/im/index.php), which supports the APIs of different virtual platforms, making user applications Cloud-agnostic.
IM also integrates a contextualization system based on Ansible, which installs and configures all the required applications, providing a fully functional Deep Learning infrastructure on the Cloud provider of our choice.
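To give an idea of what the configuration step has to produce on each node, here is a minimal sketch in Python. It is not the recipe from the poster: it assumes the cluster is described through TensorFlow's `TF_CONFIG` environment variable, and the helper name `build_tf_config`, the IP addresses, and the port are all made up for illustration.

```python
import json


def build_tf_config(worker_hosts, ps_hosts, task_type, task_index):
    """Return the TF_CONFIG JSON string for one node of the cluster.

    Hypothetical helper: a provisioning step (for example an Ansible task)
    could export this value as an environment variable on every VM.
    """
    cluster = {"worker": list(worker_hosts)}
    if ps_hosts:
        cluster["ps"] = list(ps_hosts)

    # Reject a role or index that does not exist in the cluster description.
    if task_type not in cluster:
        raise ValueError("unknown task type: %s" % task_type)
    if not 0 <= task_index < len(cluster[task_type]):
        raise IndexError("task index out of range for %s" % task_type)

    return json.dumps(
        {"cluster": cluster, "task": {"type": task_type, "index": task_index}}
    )


# Assumed addresses of the deployed VMs (placeholders, not real hosts).
workers = ["10.0.0.11:2222", "10.0.0.12:2222"]
parameter_servers = ["10.0.0.10:2222"]

# Configuration for the second worker node.
config = build_tf_config(workers, parameter_servers, "worker", 1)
print(config)
```

Every node gets the same `cluster` section and only the `task` section changes, which is why this kind of per-host value fits well in a templated configuration step.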
Now, the poster:
The PDF version is available at this link.