Not a Number

The cake is a lie

Category: Distributed Systems

Posted on April 24, 2018; updated March 17, 2020

Deploying Distributed TensorFlow with Infrastructure Manager and Ansible

This post shows how to deploy a basic TensorFlow architecture to train a model, using AWS and the Infrastructure Manager tool. All the code and scripts are on GitHub.

Posted on March 10, 2018; updated March 17, 2020

SLURM Cluster Configuration on Azure (Part III)

This is the third part of the tutorial on installing and configuring SLURM on Azure (see part I and part II). This post completes the process and shows an example of running a task.

Posted on February 5, 2018; updated March 17, 2020

SLURM Cluster Configuration on Azure (Part II)

This is the second post of the SLURM installation and configuration guide on Azure (see part I). In this part, we configure the NFS system; the third post sets up the SLURM environment.

Posted on January 13, 2018; updated March 17, 2020

SLURM Cluster Configuration on Azure (Part I)

I finally found some free time to share this project: deploying a workload manager to ease the management of my research group’s GPU cluster.
