Merge of Develop into Master #85

Merged 135 commits on Mar 21, 2019
Commits
cd1fe70
small changes to inventory
Gerbenvandervries Jan 11, 2019
feb7deb
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Jan 11, 2019
7e2ee4f
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Jan 15, 2019
065f128
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Jan 16, 2019
a277bbf
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Jan 17, 2019
f976108
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Jan 22, 2019
4b888e7
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Jan 24, 2019
e5843e9
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Jan 24, 2019
b7a5398
Improved SSHD config: ensure only strong keys can be used.
pneerincx Jan 27, 2019
d3543a5
Bugfixes and added handlers for LDAP role including nslcd, dbusd, odd…
pneerincx Jan 28, 2019
a82550e
Bugfixes and simplified syntax for creating local admin accounts.
pneerincx Jan 28, 2019
c32d861
...
pneerincx Jan 28, 2019
85f63fa
Merge pull request #44 from pneerincx/develop
erijpkema Jan 29, 2019
2e3c530
Bugfix: parse public keys correctly for OpenSSH no matter in which or…
pneerincx Jan 29, 2019
c3a23a1
Merge pull request #45 from pneerincx/develop
Gerbenvandervries Jan 30, 2019
68db823
Fixed perms on /local.
pneerincx Jan 30, 2019
a3b1c9d
One '-i' was missing,,,
erijpkema Jan 30, 2019
ddf69ab
Merge pull request #48 from rug-cit-hpc/fix/create_cluster_non_intera…
Gerbenvandervries Jan 30, 2019
9962986
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Jan 30, 2019
660968e
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
pneerincx Jan 30, 2019
4be724a
Re-ordered RPMs in alphabetic order.
pneerincx Jan 30, 2019
89c3529
Making spacewalk role idempotent...
pneerincx Jan 30, 2019
a824382
Cleanup.
pneerincx Jan 30, 2019
e6e8222
Cleanup: removed commented code.
pneerincx Jan 31, 2019
fc60737
Added version number check for Ansible version to main cluster.yml pl…
pneerincx Jan 31, 2019
3cd2deb
Merge branch 'master' of https://github.com/rug-cit-hpc/league-of-rob…
Gerbenvandervries Feb 6, 2019
b966b53
fixed user ids, and admin and user group for local users. wget
Gerbenvandervries Feb 6, 2019
5c1b0db
Merge pull request #50 from pneerincx/develop
erijpkema Feb 6, 2019
b4b233d
changed clusterdesign drawing
rasmanen Feb 6, 2019
b69ce37
Merge pull request #55 from rug-cit-hpc/fix/improved-picture
erijpkema Feb 6, 2019
48d1910
Added new resolver role, which configures dnsmasq to use the nameserv…
pneerincx Feb 6, 2019
bc48b18
Merge pull request #56 from pneerincx/develop
erijpkema Feb 6, 2019
f51ef78
Added ssh-host signer password.
erijpkema Feb 18, 2019
9e46b63
Did a cleanup. #59
erijpkema Feb 18, 2019
6873464
Merge pull request #59 from rug-cit-hpc/fix/remove-obsolete-files-#59
pneerincx Feb 19, 2019
96a0e56
Merge pull request #58 from rug-cit-hpc/feature/encrypted-host-signer
pneerincx Feb 19, 2019
90c671f
Merge branch 'develop' into develop
pneerincx Feb 20, 2019
fb129df
Merge pull request #53 from Gerbenvandervries/develop
pneerincx Feb 20, 2019
6436ea0
Minor fixes for single_role_playbooks so they will automagically pick…
pneerincx Feb 21, 2019
aa401f1
Bugfixes for creating local admin and regular users. And added featur…
pneerincx Feb 21, 2019
604348f
Separating ldap tasks
erijpkema Jan 30, 2019
7c39246
Moved the sshd template to the cluster role #28.
erijpkema Feb 19, 2019
dbb8a2d
Work in progress...
erijpkema Feb 21, 2019
f01258a
Added handlers
erijpkema Feb 21, 2019
c1afb7a
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Feb 21, 2019
f155190
fix for slurm controleMachine name, added .bashrc and .screenrc for n…
Gerbenvandervries Feb 22, 2019
188f684
Merge pull request #62 from Gerbenvandervries/develop
Gerbenvandervries Feb 22, 2019
de7ef32
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Feb 22, 2019
a1282fd
Improved dynamic inventory.py script: 1. allow the use od groups of g…
pneerincx Feb 22, 2019
1b02f8a
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
pneerincx Feb 22, 2019
b35133c
Merge pull request #61 from rug-cit-hpc/feature/no-ldap-slurm-option
pneerincx Feb 22, 2019
1ba0499
Added compiled *.pyc filesto .gitignore.
pneerincx Feb 22, 2019
fb693ae
Deleted *.pyc file from repo.
pneerincx Feb 22, 2019
639da6e
Replaced imports of playbooks to configure local users with improved …
pneerincx Feb 22, 2019
c1b52f4
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Feb 27, 2019
4cb41ed
Updated code to deal with review comments.
pneerincx Feb 28, 2019
5348e4d
Added Henk-Jan as local admin user for all vclusters.
pneerincx Feb 28, 2019
9d9437c
DOCSTRING relocated to begining of file.
pneerincx Feb 28, 2019
f59c25e
Merge pull request #63 from pneerincx/develop
erijpkema Feb 28, 2019
f1acaf4
Fix to remove the optionally prefixed jumphost when setting the hostn…
pneerincx Feb 28, 2019
e7dbba4
Added new public key for Wim Nap.
pneerincx Feb 28, 2019
45abb81
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
pneerincx Feb 28, 2019
315a94d
Merge pull request #64 from pneerincx/develop
erijpkema Feb 28, 2019
c29d79a
host ca for hc
Gerbenvandervries Mar 1, 2019
5e21b6e
Shared storage config (WIP).
pneerincx Mar 1, 2019
52cae2a
cars
Gerbenvandervries Mar 1, 2019
c4b28a8
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Mar 1, 2019
8f33c1f
Added shared storage role (complete!).
pneerincx Mar 3, 2019
c22996b
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
pneerincx Mar 3, 2019
54debbe
Added a network interface for thalos
erijpkema Mar 4, 2019
d3e9207
Merge pull request #68 from rug-cit-hpc/985-interface-for-thalos
pneerincx Mar 4, 2019
71c6ff1
The gearshift node also needs to mount storage.
erijpkema Mar 6, 2019
8a90cc5
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Mar 6, 2019
71dc490
Merge pull request #69 from rug-cit-hpc/feature/storage_vlans_on_gear…
Gerbenvandervries Mar 6, 2019
793b0f6
Added changes for federated login.
erijpkema Mar 7, 2019
4e18abd
Updated the base_ou
erijpkema Mar 7, 2019
c236d46
password-auth-ac should be in user-interface
erijpkema Mar 7, 2019
34bf879
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Mar 7, 2019
8d71f75
Merge pull request #71 from rug-cit-hpc/fix/password_auth_ac_missing
Gerbenvandervries Mar 7, 2019
de7d0ca
Merge pull request #70 from rug-cit-hpc/feature/ldap-settings-for-scz
pneerincx Mar 11, 2019
d1e4db0
Merge pull request #66 from pneerincx/develop
erijpkema Mar 11, 2019
ea8f2f8
Add the fysical cluster nodes in /etc/hostsi
erijpkema Mar 11, 2019
17e7908
Merge pull request #72 from rug-cit-hpc/feature/monitor_fysical_nodes…
pneerincx Mar 12, 2019
9921435
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
pneerincx Mar 12, 2019
3a033b1
Bugfixes: removes unused storage roles from playbooks and added new s…
pneerincx Mar 12, 2019
4ec58cc
Relocated stuff from host_vars to group_vars as host_vars cannot be r…
pneerincx Mar 12, 2019
69f9203
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Mar 13, 2019
2875b0c
Removed NFS based home dirs exported from the UI from cluster.yml pla…
pneerincx Mar 13, 2019
d2b6c44
Updated gitignore to disregard anything that starts with .vault_pass.txt
pneerincx Mar 13, 2019
7b2e386
Bugfix: updated handlers to always restart systemd-logind after dbus …
pneerincx Mar 13, 2019
2d5e808
Bugfixes for creating local regular users on Hyperchicken (tested on …
pneerincx Mar 13, 2019
f82b10e
Merge pull request #74 from pneerincx/develop
Gerbenvandervries Mar 14, 2019
ed41f82
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Mar 14, 2019
a9fac3a
Added a modern key for ger
erijpkema Mar 14, 2019
3c20429
Merge pull request #75 from rug-cit-hpc/add_ger_key
pneerincx Mar 14, 2019
46b1f57
Removed confusing unused argument for different but not used pam-scri…
pneerincx Mar 14, 2019
db48d13
Bugfix for slurm and slurm-client roles: ensure we get the right host…
pneerincx Mar 14, 2019
8564ec9
Bugfixes: added missing instances of "enabled: yes" or "state: starte…
pneerincx Mar 14, 2019
934030c
Removed debug statements..
pneerincx Mar 14, 2019
7de3ec8
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Mar 14, 2019
5919145
Added workaround for a bug in logind.
erijpkema Mar 14, 2019
49f4854
Merge pull request #76 from rug-cit-hpc/fix/restart-logind
rasmanen Mar 14, 2019
f33a3e8
Relocated stuff related to login to dedicated logins role. (Previousl…
pneerincx Mar 15, 2019
72b9738
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
pneerincx Mar 15, 2019
caa361e
Added workaround for dbus <-> logind interaction also to hc-cluster.yml.
pneerincx Mar 15, 2019
bd5dcaa
Made the role more generic
erijpkema Mar 14, 2019
41fc0a2
Merge pull request #77 from rug-cit-hpc/feature/local_storage_on_dai
pneerincx Mar 15, 2019
c22dd08
Bugfixes: moved deployment of local admin users to beginning of playb…
pneerincx Mar 15, 2019
453078e
Merge branch 'develop' into develop
pneerincx Mar 15, 2019
5aae0b1
Merge pull request #78 from pneerincx/develop
erijpkema Mar 15, 2019
599f0a7
Removed deprecated cluster specific scripts to configure the Slurm DB…
pneerincx Mar 15, 2019
c9b24d5
NHC per vnode type
Gerbenvandervries Mar 15, 2019
867b208
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
Gerbenvandervries Mar 15, 2019
a6ff4d8
.
Gerbenvandervries Mar 15, 2019
36864ff
Merge pull request #80 from Gerbenvandervries/develop
pneerincx Mar 15, 2019
bb6e7c3
Merge pull request #79 from pneerincx/develop
erijpkema Mar 15, 2019
52156f7
Relocated definition of firewall_allowed_tcp_ports to group_vars.
pneerincx Mar 15, 2019
22cfb1b
Relocated slurm DB table name var from secrets to group_vars/all.yml.
pneerincx Mar 15, 2019
cb61438
Fixed Talos resource vars to make NHC happy.
pneerincx Mar 15, 2019
d4f7411
Made NHC network interface check dynamic.
pneerincx Mar 15, 2019
fcf5aec
Fixed merge conflict.
pneerincx Mar 15, 2019
c97a282
Improved smarter NHC checks for network interfaces and shared storage…
pneerincx Mar 15, 2019
aa1a612
Added new key for kees (stealth)
erijpkema Mar 19, 2019
f71b6e7
Merge pull request #81 from pneerincx/develop
erijpkema Mar 19, 2019
ef90c72
Merge pull request #82 from rug-cit-hpc/feature/new-key-for-kees
pneerincx Mar 19, 2019
41bfa1e
Removed blank line.
pneerincx Mar 21, 2019
6ebcef0
Bugfix: use correct name for tmp LFS.
pneerincx Mar 21, 2019
3be59f3
Fixed login checks to create users and accounts in the Slurm accounti…
pneerincx Mar 21, 2019
09d01f3
Merge branch 'develop' of https://github.com/rug-cit-hpc/league-of-ro…
pneerincx Mar 21, 2019
7a669ff
Fixed type in taks name.
pneerincx Mar 21, 2019
92a7900
Bugfix: /var/spool/slurmd must be owned by root.
pneerincx Mar 21, 2019
73f760f
Bugfix: /var/spool/slurmd must be readable, but not writable for all …
pneerincx Mar 21, 2019
93dd717
Merge pull request #83 from pneerincx/develop
erijpkema Mar 21, 2019
58177f5
added extra groups, and datamanager users
Gerbenvandervries Mar 21, 2019
ef1e7da
Merge pull request #84 from Gerbenvandervries/develop
pneerincx Mar 21, 2019
3 changes: 2 additions & 1 deletion .gitignore
@@ -4,9 +4,10 @@
*.md.html
*.vagrant
*.pydevproject
*.pyc
*.retry
*.swp
.vault_pass.txt
.vault_pass.txt*
documentation/.~lock.UMCG Research IT HPC cluster technical design.docx#
promtools/results/*
roles/hpc-cloud
69 changes: 69 additions & 0 deletions callback_plugins/homsaplog.py
@@ -0,0 +1,69 @@
'''
callback: homsaplog
type: stdout
short_description: Homo sapiens friendly formatted output.
description: Use this callback to sort through extensive debug output.
'''

from __future__ import (absolute_import, division, print_function)
from ansible.plugins.callback.default import CallbackModule as CallbackModule_default
from ansible.plugins.callback import CallbackBase
try:
    # Ansible 2.3
    from ansible.vars import strip_internal_keys
except ImportError:
    try:
        # Ansible 2.4
        from ansible.vars.manager import strip_internal_keys
    except ImportError:
        # Ansible 2.5
        from ansible.vars.clean import strip_internal_keys
try:
    import simplejson as json
except ImportError:
    import json
import sys
reload(sys).setdefaultencoding('utf-8')

__metaclass__ = type

class CallbackModule(CallbackModule_default):  # pylint: disable=too-few-public-methods,no-init
    '''
    Override for the default callback module.

    Render std err/out outside of the rest of the result which it prints with
    indentation.
    '''
    CALLBACK_VERSION = 2.0
    CALLBACK_TYPE = 'stdout'
    CALLBACK_NAME = 'homsaplog'

    def _dump_results(self, result, indent=4, sort_keys=True, keep_invocation=False):
        '''Return the text to output for a result.'''

        if result.get('_ansible_no_log', False):
            return json.dumps(dict(censored=
                "The output has been hidden due to the fact that 'no_log: true' was specified for this result."))

        # All result keys starting with _ansible_ are for internal use only,
        # so remove them from the result before we output anything.
        reformatted_result = strip_internal_keys(result)

        # remove invocation unless specifically wanting it
        if not keep_invocation and self._display.verbosity < 3 and 'invocation' in result:
            del reformatted_result['invocation']

        # remove diff information from screen output
        if self._display.verbosity < 3 and 'diff' in result:
            del reformatted_result['diff']

        # remove exception from screen output
        if 'exception' in reformatted_result:
            del reformatted_result['exception']

        # Turn escaped newlines in the JSON dump into real newlines plus a tab,
        # dropping a newline that sits directly before a closing quote and comma.
        output = json.dumps(reformatted_result, indent=indent, ensure_ascii=False, sort_keys=sort_keys)
        output = output.replace('\\r\\n\",', '",')
        output = output.replace('\\r\\n', "\n\t")
        output = output.replace('\\n\",', '",')
        output = output.replace('\\n', "\n\t")
        return output
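The effect of the four `replace` calls at the end of `_dump_results` is easiest to see on a small result. The following is a minimal standalone sketch, not part of the pull request: `humanize` and `sample` are hypothetical names, the code targets Python 3, and it leaves out the Ansible-specific key stripping so that it runs without Ansible installed.

```python
import json


def humanize(result, indent=4):
    """Dump a result dict to JSON, then apply the same newline
    replacements as _dump_results (hypothetical helper for illustration)."""
    output = json.dumps(result, indent=indent, ensure_ascii=False, sort_keys=True)
    # Drop an escaped newline that sits directly before a closing quote + comma.
    output = output.replace('\\r\\n",', '",')
    # Turn remaining escaped CRLF into a real newline plus a tab.
    output = output.replace('\\r\\n', "\n\t")
    output = output.replace('\\n",', '",')
    output = output.replace('\\n', "\n\t")
    return output


# Hypothetical task result with multi-line output.
sample = {"msg": "line one\nline two\n", "rc": 0}
print(humanize(sample))
```

Multi-line values are printed on separate, tab-indented lines instead of as one long string with `\n` escapes. Note that the returned text is no longer valid JSON, which is acceptable for a callback that only writes to the screen.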
9 changes: 0 additions & 9 deletions cinder-controller.yml

This file was deleted.

9 changes: 0 additions & 9 deletions cinder-storage.yml

This file was deleted.

63 changes: 43 additions & 20 deletions cluster.yml
@@ -1,9 +1,41 @@
---
- name: Sign host keys of all cluster hosts.
- name: Sanity checks before we start.
hosts: all
pre_tasks:
- name: Verify Ansible version meets requirements.
assert:
that: "ansible_version.full | version_compare('2.4', '>=')"
msg: 'You must update Ansible to at least 2.4.x to use this playbook.'

- import_playbook: local_admin_users.yml

- name: Install roles needed for all virtual cluster components.
hosts: all
roles:
- logins
- ssh_host_signer
- ssh_known_hosts
tasks:
- cron:
#
# Silly workaround for bug in interaction dbus <-> logind
# Need DBus 1.11.10 for a fix, but CentOS 7.6 is stuck on dbus 1.10.24.
#
name: Restart systemd-logind
minute: "*/10"
user: root
job: /bin/systemctl restart systemd-logind
cron_file: restart_logind
become: true

- name: Mount the ceph volume
hosts:
- compute-vm
- sys-admin-interface
- deploy-admin-interface
become: True
roles:
- mount-volume

- name: Install roles needed for all virtual cluster components except jumphosts.
hosts: cluster
@@ -14,6 +46,8 @@
- ldap
- node_exporter
- cluster
- resolver
- shared_storage

- name: Install ansible on admin interfaces (DAI & SAI).
hosts:
@@ -46,18 +80,21 @@
- hosts: slurm
become: true
roles:
- slurm
- prom_server
- cadvisor
- slurm
vars:
# These variables are needed by the mariadb role.
# Which is a dependency of the slurm role.
# See roles/slurm/meta/main.yml
hostname_node0: "{{ ansible_hostname }}"
ip_node0: "{{ ansible_default_ipv4['address'] }}"

- name: Install virtual compute nodes
hosts: compute-vm
become: true
tasks:
roles:
- compute-vm
- isilon
- datahandling
- slurm-client

- name: Install User Interface (UI)
@@ -66,19 +103,5 @@
tasks:
roles:
- slurm_exporter
- user-interface
- datahandling
- isilon
- slurm-client

- name: Export /home on NFS server.
hosts: user-interface:&talos-cluster
roles:
- nfs_home_server

- name: Mount /home on NFS clients.
hosts: compute-vm&talos-cluster
roles:
- nfs_home_client

- import_playbook: users.yml
...
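The sanity check at the top of cluster.yml compares `ansible_version.full` against `2.4` as versions, not as plain strings; as strings, "2.10" would sort before "2.4". The sketch below illustrates that idea in plain Python. It is not Ansible's implementation of `version_compare`, and `version_tuple` and `meets_minimum` are hypothetical names used only for illustration.

```python
import re


def version_tuple(text):
    """Return the leading dotted numeric components as a tuple of ints."""
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError("no numeric version in %r" % text)
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(current, minimum="2.4"):
    """True when 'current' is at least 'minimum', compared component-wise."""
    return version_tuple(current) >= version_tuple(minimum)


for candidate in ("2.3.2.0", "2.4.0", "2.10.1"):
    print(candidate, meets_minimum(candidate))
```

Comparing tuples of integers is what makes `2.10.1` count as newer than `2.4`, which a string comparison would get wrong.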
2 changes: 1 addition & 1 deletion documentation/Gearshift_technical_design.md
@@ -103,7 +103,7 @@ Figure 4. Network design for gs-compute[0-9] node

Figure 5. Network design for gs-vcompute[0-9] virtual compute node

![](./media/media/image6.jpg)
![](./media/media/image6a.jpg)

### Compute cluster design

Binary file added documentation/media/media/image6a.jpg
44 changes: 0 additions & 44 deletions gearshift_hosts.ini
@@ -1,43 +1,3 @@
[databases]
gs-openstack
gs-compute10
gs-compute11

[keystone]
gs-openstack

[glance-controller]
gs-openstack

[heat]
gs-openstack

[horizon]
gs-openstack

[rabbitmq]
gs-openstack
gs-compute10
gs-compute11

[memcached]
gs-openstack

[neutron-controller]
gs-openstack physical_interface_mappings=provider:enp130s0f0

[nova-controller]
gs-openstack

[cinder-controller]
gs-openstack

[cinder-storage]
gs-compute[01:11] storage_volume=/dev/sdb

[nova-compute]
gs-compute[01:11] physical_interface_mappings=provider:enp130s0f0

[jumphost]
airlock

@@ -68,7 +28,3 @@ administration
[gearshift-cluster:children]
cluster
jumphost

[metal]
gs-openstack
gs-compute[01:11]
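The inventory entries removed above use Ansible's numeric host range syntax, for example `gs-compute[01:11]`, which stands for an inclusive, zero-padded sequence of host names. The following sketch shows roughly how such a pattern expands. It is an illustration under simplifying assumptions (a single numeric range per pattern, no step value), not Ansible's own parser, and `expand_range` is a hypothetical name.

```python
import re


def expand_range(pattern):
    """Expand one '[start:end]' numeric range into a list of host names.

    The range is inclusive and numbers are padded to the width of 'start'.
    """
    match = re.search(r"\[(\d+):(\d+)\]", pattern)
    if match is None:
        return [pattern]
    start, end = match.group(1), match.group(2)
    width = len(start)
    prefix, suffix = pattern[:match.start()], pattern[match.end():]
    return [prefix + str(number).zfill(width) + suffix
            for number in range(int(start), int(end) + 1)]


hosts = expand_range("gs-compute[01:11]")
print(len(hosts), hosts[0], hosts[-1])
```

So each of the removed `[cinder-storage]`, `[nova-compute]` and `[metal]` entries with this pattern covered eleven physical compute nodes.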
9 changes: 0 additions & 9 deletions glance-controller.yml

This file was deleted.

3 changes: 3 additions & 0 deletions group_vars/administration.yml
@@ -0,0 +1,3 @@
---

volume_mount_point: "/apps"