Setup Runbook - NETESOLUTIONS/ERNIE GitHub Wiki

Table of Contents Azure Security Policy Assignment Linux VM Create Azure VM Set up system Customize Set up log rotation Script Tools Interactive Tools Linux SDKs Configure authentication, groups and users (Neo4j Server) SSH for System Accounts Add storage Configure swap Disable Linux Firewall Set up Backup Azure Monitor ClamAV Postgres Installation Default client parameters Tablespaces (Neo4j Server) Neo4j Python 3 C++ 14+ HipMCL Jenkins Jenkins jobs Upsource Slack Integrations monit HDInsight Cluster VM - Optional restricted access Create Subnet

Azure

Create the ERNIE Azure Resource Group
Create the ERNIE-LRS Recovery Service Vault
- Properties > Backup Configuration > Update > Storage replication type = Locally-redundant
- Create the backup policy
Create the ernie1-nsg Azure Network Security Group
Open port 22 only on the Azure firewall in the Network Security Group settings

Security Policy Assignment

Security Center > Security policy > {subscription} > View effective policy > {policy assignment} > Parameters >

This requires subscription owner privileges

Disk encryption should be applied on virtual machines = Disabled

Linux VM

Create Azure VM

Login to Azure under NETE Azure Pay-As-You-Go subscription
Add a VM:
- Name = ernie-{purpose}
Basic
- Region = East US 2
- Image = CIS CentOS Linux 7.5
- Select an appropriate server size
Disks
- Add an appropriate number of premium storage disks
Networking
- Virtual network = ERNIE-vnet
- Public IP = new
- NSG = Advanced > ernie1-nsg
- Accelerated networking = off
Management
- OS guest diagnostics = on
Tags
- Add vm = {VM name}
Create
Configure public DNS

Set up system

Customize

## Update OMI to 1.4.2-3+ ##
sudo rpm -Uvh https://packages.microsoft.com/config/rhel/7/packages-microsoft-prod.rpm
sudo yum update -y omi
sudo rm -rf /home/omi /var/spool/mail/omi

## Azure CLI ##
sudo rpm --import https://packages.microsoft.com/keys/microsoft.asc
{ cat <<'HEREDOC'
[azure-cli]
name=Azure CLI
baseurl=https://packages.microsoft.com/yumrepos/azure-cli
enabled=1
gpgcheck=1
gpgkey=https://packages.microsoft.com/keys/microsoft.asc
HEREDOC
} | sudo tee /etc/yum.repos.d/azure-cli.repo
sudo chmod a+r /etc/yum.repos.d/*
yum check-update
sudo yum install -y azure-cli

# Add the EPEL repo
sudo yum install -y epel-release

## Add the elrepo ##
# Latest Linux kernel updates

sudo rpm --import https://www.elrepo.org/RPM-GPG-KEY-elrepo.org
sudo rpm -Uvh http://www.elrepo.org/elrepo-release-7.0-3.el7.elrepo.noarch.rpm

## Add the Open Fusion repo ##
# Get GNU Parallel updates beyond v20160222, which is a very old version 

sudo rpm --import http://repo.openfusion.net/RPM-GPG-KEY-openfusion

{ cat <<'HEREDOC'
[OpenFusion]
name=Open Fusion
baseurl=http://repo.openfusion.net/centos7-x86_64
enabled=1
gpgcheck=1
HEREDOC
} | sudo tee /etc/yum.repos.d/OpenFusion.repo


# Add a Midnight Commander CentOS 7 binary repo
sudo wget http://download.opensuse.org/repositories/home:/laurentwandrebeck:/mc/CentOS_7/home:laurentwandrebeck:mc.repo -O /etc/yum.repos.d/home_laurentwandrebeck_mc.repo

References:

Set up log rotation

Upload logrotateconfiguration from the repo to e.g. ~/Workspaces/ERNIE/Config/storage/etc, then:

sudo cp -Rv ~/Workspaces/ERNIE/Config/storage/etc /

# Fix SELinux context user for some log files
sudo chcon -u system_u /var/log/*

Script Tools

sudo yum install -y parallel
PCRE grep: sudo yum install -y pcre-tools
jq: sudo yum install -y jq
lftp, an FTP client: sudo yum install -y lftp
7-Zip: sudo yum install -y p7zip

Interactive Tools

Midnight Commander, an Orthodox File Manager (OFM): sudo yum install -y mc
sudo yum install -y nano
- Some people don't use emacs nor vi
glances, advanced top-like server resource stats: sudo yum install -y glances
sudo yum install -y qrencode: QR code generation, e.g. for Google Authenticator

Linux SDKs

E.g. packages required for compiling monit sources:

libtool: sudo yum install -y libtool
PAM development support: sudo yum install -y pam-devel
SSL header files: sudo yum install -y openssl-devel

Configure authentication, groups and users

Increase sudo timeout: set sudo sed --in-place --regexp-extended 's/(Defaults.*env_reset).*/\1,timestamp_timeout=60/' /etc/sudoers
Create the core team group: sudo groupadd erniecore
[] TBD. Create end user group: sudo groupadd ernieusers
Create core team Linux users and add ernie_admin and core team users to the erniecore (as the primary group) and wheel.
Configure PAM
- sudo yum install -y google-authenticator
- Upload and copy PAM config files to /etc/pam.d/
Configure system banner: upload and copy issue.net file to /etc/
Configure SSH: upload and copy sshd_config file to /etc/ssh/

(Neo4j Server) SSH for System Accounts

Set up SSH for System Accounts

Add storage

For each additional disk:

Create a new VM drive in Azure Portal.
On the machine: partition, format and mount the drive.
- If throughput > 200MB/s or IOPS > 1000 (e.g. for Premium HDD storage, 1 TB), use xfs. Otherwise, (e.g. for Standard HDD or SDD storage, 1 TB) use ext4 file system. See How to Choose Your Red Hat Enterprise Linux File System for details.
Add the disk UUID to /etc/fstab. For example, add the following line:

UUID=43204a4e-48b4-4c44-8db2-bc411fe10da4 /data1 xfs defaults,nofail 1 2

Reference: Add a disk to a Linux VM

Configure swap

Add a 100G swap file

Disable Linux Firewall

[] TBD PAR-496 Evaluate a need in firewalling
Azure can do firewalling via Azure NSG so we don't need firewalld nor iptables firewalls. The hardening script enables iptables/ip6tables, but doesn't do anything with firewalld.
Stop and disable Linux firewalld service:

sudo systemctl stop firewalld
sudo systemctl disable firewalld

If the project decides to enforce tunneling, Azure firewall (Azure dashboard > Network security group) should be used.

Set up Backup

Azure Dashboard > Virtual Machines > {server} > Backup >

Recovery Services vault > Select existing = ERNIE-LRS
Choose backup policy = Maximum-9-points
Enable Backup

Azure Monitor

Azure Monitor setup as documented did not work: the monitor was enabled, but no data is being recorded.
Linux Diagnostic Extension 3.0 setup as documented failed with a Python syntax error.

ClamAV

sudo yum install -y epel-release
sudo yum install -y clamav-server clamav-data clamav-update clamav-filesystem clamav clamav-scanner-systemd clamav-devel clamav-lib clamav-server-systemd
sudo setsebool -P antivirus_can_scan_system 1
sudo setsebool -P clamd_use_jit 1
sudo sed -i -e "s/^Example/#Example/" /etc/clamd.d/scan.conf
sudo cp /etc/clamd.d/scan.conf /etc/clamd.d/scan.conf.backup
sudo sed -i -e "s/#LocalSocket /LocalSocket /" /etc/clamd.d/scan.conf
sudo cp /etc/freshclam.conf /etc/freshclam.conf.backup
sudo sed -i -e "s/^Example/#Example/" /etc/freshclam.conf
sudo freshclam
sudo bash -c "cat >/usr/lib/systemd/system/freshclam.service <<EOF
[Unit]
Description = freshclam scanner
After = network.target
[Service]
Type = forking
ExecStart = /usr/bin/freshclam -d -c 2
Restart = on-failure
PrivateTmp = true
[Install]
WantedBy=multi-user.target
EOF
"
sudo systemctl start freshclam
sudo systemctl enable freshclam
sudo systemctl start clamd@scan
sudo systemctl enable clamd@scan

This sets up:

ClamAV services
Periodic DB updates
- [] TODO. Check on that. There were root emails with warning messages.
Disabled on-access scan

To scan files periodically, create a Jenkins jobs, running sudo clamscan -i -r /home /erniedev_data1. This jobs would fail with exit code 1 on any infected files found. Running it under root should ensure no access errors, which trigger exit code 2.

For more info, see How to Install ClamAV on CentOS 7.

Postgres

Installation

Install Postgres per the recipes
Configure Postgres per the recipes
Set up user access
Add postgres user to the erniecore group: sudo usermod -a -G erniecore postgres and restart Postgres

Default client parameters

/etc/profile.d/postgres_defaults.sh:

export PGDATA=/var/lib/pgsql/11/data
export PGDATABASE=ernie

This makes scripts (which mostly connect to Postgres on the same server) less verbose and more portable between systems. It'd also help a lot if connection parameters ever need to change.
Users can override these defaults via the command line, particularly to connect under their own accounts, e.g psql -U dk.

For Jenkins-executed local scripts these could be set on the fly in Manage Jenkins > Configure System > Global properties > Environment variables.

Tablespaces

Allocate hard drive space and create tablespaces. See Postgres Server Performance Tuning. Make sure that the Postgres service user which was set up above (postgres) can read and write to the parent and the actual tablespace directories:

{module}_tbs per each module
p2_studies_tbs, theta_plus_tbs, sb_plus_tbs, tri_citations_tbs for large case study tables
index_tbs for all indexes
temp_tbs for the Postgres temp_tablespace and for all staging tables
user_tbs for the Postgres default_tablespace and for non-public (user) objects
ernie1_museum_tbs for the data moved from ERNIE1

(Neo4j Server) Neo4j

Install Java 11
Install Neo4j per the Neo4j recipes
Add neo4j user to the core group: sudo usermod -a -G erniecore neo4j

Python 3

Install Anaconda3 distribution
1. Navigate to the link for the most recent version, then download it on the server, e.g. wget https://repo.continuum.io/archive/Anaconda3-2018.12-Linux-x86_64.sh
2. chmod ug+x Anaconda3*.sh
3. sudo ./Anaconda3*.sh
4. Accept the license agreement
5. Enter the following location: /anaconda3
6. Enter defaults in other prompts, no to install Visual Studio Code and finish installation
Set up environment:
1. sudo alternatives --install /usr/local/bin/python python /usr/bin/python2.7 1
2. sudo alternatives --install /usr/local/bin/python python /anaconda3/bin/python 2
Install modules:
- sudo /anaconda3/bin/pip install psycopg2
- sudo /anaconda3/bin/pip install pandas
- sudo /anaconda3/bin/pip install tzlocal
- sudo /anaconda3/bin/pip install lxml
- sudo /anaconda3/bin/pip install inflect
- sudo /anaconda3/bin/pip install graphene_sqlalchemy
- sudo /anaconda3/bin/pip install Flask-GraphQL
Grant permissions to all users: sudo chmod o+rx -R /anaconda3
1. TBD [] Figure out what permissions are exactly needed for Anaconda and installed packages to be executable by all users.