Observability - GregLinthicum/From-Logistic-Regression-to-Long-short-term-memory-RNN GitHub Wiki

CloudWatch Container Insights

X-Ray with CloudWatch

Grafana Grafana's design for caters to analyzing and visualizing metrics such as system CPU, memory, disk and I/O utilization. ... Kibana, on the other hand, runs on top of Elasticsearch and is used primarily for analyzing log messages. Free.

Splunk up to $150 per ingested GB (can get data from Kinesis Firehose)

Container Observability

AWS:

  1. You create an IAM policy to allow your container instances to use the CloudWatch Logs APIs.

  2. Then you must attach that policy to ecsInstanceRole.

  3. Then, you install the CloudWatch agent on your container instances.

    a. On the Fargate launch type, you view the logs from your containers.

    b. On the EC2 launch type, you get different logs from your containers in one convenient location.

         b. 1. If you are not using the Amazon ECS-optimized AMI, you will have to edit the /etc/ecs/ecs.config file and add ECS_AVAILABLE_LOGGING_DRIVERS=["json-file","awslogs"] to specify that the awslogs logging driver is available on the container instance.
    

Above Kubernetes

Helm: Helm is a package manager for Kubernetes that helps users define, install, and manage Kubernetes applications and their dependencies using "charts" - which are packages of pre-configured Kubernetes resources. Helm by Nana

Istio: Istio is a service mesh platform that provides capabilities for traffic management, security, observability, and policy enforcement for microservices running on Kubernetes. Istio by Nana

Knative: Knative is a platform for building, deploying, and** managing serverless workloads **on Kubernetes. It provides abstractions for autoscaling, eventing, and building containerized applications without needing to manage the underlying infrastructure directly.

Prometheus: Prometheus is a** monitoring and alerting** toolkit designed for Kubernetes and other containerized environments. It collects metrics from applications and infrastructure, stores them in a time-series database, and provides querying and visualization capabilities.

KubeFlow: KubeFlow is an open-source platform for machine learning (ML) workflows on Kubernetes. It provides components and abstractions for building, deploying, and managing ML models and pipelines at scale.

Rancher: Rancher is a complete Kubernetes management platform that provides features for cluster provisioning, monitoring, alerting, logging, and multi-cluster management.