Defining "Healthy" - lydia-wu/cadence GitHub Wiki

Yukihiro, Hayley; 2021-09-03 Friday

Vision

Purpose of task: to introduce/familiarize with industry terms to describe the "health" of units

Explore the following inquiries:

Inquiry Thought process
What other YouTube links can you point the team toward to better understand data? drop links in the Toby extension, and drop a 1-3 sentence summary or bullet list of highlights
What language/dictionary can we use/create to characterize different units? "Dead", "Beating", "Intermittent"? Are these words sufficiently descriptive? What other words might we consider?

We will whiteboard this subject on Friday to discuss previous experience with classifying the "health" of units, along with research insight gained from this task.

Summary of Research 9/3:

  • Anomaly detection (aka outlier analysis) is a step in data mining that identifies data points, events, and/or observations that deviate from a dataset’s **normal behavior **
  • key performance indicators (KPIs) that evaluate the success of the organization
    • KPIs can determine the expected or normal behavior of the data
  • The absence of change can be an anomaly if it breaks a pattern that is normal for the data from that particular metric.
    • what are the insights for our data
    • we determine the normal metric
    • the metric can be either frequency (black Friday) or a specific value
  • Anomalies aren’t categorically good or bad, they’re just deviations from the expected value for a metric at a given point in time.
  • Time series data contains information necessary for making educated guesses about what can be reasonably **expected **in the future
    • Like blueprinting in cybersecurity, we need to become familiar with the data to determine what is normal → then we determine what is healthy data
    • Time series data anomaly detection must first create a baseline for normal behavior in primary KPIs
  • Understanding the types of outliers that an anomaly detection system can identify is essential to getting the most value from generated insights.
  • By discovering **normal tendency **or dealing out of data usually we find information but infrequent occurrence or data item sometimes may provide details which is very useful to us.
  • Anomaly detection is a technique used to identify unusual patterns that do not conform to expected behavior, called outliers.